Alessandro Achille, a senior applied scientist at Amazon Web Services, is seen standing outside at night with a display of colored lights in the background
Alessandro Achille, a senior applied scientist at Amazon Web Services, is tackling fundamental challenges that are shaping the future of computer vision and large generative-AI models.

“I don't remember a time in my life when I wasn't interested in science"

From the urgent challenge of "machine unlearning" to overcoming the problem of critical learning periods in deep neural networks, Alessandro Achille is tackling fundamental issues on behalf of Amazon customers.

It was on a “hunting trip” to Italy in 2015 that computer vision pioneer Stefano Soatto first came across Alessandro Achille. More accurately, it was a mind-hunting trip, to the prestigious Scuola Normale Superiore in Pisa. The university was founded by Napoleon, and its alumni include Nobel-Prize-winning physicists Enrico Fermi and Carlo Rubbia and Field-Medal-winning mathematician Alessio Figalli. “It puts students through a grueling selection and training process,” says Soatto, “so those who survive are usually highly capable — and rugged.”

It was a successful trip that evolved into a powerful research partnership. Today, Achille is working as a senior applied scientist at Amazon Web Services' (AWS') AI Lab, on the California Institute of Technology (Caltech) campus, tackling fundamental challenges that are shaping the future of computer vision (CV) and large generative-AI models.

But back in 2015, Achille was immersed in a master’s in pure mathematics, “spiced up”, as he puts it, with algebraic topology.

Related content
Early on, Giovanni Paolini knew little about machine learning — now he’s leading new science on artificial intelligence that could inform AWS products.

“I don't remember a time in my life when I wasn't interested in science,” he says. Achille was particularly interested in the foundations of mathematics. “I focused on logic, because I’ve always had this nagging problem at the back of my mind of exactly why things are the way they are in mathematics.”

Achille’s first taste of computer vision arose when he and his peers decided to augment an annual school tradition: a 24-hour foosball tournament between mathematicians and physicists. Besides a sport competition, the event had become a showcase of the students’ engineering capabilities. That year, after adding live streaming and a fully automated scorekeeping system, the students thought it was time to add real-time tracking of the ball.

“It’s just a white blob moving on a green background. How hard could it be?” says Achille. The short answer is, harder than they thought. So Achille took a class that would teach him more — a choice that would eventually lead to an invitation from Soatto to join him at the University of California, Los Angeles, for a PhD in computer vision.

“In Italian education, it sometimes feels like there is a hierarchy,” says Achille. “The more abstract you are, the better you are doing!” So why the departure from pure mathematics? In the end, says Soatto, “Alessandro’s work became so abstract he couldn’t see a path to impact. That’s very frustrating for a really smart person who wants to make a difference in the world.”

Deep learning takes off

Achille’s PhD coincided with the rise of deep learning (DL), which would become a game-changing technology in machine learning and computer vision. “At the time, we didn't know if it was anything more than just a new, slightly more powerful tool. We didn’t know if DL had the power of abstraction, reasoning, and so on,” says Achille.

Related content
Two recent trends in the theory of deep learning are examinations of the double-descent phenomenon and more-realistic approaches to neural kernel methods.

The power of deep learning was becoming clear, though. During an internship in 2017, Achille worked on a computer vision model that could learn a representation of a dynamic scene — a 3-D shape that was moving, changing color, changing orientation, and so on.

The idea was to capture and isolate the semantic components of the scene — shape, size, color, or angle of rotation — rather than capturing the totality of the scene’s characteristics. Humans do this disentangling naturally. That’s how you would understand the sight of a blue banana, even if you had never seen one before: “banana” and “blue” are separate semantic components.

While Achille enjoyed the project and appreciated its importance, he was struck by the artificiality of the setting. “I was not working backwards from a use case,” he says. Shortly after, Achille became an intern at the AWS AI Lab that had just been established at the Caltech campus, where he was immediately given a real-world challenge to solve on a newly launched product called Custom Label.

Real-world problems

At the time, Custom Label allowed Amazon customers to access CV models that could be trained to identify, say, their company’s products in images — a particular faucet, for example. The models could also be trained to perform tasks like identifying something in a video or analyzing a satellite image.

AWS researchers realized it was impractical to expect a single model to accurately deal with such a range of esoteric image possibilities. A better approach was to pretrain many expert models on different imagery domains and then select the most appropriate one to fine-tune on the customer’s data. The problem for AWS was, how could it efficiently discover which of 100 or more pretrained CV models would perform best?

Alessandro Achille: The information in a deep neural network

During his research in machine learning, Achille became passionate about information theory — a mathematical framework for quantifying, storing, and communicating information. So he used that approach on this so-called model selection problem. “For a hammer, everything looks like a nail,” he laughs.

The problem is how to measure the “distance" between two learning tasks — the task a given AWS model has been pretrained on and the novel customer task. In other words, how much additional information is required by the pretrained model to produce a good performance on the customer task? The less additional information required, the better.

Achille was impressed by the task because it was an important customer issue with a fundamental mathematical problem behind it. “We formulated an algorithm to compute this efficiently, so we could easily select the expert model best suited to solving the customer’s task,” says Achille. “It was the first solution to this problem.”

Achille found Amazon’s applied approach to be a compelling way to work, and when Soatto established the AWS AI Labs, Achille was happy to join him there.

“One of the beauties of being at Amazon is that we’re tackling some of the world's most challenging emerging problems,” says Soatto. “Because when AWS customers have difficult problems to address, they come to us. From a scientific perspective, this is a goldmine.”

Machine unlearning

Achille is currently staking out a vein of research gold in a critical new area of artificial intelligence (AI): AI model disgorgement, more popularly known as "machine unlearning". It is critical in any implementation of machine learning models that the data used to train the model are used responsibly, in a privacy-preserving manner, and in accordance with the appropriate regulations and intellectual-property rights.

Related content
At this year’s ACL, Amazon researchers won an outstanding-paper award for showing that knowledge distillation using contrastive decoding in the teacher model and counterfactual reasoning in the student model improves the consistency of “chain of thought” reasoning.

Modern ML models have become very large and complex, requiring a great deal of data and computational resources to train. But what if, once a model is trained, the contributor of some of those training data decides, or is obligated by law, to withdraw the data from the model? Or what if some of the training data is discovered to be biased? Retraining a large model afresh, with some data withheld, may be impractical, particularly if the requirement for such changes becomes commonplace in the shifting legal landscape.

The next level

In 2019 that Soatto, Achille, and Achille's fellow UCLA PhD student Aditya Golatkar published a paper entitled “Eternal Sunshine of the Spotless Net: Selective Forgetting in Deep Networks”; the paper established a novel method for removing the effects of a subset of a deep neural network's training data, without requiring retraining.

Eternal sunshine of the spotless net: Selective forgetting in deep networks

“I was happy to see interest in ‘selective forgetting’ explode after we published this paper,” says Achille. “Model disgorgement is a fascinating problem, and not only because it's very important for AWS customers. It also demands that we understand everything about a model’s neural network. We need to understand where information is held in a model’s weights, how it is encoded, how it is measured.”

It is in this fundamental work that Achille took the field to “the next level”, says Soatto. And this year, Achille and Soatto, on a team also featuring Amazon Scholar Michael Kearns, coauthor of the book The Ethical Algorithm, led the field by introducing a taxonomy of possible disgorgement methods applicable to modern ML systems.

The paper also describes ways to train future models so that they are amenable to subsequent disgorgement.

Related content
The surprising dynamics related to learning that are common to artificial and biological systems.

“It is better for models to learn in a compartmentalized fashion, so in the event that some data is found to be problematic, everything that touched those data gets thrown away, while the rest of the model survives without having to retrain it from scratch,” says Soatto.

This work has been particularly satisfying, says Achille, as it obliged computer scientists, mathematicians, lawyers, and policymakers to work closely together to solve a pressing modern problem.

Critical learning periods

The breadth of Achille’s interests is formidable. His other prominent research includes work on “critical learning periods” in the training of deep networks. The work arose through serendipity, after a friend studying for a medical exam on the profound effect of critical learning periods in humans jokingly asked Achille if his networks also had them. Interest piqued, Achille explored the idea, and found some striking similarities.

Related content
Technique that mixes public and private training data can meet differential-privacy criteria while cutting error increase by 60%-70%.

For example, take infantile strabismus, a condition in which a person's eyes do not align properly from birth or early infancy. If not treated early, the condition can cause amblyopia, whereby the brain learns to trust the properly working eye and to ignore the visual input from the misaligned eye, to avoid double vision.

This one-sided competition between the two eyes (data sources) leads to worsening vision in the misaligned eye and of course the loss of stereo vision, which is important for depth perception. Amblyopia is difficult to reverse if left untreated into adulthood. But treating the eyes early, enabling them to work together optimally, makes for a robust vision system.

Similarly, in the early training of multimodal deep neural networks, one type of data may become favored over another, simply through expediency. For example, in a visual-question-answering model, which is trained on images and captions, the easy-to-use textual information may outcompete visual information, leading to models that are effectively blind to visual information. Achille and his colleagues suggest that when a DL model takes such shortcuts, it has irreversible effects on the subsequent performance of the model, making it less flexible — and therefore less useful — when fine-tuned on novel data.

Off the charts

Having explored the causes of critical learning periods in deep networks, the team offered new techniques for stabilizing the early learning dynamics in model training and showed how this approach can actually prevent critical periods in deep networks. The practical benefits of this research aside, Achille enjoys exploring the parallelisms of artificial and biological systems.

“Look, we can all recognize that the actual hardware of a network and a brain are completely different, but can we also recognize that they are both systems that are trying to process information efficiently and trying to learn something?” he asks. Are there some fundamental dynamics of learning, and how it relates to the acquisition of information, that are shared between synthetic and biological systems? Watch this space.

Looking back on the eight years since his hunting trip to Pisa, Soatto considers what he most appreciates about his Amazon colleague.

“First, the brilliance of the way Alessandro frames problems: he thinks very abstractly, yet he is also a hacker who thinks broadly, all the way from mathematics to neuroscience, from art to engineering — this is very rare. Second, his curiosity, which is absolutely off the charts.”

For Achille’s part, when asked if he prefers tackling the challenges that arise from AWS products or working on fundamental science problems, he demurs. “I don’t need to split my time between product and fundamental research. For me, it ends up being the same thing.”

Indeed, one of Amazon’s most abstract thinkers has found a path to true impact.

Research areas

Related content

IN, HR, Gurugram
Our customers have immense faith in our ability to deliver packages timely and as expected. A well planned network seamlessly scales to handle millions of package movements a day. It has monitoring mechanisms that detect failures before they even happen (such as predicting network congestion, operations breakdown), and perform proactive corrective actions. When failures do happen, it has inbuilt redundancies to mitigate impact (such as determine other routes or service providers that can handle the extra load), and avoids relying on single points of failure (service provider, node, or arc). Finally, it is cost optimal, so that customers can be passed the benefit from an efficiently set up network. Amazon Shipping is hiring Applied Scientists to help improve our ability to plan and execute package movements. As an Applied Scientist in Amazon Shipping, you will work on multiple challenging machine learning problems spread across a wide spectrum of business problems. You will build ML models to help our transportation cost auditing platforms effectively audit off-manifest (discrepancies between planned and actual shipping cost). You will build models to improve the quality of financial and planning data by accurately predicting ship cost at a package level. Your models will help forecast the packages required to be pick from shipper warehouses to reduce First Mile shipping cost. Using signals from within the transportation network (such as network load, and velocity of movements derived from package scan events) and outside (such as weather signals), you will build models that predict delivery delay for every package. These models will help improve buyer experience by triggering early corrective actions, and generating proactive customer notifications. Your role will require you to demonstrate Think Big and Invent and Simplify, by refining and translating Transportation domain-related business problems into one or more Machine Learning problems. You will use techniques from a wide array of machine learning paradigms, such as supervised, unsupervised, semi-supervised and reinforcement learning. Your model choices will include, but not be limited to, linear/logistic models, tree based models, deep learning models, ensemble models, and Q-learning models. You will use techniques such as LIME and SHAP to make your models interpretable for your customers. You will employ a family of reusable modelling solutions to ensure that your ML solution scales across multiple regions (such as North America, Europe, Asia) and package movement types (such as small parcel movements and truck movements). You will partner with Applied Scientists and Research Scientists from other teams in US and India working on related business domains. Your models are expected to be of production quality, and will be directly used in production services. You will work as part of a diverse data science and engineering team comprising of other Applied Scientists, Software Development Engineers and Business Intelligence Engineers. You will participate in the Amazon ML community by authoring scientific papers and submitting them to Machine Learning conferences. You will mentor Applied Scientists and Software Development Engineers having a strong interest in ML. You will also be called upon to provide ML consultation outside your team for other problem statements. If you are excited by this charter, come join us!
US, MA, Boston
The Artificial General Intelligence (AGI) team is looking for a passionate, talented, and inventive Senior Applied Scientist with a strong deep learning background, to build industry-leading technology with Large Language Models (LLMs) and multimodal systems. Key job responsibilities As a Senior Applied Scientist with the AGI team, you will work with talented peers to lead the development of novel algorithms and modeling techniques, to advance the state of the art with LLMs. Your work will directly impact our customers in the form of products and services that make use of speech and language technology. You will leverage Amazon’s heterogeneous data sources and large-scale computing resources to accelerate advances in generative artificial intelligence (GenAI). About the team The AGI team has a mission to push the envelope in LLMs and multimodal systems, in order to provide the best-possible experience for our customers.
IN, KA, Bengaluru
The Amazon Alexa AI team in India is seeking a talented, self-driven Applied Scientist to work on prototyping, optimizing, and deploying ML algorithms within the realm of Generative AI. Key responsibilities include: - Research, experiment and build Proof Of Concepts advancing the state of the art in AI & ML for GenAI. - Collaborate with cross-functional teams to architect and execute technically rigorous AI projects. - Thrive in dynamic environments, adapting quickly to evolving technical requirements and deadlines. - Engage in effective technical communication (written & spoken) with coordination across teams. - Conduct thorough documentation of algorithms, methodologies, and findings for transparency and reproducibility. - Publish research papers in internal and external venues of repute - Support on-call activities for critical issues Basic Qualifications: - Master’s or PhD in computer science, statistics or a related field - 2-7 years experience in deep learning, machine learning, and data science. - Proficiency in coding and software development, with a strong focus on machine learning frameworks. - Experience in Python, or another language; command line usage; familiarity with Linux and AWS ecosystems. - Understanding of relevant statistical measures such as confidence intervals, significance of error measurements, development and evaluation data sets, etc. - Excellent communication skills (written & spoken) and ability to collaborate effectively in a distributed, cross-functional team setting. - Papers published in AI/ML venues of repute Preferred Qualifications: - Track record of diving into data to discover hidden patterns and conducting error/deviation analysis - Ability to develop experimental and analytic plans for data modeling processes, use of strong baselines, ability to accurately determine cause and effect relations - The motivation to achieve results in a fast-paced environment. - Exceptional level of organization and strong attention to detail - Comfortable working in a fast paced, highly collaborative, dynamic work environment
GB, London
Are you looking to work at the forefront of Machine Learning and AI? Would you be excited to apply cutting edge Generative AI algorithms to solve real world problems with significant impact? The AWS Industries Team at AWS helps AWS customers implement Generative AI solutions and realize transformational business opportunities for AWS customers in the most strategic industry verticals. This is a team of data scientists, engineers, and architects working step-by-step with customers to build bespoke solutions that harness the power of generative AI. The team helps customers imagine and scope the use cases that will create the greatest value for their businesses, select and train and fine tune the right models, define paths to navigate technical or business challenges, develop proof-of-concepts, and build applications to launch these solutions at scale. The AWS Industries team provides guidance and implements best practices for applying generative AI responsibly and cost efficiently. You will work directly with customers and innovate in a fast-paced organization that contributes to game-changing projects and technologies. You will design and run experiments, research new algorithms, and find new ways of optimizing risk, profitability, and customer experience. In this Data Scientist role you will be capable of using GenAI and other techniques to design, evangelize, and implement and scale cutting-edge solutions for never-before-solved problems. Key job responsibilities - Collaborate with AI/ML scientists, engineers, and architects to research, design, develop, and evaluate cutting-edge generative AI algorithms and build ML systems to address real-world challenges - Interact with customers directly to understand the business problem, help and aid them in implementation of generative AI solutions, deliver briefing and deep dive sessions to customers and guide customer on adoption patterns and paths to production - Create and deliver best practice recommendations, tutorials, blog posts, publications, sample code, and presentations adapted to technical, business, and executive stakeholder - Provide customer and market feedback to Product and Engineering teams to help define product direction About the team Diverse Experiences Amazon values diverse experiences. Even if you do not meet all of the preferred qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying. Why AWS Amazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating — that’s why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses. Work/Life Balance We value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why we strive for flexibility as part of our working culture. When we feel supported in the workplace and at home, there’s nothing we can’t achieve in the cloud. Inclusive Team Culture Here at AWS, it’s in our nature to learn and be curious. Our employee-led affinity groups foster a culture of inclusion that empower us to be proud of our differences. Ongoing events and learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon (gender diversity) conferences, inspire us to never stop embracing our uniqueness. Mentorship and Career Growth We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.
US, WA, Seattle
The Seller Fees organization drives the monetization infrastructure powering Amazon's global marketplace, processing billions of transactions for over two million active third-party sellers worldwide. Our team owns the complete technical stack and strategic vision for fee computation systems, leveraging advanced machine learning to optimize seller experiences and maintain fee integrity at unprecedented scale. We're seeking an exceptional Applied Scientist to push the boundaries of large-scale ML systems in a business-critical domain. This role presents unique opportunities to • Architect and deploy state-of-the-art transformer-based models for fee classification and anomaly detection across hundreds of millions of products • Pioneer novel applications of multimodal LLMs to analyze product attributes, images, and seller metadata for intelligent fee determination • Build production-scale generative AI systems for fee integrity and seller communications • Advance the field of ML through novel research in high-stakes, large-scale transaction processing • Develop SOTA causal inference frameworks integrated with deep learning to understand fee impacts and optimize seller outcomes • Collaborate with world-class scientists and engineers to solve complex problems at the intersection of deep learning, economics, and large business systems. If you're passionate about advancing the state-of-the-art in applied ML/AI while tackling challenging problems at global scale, we want you on our team! Key job responsibilities Responsibilities: . Design measurable and scalable science solutions that can be adopted across stores worldwide with different languages, policy and requirements. · Integrate AI (both generative and symbolic) into compound agentic workflows to transform complex business systems into intelligent ones for both internal and external customers. · Develop large scale classification and prediction models using the rich features of text, image and customer interactions and state-of-the-art techniques. · Research and implement novel machine learning, statistical and econometrics approaches. · Write high quality code and implement scalable models within the production systems. · Stay up to date with relevant scientific publications. · Collaborate with business and software teams both within and outside of the fees organization.
US, WA, Seattle
The Selling Partner Experience (SPX) organization strives to make Amazon the best place for Selling Partners to do business. The SPX Science team is building an AI-powered conversational assistant to transform the Selling Partner experience. The Selling Assistant is a trusted partner and a seasoned advisor that’s always available to enable our partners to thrive in Amazon’s stores. It takes away the cognitive load of selling on Amazon by providing a single interface to handle a diverse set of selling needs. The assistant always stays by the seller's side, talks to them in their language, enables them to capitalize on opportunities, and helps them accomplish their business goals with ease. It is powered by the state-of-the-art Generative AI, going beyond a typical chatbot to provide a personalized experience to sellers running real businesses, large and small. Do you want to join an innovative team of scientists, engineers, product and program managers who use the latest Generative AI and Machine Learning technologies to help Amazon create a delightful Selling Partner experience? Do you want to build solutions to real business problems by automatically understanding and addressing sellers’ challenges, needs and opportunities? Are you excited by the prospect of contributing to one of Amazon’s most strategic Generative AI initiatives? If yes, then you may be a great fit to join the Selling Partner Experience Science team. Key job responsibilities - Use state-of-the-art Machine Learning and Generative AI techniques to create the next generation of the tools that empower Amazon's Selling Partners to succeed. - Design, develop and deploy highly innovative models to interact with Sellers and delight them with solutions. - Work closely with teams of scientists and software engineers to drive real-time model implementations and deliver novel and highly impactful features. - Establish scalable, efficient, automated processes for large scale data analyses, model benchmarking, model validation and model implementation. - Research and implement novel machine learning and statistical approaches. - Participate in strategic initiatives to employ the most recent advances in ML in a fast-paced, experimental environment. About the team Selling Partner Experience Science is a growing team of scientists, engineers and product leaders engaged in the research and development of the next generation of ML-driven technology to empower Amazon's Selling Partners to succeed. We draw from many science domains, from Natural Language Processing to Computer Vision to Optimization to Economics, to create solutions that seamlessly and automatically engage with Sellers, solve their problems, and help them grow. We are focused on building seller facing AI-powered tools using the latest science advancements to empower sellers to drive the growth of their business. We strive to radically simplify the seller experience, lowering the cognitive burden of selling on Amazon by making it easy to accomplish critical tasks such as launching new products, understanding and complying with Amazon’s policies and taking actions to grow their business.
US, WA, Seattle
Join us in the evolution of Amazon’s Seller business! The Selling Partner Growth organization is the growth and development engine for our Store. Partnering with business, product, and engineering, we catalyze SP growth with comprehensive and accurate data, unique insights, and actionable recommendations and collaborate with WW SP facing teams to drive adoption and create feedback loops. We strongly believe that any motivated SP should be able to grow their businesses and reach their full potential supported by Amazon tools and resources. We are looking for a Senior Applied Scientist to lead us to identify data-driven insight and opportunities to improve our SP growth strategy and drive new seller success. As a successful applied scientist on our talented team of scientists and engineers, you will solve complex problems to identify actionable opportunities, and collaborate with engineering, research, and business teams for future innovation. You need to have deep understanding on the business domain and have the ability to connect business with science. You are also strong in ML modeling and scientific foundation with the ability to collaborate with engineering to put models in production to answer specific business questions. You are an expert at synthesizing and communicating insights and recommendations to audiences of varying levels of technical sophistication. You will continue to contribute to the research community, by working with scientists across Amazon, as well as collaborating with academic researchers and publishing papers (www.aboutamazon.com/research). Key job responsibilities As a Sr. Applied Scientist in the team, you will: - Identify opportunities to improve SP growth and translate those opportunities into science problems via principled statistical solutions (e.g. ML, causal, RL). - Mentor and guide the applied scientists in our organization and hold us to a high standard of technical rigor and excellence in MLOps. - Design and lead roadmaps for complex science projects to help SP have a delightful selling experience while creating long term value for our shoppers. - Work with our engineering partners and draw upon your experience to meet latency and other system constraints. - Identify untapped, high-risk technical and scientific directions, and simulate new research directions that you will drive to completion and deliver. - Be responsible for communicating our science innovations to the broader internal & external scientific community.
US, CA, Sunnyvale
Our team leads the development and optimization of on-device ML models for Amazon's hardware products, including audio, vision, and multi-modal AI features. We work at the critical intersection of ML innovation and silicon design, ensuring AI capabilities can run efficiently on resource-constrained devices. Currently, we enable production ML models across multiple device families, including Echo, Ring/Blink, and other consumer devices. Our work directly impacts Amazon's customer experiences in consumer AI device market. The solutions we develop determine which AI features can be offered on-device versus requiring cloud connectivity, ultimately shaping product capabilities and customer experience across Amazon's hardware portfolio. This is a unique opportunity to shape the future of AI in consumer devices at unprecedented scale. You'll be at the forefront of developing industry-first model architectures and compression techniques that will power AI features across millions of Amazon devices worldwide. Your innovations will directly enable new AI features that enhance how customers interact with Amazon products every day. Come join our team! Key job responsibilities As a Principal Applied Scientist, you will: • Own the technical architecture and optimization strategy for ML models deployed across Amazon's device ecosystem, from existing to yet-to-be-shipped products. • Develop novel model architectures optimized for our custom silicon, establishing new methodologies for model compression and quantization. • Create an evaluation framework for model efficiency and implement multimodal optimization techniques that work across vision, language, and audio tasks. • Define technical standards for model deployment and drive research initiatives in model efficiency to guide future silicon designs. • Spend the majority of your time doing deep technical work - developing novel ML architectures, writing critical optimization code, and creating proof-of-concept implementations that demonstrate breakthrough efficiency gains. • Influence architecture decisions impacting future silicon generations, establish standards for model optimization, and mentor others in advanced ML techniques.
US, CO, Boulder
The Advertising Incrementality Measurement (AIM) team is looking for an Applied Scientist II with experience in causal inference, experimentation, and ML development to help us expand our causal modeling solutions for understanding advertising effectiveness. Our work is foundational to providing customer-facing experimentation tools, furthering internal research & development, and building out Amazon's new Multi-Touch Attribution (MTA) measurement offerings. Incrementality measurement is a lynchpin for the next generation of Amazon Advertising measurement solutions and this role will play a key role in the release and expansion of these offerings. Key job responsibilities * Partner with economists and senior team members to drive science improvements and implement technical solutions at the state-of-the-art of machine learning and econometrics * Partner with engineering and other science collaborators to design, implement, prototype, deploy, and maintain large-scale causal ML models. * Carry out in-depth research and analysis exploring advertising-related data sets, including large sets of real-world experimental data, to understand advertiser behavior, highlight model improvement opportunities, and understand shortcomings and limitations. * Define data quality standards for understanding typical behavior, capturing outliers, and detecting model performance issues. * Work with product stakeholders to help improve our ability to provide quality measurement of advertising effectiveness for our customers. About the team AIM is a cross disciplinary team of engineers, product managers, economists, data scientists, and applied scientists with a charter to build scientifically-rigorous causal inference methodologies at scale. Our job is to help customers cut through the noise of the modern advertising landscape and understand what actions, behaviors, and strategies actually have a real, measurable impact on key outcomes. The data we produce becomes the effective ground truth for advertisers and partners making decisions affecting $10s to $100s of millions in advertising spend.
US, WA, Seattle
The Measurement, Ad Tech, and Data Science (MADS) team at Amazon Ads is at the forefront of developing cutting-edge solutions that help our tens of millions of advertisers understand the value of their ad spend while prioritizing customer privacy and measurement quality. We develop cutting-edge deterministic algorithms, machine learning models, causal models, and statistical approaches to empower advertisers with insights on the effectiveness of their ads in guiding customers from awareness to purchases. Our insights help advertisers build full-funnel advertising strategies. We maximize the information we extract from incomplete traffic signals and alternative sources to capture the impact of their ad spending for both Amazon recognized and anonymous traffic. Our vision is to lead the industry in extracting and combining information from several sources to enable advertisers to optimize their return on their ad spend. As an Applied Scientist on this team, you will: - Drive end-to-end Machine Learning projects that have a high degree of ambiguity, scale, complexity. - Perform hands-on analysis and modeling of enormous data sets to develop insights that increase traffic monetization and merchandise sales, without compromising the shopper experience. - Build machine learning models, perform proof-of-concept, experiment, optimize, and deploy your models into production; work closely with software engineers to assist in productionizing your ML models. - Run A/B experiments, gather data, and perform statistical analysis. - Establish scalable, efficient, automated processes for large-scale data analysis, machine-learning model development, model validation and serving. - Research new and innovative machine learning approaches. - Recruit Applied Scientists to the team and provide mentorship. Why you will love this opportunity: Amazon is investing heavily in building a world-class advertising business. This team defines and delivers a collection of advertising products that drive discovery and sales. Our solutions generate billions in revenue and drive long-term growth for Amazon’s Retail and Marketplace businesses. We deliver billions of ad impressions, millions of clicks daily, and break fresh ground to create world-class products. We are a highly motivated, collaborative, and fun-loving team with an entrepreneurial spirit - with a broad mandate to experiment and innovate. Impact and Career Growth: You will invent new experiences and influence customer-facing shopping experiences to help suppliers grow their retail business and the auction dynamics that leverage native advertising; this is your opportunity to work within the fastest-growing businesses across all of Amazon! Define a long-term science vision for our advertising business, driven from our customers' needs, translating that direction into specific plans for research and applied scientists, as well as engineering and product teams. This role combines science leadership, organizational ability, technical strength, product focus, and business understanding. Team video https://youtu.be/zD_6Lzw8raE Key job responsibilities * Lead the development of ad measurement models and solutions that address the full spectrum of an advertiser's investment, focusing on scalable and efficient methodologies. * Collaborate closely with cross-functional teams including engineering, product management, and business teams to define and implement measurement solutions. * Use state-of-the-art scientific technologies including Generative AI, Classical Machine Learning, Causal Inference, Natural Language Processing, and Computer Vision to develop cutting-edge models that measure the impact of ad spend across multiple platforms and timescales. * Drive experimentation and the continuous improvement of ML models through iterative development, testing, and optimization. * Translate complex scientific challenges into clear and impactful solutions for business stakeholders. * Mentor and guide junior scientists, fostering a collaborative and high-performing team culture. * Foster collaborations between scientists to move faster, with broader impact. * Regularly engage with the broader scientific community with presentations, publications, and patents. A day in the life You will solve real-world problems by getting and analyzing large amounts of data, generate business insights and opportunities, design simulations and experiments, and develop statistical and ML models. The team is driven by business needs, which requires collaboration with other Scientists, Engineers, and Product Managers across the advertising organization. You will prepare written and verbal presentations to share insights to audiences of varying levels of technical sophistication. Team video https://advertising.amazon.com/help/G4LNN5YWHP6SM9TJ