Alessandro Achille, a senior applied scientist at Amazon Web Services, is seen standing outside at night with a display of colored lights in the background
Alessandro Achille, a senior applied scientist at Amazon Web Services, is tackling fundamental challenges that are shaping the future of computer vision and large generative-AI models.

“I don't remember a time in my life when I wasn't interested in science"

From the urgent challenge of "machine unlearning" to overcoming the problem of critical learning periods in deep neural networks, Alessandro Achille is tackling fundamental issues on behalf of Amazon customers.

It was on a “hunting trip” to Italy in 2015 that computer vision pioneer Stefano Soatto first came across Alessandro Achille. More accurately, it was a mind-hunting trip, to the prestigious Scuola Normale Superiore in Pisa. The university was founded by Napoleon, and its alumni include Nobel-Prize-winning physicists Enrico Fermi and Carlo Rubbia and Field-Medal-winning mathematician Alessio Figalli. “It puts students through a grueling selection and training process,” says Soatto, “so those who survive are usually highly capable — and rugged.”

It was a successful trip that evolved into a powerful research partnership. Today, Achille is working as a senior applied scientist at Amazon Web Services' (AWS') AI Lab, on the California Institute of Technology (Caltech) campus, tackling fundamental challenges that are shaping the future of computer vision (CV) and large generative-AI models.

But back in 2015, Achille was immersed in a master’s in pure mathematics, “spiced up”, as he puts it, with algebraic topology.

Related content
Early on, Giovanni Paolini knew little about machine learning — now he’s leading new science on artificial intelligence that could inform AWS products.

“I don't remember a time in my life when I wasn't interested in science,” he says. Achille was particularly interested in the foundations of mathematics. “I focused on logic, because I’ve always had this nagging problem at the back of my mind of exactly why things are the way they are in mathematics.”

Achille’s first taste of computer vision arose when he and his peers decided to augment an annual school tradition: a 24-hour foosball tournament between mathematicians and physicists. Besides a sport competition, the event had become a showcase of the students’ engineering capabilities. That year, after adding live streaming and a fully automated scorekeeping system, the students thought it was time to add real-time tracking of the ball.

“It’s just a white blob moving on a green background. How hard could it be?” says Achille. The short answer is, harder than they thought. So Achille took a class that would teach him more — a choice that would eventually lead to an invitation from Soatto to join him at the University of California, Los Angeles, for a PhD in computer vision.

“In Italian education, it sometimes feels like there is a hierarchy,” says Achille. “The more abstract you are, the better you are doing!” So why the departure from pure mathematics? In the end, says Soatto, “Alessandro’s work became so abstract he couldn’t see a path to impact. That’s very frustrating for a really smart person who wants to make a difference in the world.”

Deep learning takes off

Achille’s PhD coincided with the rise of deep learning (DL), which would become a game-changing technology in machine learning and computer vision. “At the time, we didn't know if it was anything more than just a new, slightly more powerful tool. We didn’t know if DL had the power of abstraction, reasoning, and so on,” says Achille.

Related content
Two recent trends in the theory of deep learning are examinations of the double-descent phenomenon and more-realistic approaches to neural kernel methods.

The power of deep learning was becoming clear, though. During an internship in 2017, Achille worked on a computer vision model that could learn a representation of a dynamic scene — a 3-D shape that was moving, changing color, changing orientation, and so on.

The idea was to capture and isolate the semantic components of the scene — shape, size, color, or angle of rotation — rather than capturing the totality of the scene’s characteristics. Humans do this disentangling naturally. That’s how you would understand the sight of a blue banana, even if you had never seen one before: “banana” and “blue” are separate semantic components.

While Achille enjoyed the project and appreciated its importance, he was struck by the artificiality of the setting. “I was not working backwards from a use case,” he says. Shortly after, Achille became an intern at the AWS AI Lab that had just been established at the Caltech campus, where he was immediately given a real-world challenge to solve on a newly launched product called Custom Label.

Real-world problems

At the time, Custom Label allowed Amazon customers to access CV models that could be trained to identify, say, their company’s products in images — a particular faucet, for example. The models could also be trained to perform tasks like identifying something in a video or analyzing a satellite image.

AWS researchers realized it was impractical to expect a single model to accurately deal with such a range of esoteric image possibilities. A better approach was to pretrain many expert models on different imagery domains and then select the most appropriate one to fine-tune on the customer’s data. The problem for AWS was, how could it efficiently discover which of 100 or more pretrained CV models would perform best?

Alessandro Achille: The information in a deep neural network

During his research in machine learning, Achille became passionate about information theory — a mathematical framework for quantifying, storing, and communicating information. So he used that approach on this so-called model selection problem. “For a hammer, everything looks like a nail,” he laughs.

The problem is how to measure the “distance" between two learning tasks — the task a given AWS model has been pretrained on and the novel customer task. In other words, how much additional information is required by the pretrained model to produce a good performance on the customer task? The less additional information required, the better.

Achille was impressed by the task because it was an important customer issue with a fundamental mathematical problem behind it. “We formulated an algorithm to compute this efficiently, so we could easily select the expert model best suited to solving the customer’s task,” says Achille. “It was the first solution to this problem.”

Achille found Amazon’s applied approach to be a compelling way to work, and when Soatto established the AWS AI Labs, Achille was happy to join him there.

“One of the beauties of being at Amazon is that we’re tackling some of the world's most challenging emerging problems,” says Soatto. “Because when AWS customers have difficult problems to address, they come to us. From a scientific perspective, this is a goldmine.”

Machine unlearning

Achille is currently staking out a vein of research gold in a critical new area of artificial intelligence (AI): AI model disgorgement, more popularly known as "machine unlearning". It is critical in any implementation of machine learning models that the data used to train the model are used responsibly, in a privacy-preserving manner, and in accordance with the appropriate regulations and intellectual-property rights.

Related content
At this year’s ACL, Amazon researchers won an outstanding-paper award for showing that knowledge distillation using contrastive decoding in the teacher model and counterfactual reasoning in the student model improves the consistency of “chain of thought” reasoning.

Modern ML models have become very large and complex, requiring a great deal of data and computational resources to train. But what if, once a model is trained, the contributor of some of those training data decides, or is obligated by law, to withdraw the data from the model? Or what if some of the training data is discovered to be biased? Retraining a large model afresh, with some data withheld, may be impractical, particularly if the requirement for such changes becomes commonplace in the shifting legal landscape.

The next level

In 2019 that Soatto, Achille, and Achille's fellow UCLA PhD student Aditya Golatkar published a paper entitled “Eternal Sunshine of the Spotless Net: Selective Forgetting in Deep Networks”; the paper established a novel method for removing the effects of a subset of a deep neural network's training data, without requiring retraining.

Eternal sunshine of the spotless net: Selective forgetting in deep networks

“I was happy to see interest in ‘selective forgetting’ explode after we published this paper,” says Achille. “Model disgorgement is a fascinating problem, and not only because it's very important for AWS customers. It also demands that we understand everything about a model’s neural network. We need to understand where information is held in a model’s weights, how it is encoded, how it is measured.”

It is in this fundamental work that Achille took the field to “the next level”, says Soatto. And this year, Achille and Soatto, on a team also featuring Amazon Scholar Michael Kearns, coauthor of the book The Ethical Algorithm, led the field by introducing a taxonomy of possible disgorgement methods applicable to modern ML systems.

The paper also describes ways to train future models so that they are amenable to subsequent disgorgement.

Related content
The surprising dynamics related to learning that are common to artificial and biological systems.

“It is better for models to learn in a compartmentalized fashion, so in the event that some data is found to be problematic, everything that touched those data gets thrown away, while the rest of the model survives without having to retrain it from scratch,” says Soatto.

This work has been particularly satisfying, says Achille, as it obliged computer scientists, mathematicians, lawyers, and policymakers to work closely together to solve a pressing modern problem.

Critical learning periods

The breadth of Achille’s interests is formidable. His other prominent research includes work on “critical learning periods” in the training of deep networks. The work arose through serendipity, after a friend studying for a medical exam on the profound effect of critical learning periods in humans jokingly asked Achille if his networks also had them. Interest piqued, Achille explored the idea, and found some striking similarities.

Related content
Technique that mixes public and private training data can meet differential-privacy criteria while cutting error increase by 60%-70%.

For example, take infantile strabismus, a condition in which a person's eyes do not align properly from birth or early infancy. If not treated early, the condition can cause amblyopia, whereby the brain learns to trust the properly working eye and to ignore the visual input from the misaligned eye, to avoid double vision.

This one-sided competition between the two eyes (data sources) leads to worsening vision in the misaligned eye and of course the loss of stereo vision, which is important for depth perception. Amblyopia is difficult to reverse if left untreated into adulthood. But treating the eyes early, enabling them to work together optimally, makes for a robust vision system.

Similarly, in the early training of multimodal deep neural networks, one type of data may become favored over another, simply through expediency. For example, in a visual-question-answering model, which is trained on images and captions, the easy-to-use textual information may outcompete visual information, leading to models that are effectively blind to visual information. Achille and his colleagues suggest that when a DL model takes such shortcuts, it has irreversible effects on the subsequent performance of the model, making it less flexible — and therefore less useful — when fine-tuned on novel data.

Off the charts

Having explored the causes of critical learning periods in deep networks, the team offered new techniques for stabilizing the early learning dynamics in model training and showed how this approach can actually prevent critical periods in deep networks. The practical benefits of this research aside, Achille enjoys exploring the parallelisms of artificial and biological systems.

“Look, we can all recognize that the actual hardware of a network and a brain are completely different, but can we also recognize that they are both systems that are trying to process information efficiently and trying to learn something?” he asks. Are there some fundamental dynamics of learning, and how it relates to the acquisition of information, that are shared between synthetic and biological systems? Watch this space.

Looking back on the eight years since his hunting trip to Pisa, Soatto considers what he most appreciates about his Amazon colleague.

“First, the brilliance of the way Alessandro frames problems: he thinks very abstractly, yet he is also a hacker who thinks broadly, all the way from mathematics to neuroscience, from art to engineering — this is very rare. Second, his curiosity, which is absolutely off the charts.”

For Achille’s part, when asked if he prefers tackling the challenges that arise from AWS products or working on fundamental science problems, he demurs. “I don’t need to split my time between product and fundamental research. For me, it ends up being the same thing.”

Indeed, one of Amazon’s most abstract thinkers has found a path to true impact.

Research areas

Related content

US, WA, Seattle
Innovators wanted! Are you an entrepreneur? A builder? A dreamer? This role is part of an Amazon Special Projects team that takes the company’s Think Big leadership principle to the next level. We focus on creating entirely new products and services with a goal of positively impacting the lives of our customers. No industries or subject areas are out of bounds. If you’re interested in innovating at scale to address big challenges in the world, this is the team for you. As a Senior Research Scientist, you will work with a unique and gifted team developing exciting products for consumers and collaborate with cross-functional teams. Our team rewards intellectual curiosity while maintaining a laser-focus in bringing products to market. Competitive candidates are responsive, flexible, and able to succeed within an open, collaborative, entrepreneurial, startup-like environment. At the intersection of both academic and applied research in this product area, you have the opportunity to work together with some of the most talented scientists, engineers, and product managers. Here at Amazon, we embrace our differences. We are committed to furthering our culture of inclusion. We have thirteen employee-led affinity groups, reaching 40,000 employees in over 190 chapters globally. We are constantly learning through programs that are local, regional, and global. Amazon’s culture of inclusion is reinforced within our 16 Leadership Principles, which remind team members to seek diverse perspectives, learn and be curious, and earn trust. Our team highly values work-life balance, mentorship and career growth. We believe striking the right balance between your personal and professional life is critical to life-long happiness and fulfillment. We care about your career growth and strive to assign projects and offer training that will challenge you to become your best.
US, VA, Arlington
Are you excited to help the US Intelligence Community design, build, and implement AI algorithms, including advanced Generative AI solutions, to augment decision making while meeting the highest standards for reliability, transparency, and scalability? The Amazon Web Services (AWS) US Federal Professional Services team works directly with US Intelligence Community agencies and other public sector entities to achieve their mission goals through the adoption of Machine Learning (ML) and Generative AI methods. We build models for text, image, video, audio, and multi-modal use cases, leveraging both traditional ML approaches and state-of-the-art generative models including Large Language Models (LLMs), text-to-image generation, and other advanced AI capabilities to fit the mission. Our team collaborates across the entire AWS organization to bring access to product and service teams, to get the right solution delivered and drive feature innovation based on customer needs. At AWS, we're hiring experienced data scientists with a background in both traditional and generative AI who can help our customers understand the opportunities their data presents, and build solutions that earn the customer trust needed for deployment to production systems. In this role, you will work closely with customers to deeply understand their data challenges and requirements, and design tailored solutions that best fit their use cases. You should have broad experience building models using all kinds of data sources, and building data-intensive applications at scale. You should possess excellent business acumen and communication skills to collaborate effectively with stakeholders, develop key business questions, and translate requirements into actionable solutions. You will provide guidance and support to other engineers, sharing industry best practices and driving innovation in the field of data science and AI. This position requires that the candidate selected be a US Citizen and currently possess and maintain an active Top Secret security clearance. Key job responsibilities As a Data Scientist, you will: - Collaborate with AI/ML scientists and architects to research, design, develop, and evaluate AI algorithms to address real-world challenges - Interact with customers directly to understand the business problem, help and aid them in implementation of AI solutions, deliver briefing and deep dive sessions to customers and guide customer on adoption patterns and paths to production. - Create and deliver best practice recommendations, tutorials, blog posts, sample code, and presentations adapted to technical, business, and executive stakeholder - Provide customer and market feedback to Product and Engineering teams to help define product direction - This position may require up to 25% local travel. About the team Why AWS? Amazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating — that’s why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses. Diverse Experiences AWS values diverse experiences. Even if you do not meet all of the qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying. Inclusive Team Culture Here at AWS, it’s in our nature to learn and be curious. Our employee-led affinity groups foster a culture of inclusion that empower us to be proud of our differences. Ongoing events and learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon (diversity) conferences, inspire us to never stop embracing our uniqueness. Work/Life Balance We value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why flexible work hours and arrangements are part of our culture. When we feel supported in the workplace and at home, there’s nothing we can’t achieve in the cloud. Mentorship & Career Growth We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.
US, WA, Seattle
Amazon Advertising is one of Amazon's fastest growing businesses. Amazon's advertising portfolio helps merchants, retail vendors, and brand owners succeed via native advertising, which grows incremental sales of their products sold through Amazon. The primary goals are to help shoppers discover new products they love, be the most efficient way for advertisers to meet their business objectives, and build a sustainable business that continuously innovates on behalf of customers. Our products and solutions are strategically important to enable our Retail and Marketplace businesses to drive long-term growth. We deliver billions of ad impressions and millions of clicks and break fresh ground in product and technical innovations every day! The Creative X team within Amazon Advertising time aims to democratize access to high-quality creatives (audio, images, videos, text) by building AI-driven solutions for advertisers. To accomplish this, we are investing in understanding how best users can leverage Generative AI methods such as latent-diffusion models, large language models (LLM), generative audio (music and speech synthesis), computer vision (CV), reinforced learning (RL) and related. As an Applied Scientist you will be part of a close-knit team of other applied scientists and product managers, UX and engineers who are highly collaborative and at the top of their respective fields. We are looking for talented Applied Scientists who are adept at a variety of skills, especially at the development and use of multi-modal Generative AI and can use state-of-the-art generative music and audio, computer vision, latent diffusion or related foundational models that will accelerate our plans to generate high-quality creatives on behalf of advertisers. Every member of the team is expected to build customer (advertiser) facing features, contribute to the collaborative spirit within the team, publish, patent, and bring SOTA research to raise the bar within the team. As an Applied Scientist on this team, you will: - Drive the invention and development of novel multi-modal agentic architectures and models for the use of Generative AI methods in advertising. - Work closely and integrate end-to-end proof-of-concept Machine Learning projects that have a high degree of ambiguity, scale and complexity. - Build interface-oriented systems that use Machine Learning models, perform proof-of-concept, experiment, optimize, and deploy your models into production; work closely with software engineers to assist in productionizing your ML models. - Curate relevant multi-modal datasets. - Perform hands-on analysis and modeling of experiments with human-in-the-loop that eg increase traffic monetization and merchandise sales, without compromising the shopper experience. - Run A/B experiments, gather data, and perform statistical analysis. - Establish scalable, efficient, automated processes for large-scale data analysis, machine-learning model development, model validation and serving. - Mentor and help recruit Applied Scientists to the team. - Present results and explain methods to senior leadership. - Willingness to publish research at internal and external top scientific venues. - Write and pursue IP submissions. Key job responsibilities This role is focused on developing new multi-modal Generative AI methods to augment generative imagery and videos. You will develop new multi-modal paradigms, models, datasets and agentic architectures that will be at the core of advertising-facing tools that we are launching. You may also work on development of ML and GenAI models suitable for advertising. You will conduct literature reviews to stay on the SOTA of the field. You will regularly engage with product managers, UX designers and engineers who will partner with you to productize your work. For reference see our products: Enhanced Video Generator, Creative Agent and Creative Studio. A day in the life On a day-to-day basis, you will be doing your independent research and work to develop models, you will participate in sprint planning, collaborative sessions with your peers, and demo new models and share results with peers, other partner teams and leadership. About the team The team is a dynamic team of applied scientists, UX researchers, engineers and product leaders. We reside in the Creative X organization, which focuses on creating products for advertisers that will improve the quality of the creatives within Amazon Ads. We are open to hiring candidates to work out of one of the following locations: UK (London), USA (Seattle).
US, WA, Seattle
Do you want to join an innovative team of scientists who use machine learning and statistical techniques to help Amazon provide the best customer experience by preventing eCommerce fraud? Are you excited by the prospect of analyzing and modeling terabytes of data and creating state-of-the-art algorithms to solve real world problems? Do you like to own end-to-end business problems/metrics and directly impact the profitability of the company? Do you enjoy collaborating in a diverse team environment? If yes, then you may be a great fit to join the Amazon Selling Partner Trust & Store Integrity Science Team. We are looking for a talented scientist who is passionate to build advanced machine learning systems that help manage the safety of millions of transactions every day and scale up our operation with automation. Key job responsibilities Innovate with the latest GenAI/LLM/VLM technology to build highly automated solutions for efficient risk evaluation and automated operations Design, develop and deploy end-to-end machine learning solutions in the Amazon production environment to create impactful business value Learn, explore and experiment with the latest machine learning advancements to create the best customer experience A day in the life You will be working within a dynamic, diverse, and supportive group of scientists who share your passion for innovation and excellence. You'll be working closely with business partners and engineering teams to create end-to-end scalable machine learning solutions that address real-world problems. You will build scalable, efficient, and automated processes for large-scale data analyses, model development, model validation, and model implementation. You will also be providing clear and compelling reports for your solutions and contributing to the ongoing innovation and knowledge-sharing that are central to the team's success.
US, WA, Seattle
Are you passionate about applying machine learning and advanced statistical techniques to protect one of the world's largest online marketplaces? Do you want to be at the forefront of developing innovative solutions that safeguard Amazon's customers and legitimate sellers while ensuring a fair and trusted shopping experience? Do you thrive in a collaborative environment where diverse perspectives drive breakthrough solutions? If yes, we invite you to join the Amazon Risk Intelligence Science Team. We're seeking an exceptional scientist who can revolutionize how we protect our marketplace through intelligent automation. As a key member of our team, you'll develop and deploy state-of-the-art machine learning systems that analyze millions of seller interactions daily, ensuring the integrity and trustworthiness of Amazon's marketplace while scaling our operations to new heights. Your work will directly impact the safety and security of the shopping experience for hundreds of millions of customers worldwide, while supporting the growth of honest entrepreneurs and businesses. Key job responsibilities • Use machine learning and statistical techniques to create scalable abuse detection solutions that identify fraudulent seller behavior, account takeovers, and marketplace manipulation schemes • Innovate with the latest GenAI technology to build highly automated solutions for efficient seller verification, transaction monitoring, and risk assessment • Design, develop and deploy end-to-end machine learning solutions in the Amazon production environment to prevent and detect sophisticated abuse patterns across the marketplace • Learn, explore and experiment with the latest machine learning advancements to protect customer trust and maintain marketplace integrity while supporting legitimate selling partners • Collaborate with cross-functional teams to develop comprehensive risk models that can adapt to evolving abuse patterns and emerging threats About the team You'll be working closely with business partners and engineering teams to create end-to-end scalable machine learning solutions that address real-world problems. You will build scalable, efficient, and automated processes for large-scale data analyses, model development, model validation, and model implementation. You will also be providing clear and compelling reports for your solutions and contributing to the ongoing innovation and knowledge-sharing that are central to the team's success.
US, WA, Seattle
Prime Video is a first-stop entertainment destination offering customers a vast collection of premium programming in one app available across thousands of devices. Prime members can customize their viewing experience and find their favorite movies, series, documentaries, and live sports – including Amazon MGM Studios-produced series and movies; licensed fan favorites; and exclusive access to coverage of live sports. All customers regardless of whether they have a Prime membership or not, can access programming from subscriptions such as Apple TV, Peacock Premium Plus, HBO Max, FOX One, Crunchyroll and MGM+, as well as more than 900 free ad-support (FAST) Channels, rent or buy titles, and enjoy even more content for free with ads. Are you interested in shaping the future of entertainment? Prime Video's technology teams are creating best-in-class digital video experience. As a Prime Video technologist, you’ll have end-to-end ownership of the product, user experience, design, and technology required to deliver state-of-the-art experiences for our customers. You’ll get to work on projects that are fast-paced, challenging, and varied. You’ll also be able to experiment with new possibilities, take risks, and collaborate with remarkable people. We’ll look for you to bring your diverse perspectives, ideas, and skill-sets to make Prime Video even better for our customers. With global opportunities for talented technologists, you can decide where a career Prime Video Tech takes you! Interested in influencing what customers around the world see when they turn on Prime Video? The Prime Video Personalization and Discovery team matches customers with the right content at the right time, at all touch points throughout the content discovery journey. We are looking for a customer-focused, solutions-oriented Senior Data Scientist to build and guide new data-driven frameworks to understand what makes new personalization and content discovery innovations successful for users and the business. You'll be part of an embedded science team on projects that are fast-paced, challenging, and ultimately influence what millions of customers around the world see when the log into Prime Video. The ideal candidate brings strong problem-solving skills, stakeholder communication skills, and the ability to balance technical rigor with delivery speed and customer impact. You will build cross-functional support within Prime Video, assess business problems, define metrics, and support iterative scientific solutions that balance short-term delivery with long-term science roadmaps. Key job responsibilities - Use advanced statistical and machine learning techniques to extract insights from complex, large-scale data sets - Design and implement end-to-end data science workflows, from data acquisition and cleaning to model development, testing, and deployment - Support scalable, self-service data analyses by building datasets for analytics, reporting and ML use cases - Partner with product stakeholders and science peers to identify strategic data-driven opportunities to improve the customer experience - Communicate findings, conclusions, and recommendations to technical and non-technical stakeholders - Stay up-to-date on the latest data science tools, techniques, and best practices and help evangelize them across the organization
US, CA, Sunnyvale
Prime Video is a first-stop entertainment destination offering customers a vast collection of premium programming in one app available across thousands of devices. Prime members can customize their viewing experience and find their favorite movies, series, documentaries, and live sports – including Amazon MGM Studios-produced series and movies; licensed fan favorites; and programming from Prime Video add-on subscriptions such as Apple TV+, Max, Crunchyroll and MGM+. All customers, regardless of whether they have a Prime membership or not, can rent or buy titles via the Prime Video Store, and can enjoy even more content for free with ads. Are you interested in shaping the future of entertainment? Prime Video's technology teams are creating best-in-class digital video experience. As a Prime Video team member, you’ll get to work on projects that are fast-paced, challenging, and varied. You’ll also be able to experiment with new possibilities, take risks, and collaborate with remarkable people. We’ll look for you to bring your diverse perspectives, ideas, and skill-sets to make Prime Video even better for our customers. We are looking for an Applied Scientist to push the envelope of AI content generation. As a scientist at Prime Video, you will contribute directly to productions using innovative tools in computer vision, deep learning, and generative AI to transform entertainment experiences. The ideal candidate has deep knowledge in one of: graphics, deep learning, generative AI and/or reinforcement learning and experience applying them real-world problems. You understand tradeoffs between business needs and model complexity, and you take calculated risks in developing rapid prototypes and iterative model improvements. You are excited to learn from and alongside seasoned scientists, engineers, and business leaders. You are an excellent communicator and effectively translate technical findings into production systems and business action (and customer delight). Key job responsibilities • Build generative AI models that create production-ready content, including movie content, localized assets, and visual marketing materials used across Prime Video's global platform. • Drive end-to-end machine learning projects that have a high degree of ambiguity, scale, complexity. • Build machine learning models, perform proof-of-concept, experiment, optimize, and deploy your models. • Run experiments, gather data, and perform statistical analysis. • Establish scalable, efficient, automated processes for large-scale data analysis, machine-learning model development, model validation and serving. • Research new and innovative machine learning approaches. • Share knowledge and research outcomes via internal and external conferences and journal publications A day in the life In this role, you will invent science and systems for content localization, generation, including graphics and machine learning-based modeling systems. You will work with a team of scientists and product managers to design customer-facing products, and you will work with technology teams to productize and maintain the associated solutions.
US, WA, Bellevue
The Amazon Fulfillment Technologies (AFT) Science team is seeking an exceptional Applied Scientist with strong operations research and optimization expertise to develop production solutions for one of the most complex systems in the world: Amazon's Fulfillment Network. At AFT Science, we design, build, and deploy optimization, statistics, machine learning, and GenAI/LLM solutions that power production systems running across Amazon Fulfillment Centers worldwide. We tackle a wide range of challenges throughout the network, including labor planning and staffing, pick scheduling, stow guidance, and capacity risk management. Our mission is to develop innovative, scalable, and reliable science-driven production solutions that exceed the published state of the art, enabling systems to run optimally and continuously (from every few minutes to every few hours) across our large-scale network. Key job responsibilities As an Applied Scientist, you will collaborate with scientists, software engineers, product managers, and operations leaders to develop optimization-driven solutions that directly impact process efficiency and associate experience in the fulfillment network. Your key responsibilities include: - Develop deep understanding and domain knowledge of operational processes, system architecture, and business requirements - Dive deep into data and code to identify opportunities for continuous improvement and disruptive new approaches - Design and develop scalable mathematical models for production systems to derive optimal or near-optimal solutions for existing and emerging challenges - Create prototypes and simulations for agile experimentation of proposed solutions - Advocate for technical solutions with business stakeholders, engineering teams, and senior leadership - Partner with software engineers to integrate prototypes into production systems - Design and execute experiments to test new or incremental solutions launched in production - Build and monitor metrics to track solution performance and business impact About the team Amazon Fulfillment Technology (AFT) designs, develops, and operates end-to-end fulfillment technology solutions for all Amazon Fulfillment Centers (FCs). We harmonize the physical and virtual worlds so Amazon customers can get what they want, when they want it. The AFT Science team brings expertise in operations research, optimization, statistics, machine learning, and GenAI/LLM, combined with deep domain knowledge of operational processes within FCs and their unique challenges. We prioritize advancements that support AFT tech teams and focus areas rather than specific fields of research or individual business partners. We influence each stage of innovation from inception to deployment, which includes both developing novel solutions and improving existing approaches. Our production systems rely on a diverse set of technologies, and our teams invest in multiple specialties as the needs of each focus area evolve.
CA, ON, Toronto
Are you interested in shaping the future of Advertising and B2B Sales? We are a growing science and engineering team with an exciting charter and need your passion, innovative thinking, and creativity to help take our products to new heights. Amazon Advertising is one of Amazon's fastest growing and most profitable businesses, responsible for defining and delivering a collection of advertising products that drive discovery and sales. Our products are strategically important to our businesses driving long term growth. We break fresh ground in product and technical innovations every day! Within the Advertising Sales organization, we are building a central AI/ML team and are seeking top science talent to build new, science-backed services to drive success for our customers. Our goal is to transform the way account teams operate by creating actionable insights and recommendations they can share with their advertising accounts, and ingesting Generative AI throughout their end-to-end workflows to improve their work efficiency. As a part of our team, you will bring deep expertise in Generative AI and quantitative modeling (forecasting, recommender systems, reinforcement learning, causal inferencing or generative artificial intelligence) to build and refine models that can be implemented in production. You will contribute to chart new courses with our ad sales support technologies, and you have the communication skills necessary to explain complex technical approaches to a variety of stakeholders and customers. You will be part of a team of fellow scientists and engineers taking on iterative approaches to tackle big, long-term problems. Why you will love this opportunity: Amazon has invested heavily in building a world-class advertising business. This team defines and delivers a collection of advertising products that drive discovery and sales. Our solutions generate billions in revenue and drive long-term growth for Amazon's Retail and Marketplace businesses. We deliver billions of ads impressions, millions of clicks daily, and break fresh ground to create world-class products. We are a highly motivated, collaborative, and fun-loving team with an entrepreneurial spirit with a broad mandate to experiment and innovate. Impact and Career Growth: You will invent new experiences and influence customer-facing shopping experiences; this is your opportunity to work within the fastest growing businesses across all of Amazon! Define a long-term scientific vision for our advertising sales business, driven from our customers' needs, translating that direction into specific plans for scientists, engineers and product teams. This role combines scientific leadership, organizational ability, technical strength, product focus, and business understanding. Key job responsibilities - Conceptualize and lead state-of-the-art research on new Machine Learning and Generative Artificial Intelligence solutions to optimize all aspects of the Ad Sales business - Guide the technical approach for the design and implementation of successful models and algorithms in support of expert cross-functional teams delivering on demanding projects - Conduct deep data analysis to derive insights to the business, and identify gaps and new opportunities - Run regular A/B experiments, gather data, and perform statistical analysis - Work closely with software engineers to deliver end-to-end solutions into production - Improve the scalability, efficiency and automation of large-scale data analytics, model training, deployment and serving About the team Sales AI is a central science and engineering organization within Amazon Advertising Sales that powers selling motions and account team workflows via state-of-the-art of AI/ML services. Sales AI is investing in a range of sales intelligence models, including the development of advertiser insights, recommendations and Generative AI-powered applications throughout account team workflows.
US, NY, New York
In this role, you will build scalable solutions and sophisticated models that identify and drive growth opportunities for Amazon Ads teams, specifically within Amazon's Demand Side Platform (ADSP). You will leverage machine learning, simulation, and advanced statistical techniques to explain complex patterns, quantify business impact, predict future trends, and prescribe actionable strategies that inform critical business decisions at the highest levels of the organization. You will work with various stakeholders to align on priorities, with the understanding that scope and direction may evolve based on organizational needs. You will translate business goals into agile, insightful analytics that create tangible value for both stakeholders and customers, and communicate your findings clearly and actionably to managers and senior leaders so they can quickly understand insights and take decisive action. You will set the strategy for ads delivery and quality and establish the measurement and decision frameworks. A core mandate for this role is to identify, instrument, and operationalize the input metrics that most directly drive ads delivery, quality, and performance, ensuring we optimize the levers that move outcomes rather than simply reporting on lagging KPIs. Key job responsibilities * You will define and execute in-depth data analysis that drives data-informed decision making for product, sales, and finance teams who speak on behalf of advertisers. * You will establish and drive data hygiene best practices to ensure coherence and integrity of data feeding into production ML/AI solutions. * You will identify, instrument, and operationalize the input metrics that most directly drive ads delivery, quality, and performance, creating robust measurement frameworks. * You will collaborate with colleagues across science and engineering disciplines for fast turnaround proof-of-concept prototyping at scale. * You will partner with product managers and stakeholders to define forward-looking product visions and prospective business use cases. * You will set the strategy for ads delivery and quality, establishing decision frameworks that enable teams to move from reactive reporting to proactive optimization. * You will drive and lead a culture of data-driven innovations within the Amazon AdTech org.