Code and Datasets

Alexa Teacher Model (AlexaTM 20B)

Saleh Soltan, Shankar Ananthakrishnan, Jack G. M. FitzGerald, Rahul Gupta, Wael Hamza, Haidar Khan, Charith Peris, Stephen Rawls, Andy Rosenbaum, Anna Rumshisky, Chandana Satya Prakash, Mukund Sridhar, Fabian Triefenbach, Apurv Verma, Gokhan Tur, Prem Natarajan

2022

Last updated May 22, 2023

Access

Share

Share

Share

Share

Access

Share

Share

A 20 billion parameter multilingual seq2seq model called Alexa Teacher Model (AlexaTM 20B), which achieves state-of-the-art (SOTA) performance on 1-shot summarization tasks, outperforming a much larger 540B PaLM decoder model. AlexaTM 20B also achieves SOTA in 1-shot machine translation, especially for low-resource languages, across almost all language pairs supported by the model (Arabic, English, French, German, Hindi, Italian, Japanese, Marathi, Portuguese, Spanish, Tamil, and Telugu) on Flores-101 dataset. We also show in zero-shot setting, AlexaTM 20B outperforms GPT3 (175B) on SuperGLUE and SQuADv2 datasets and provides SOTA performance on multilingual tasks such as XNLI, XCOPA, Paws-X, and XWinograd. Overall, our results present a compelling case for seq2seq models as a powerful alternative to decoder-only models for Large-scale Language Model (LLM) training.

Types

Model

Amazon and University of Michigan give robots a sense of touch

Mani Nambi, Nima Fazeli

July 10, 2026

HydroShear, a new physics-based simulator, teaches robots how to use their sense of touch to perform complex manipulation tasks, in a way that transfers seamlessly to the real world.

Robotics
Capturing token IDs during agentic interactions for better reinforcement learning

Frederick Robinson

July 9, 2026

A new Rust proxy called Turnstile sits between the model backend and the agent harness to capture information lost in mere text transcripts.

Machine learning
Novel “Kaputt” dataset sets new benchmark for large-scale visual defect detection

Sebastian Hoefer

October 2, 2025

A new dataset with over 238,000 images challenges and advances the state of the art in visual defect detection for complex retail applications.

Computer vision

Applied Scientist, Observability, Prime Video

GB, London

Come build the future of entertainment with us. Are you interested in shaping the future of movies and television? Do you want to define the next generation of how and what Amazon customers are watching? Prime Video is a premium streaming service that offers customers a vast collection of TV shows and movies - all with the ease of finding what they love to watch in one place. We offer customers thousands of popular movies and TV shows including Amazon Originals and exclusive licensed content to exciting live sports events. We also offer our members the opportunity to subscribe to add-on channels which they can cancel at anytime and to rent or buy new release movies and TV box sets on the Prime Video Store. Prime Video is a fast-paced, growth business - available in over 200 countries and territories worldwide. The team works in a dynamic environment where innovating on behalf of our customers is at the heart of everything we do. If this sounds exciting to you, please read on. PV observability team's mission is to deliver efficient, zero-touch observability solutions that combine log management, tracing, and AI-powered analytics, enabling teams to detect, diagnose, and resolve Prime Video issues at unprecedented speed. We are looking for an Applied Scientist for our London office experienced in generative AI and large models. This is a wide impact role working with development teams across the UK, India, and the US. You will develop and deploy customized models for PV builders needs at scale, and explore emerging techniques that help us make better decisions faster for agentic solutions. This is a hands-on role working with a high performing and high visibility multidisciplinary group of engineers and scientists in the London office, focused on improving the PV builders experience for Prime Video organization. You will have strong technical ability, excellent teamwork and communication skills, and a strong motivation to deliver customer value from your research. Our position offers opportunities to grow your technical and non-technical skills and make a global impact immediately. Key job responsibilities - Develop machine learning algorithms for high-scale recommendations problems - Rapidly design, prototype and test many possible hypotheses in a high-ambiguity environment, making use of both quantitative analysis and business judgement - Collaborate with software engineers to integrate successful experimental results into Prime Video wide processes - Report and share results with the team and wider scientific community by authoring documents that are both statistically rigorous and compellingly relevant, exemplifying good scientific practice in a business environment A day in the life You will lead the design of machine learning models that scale to very large quantities of data across multiple dimensions. You will embody scientific rigor, designing and executing experiments to demonstrate the technical effectiveness and business value of your methods. You will work alongside other scientists and engineering teams to deliver your research into production systems. About the team Our team owns Prime Video observability features for development teams. We consume PBs of data daily which feed into multiple observability features focussed on reducing the customer impact time.

Data Scientist III - AMZ9976173

US, CA, Santa Clara

MULTIPLE POSITIONS AVAILABLE Employer: AMAZON.COM SERVICES LLC Offered Position: Data Scientist III Job Location: Santa Clara, California Job Number: AMZ9976173 Position Responsibilities: Own the data science elements of various products to help with data-based decision making, product performance optimization, and product performance tracking. Work directly with product managers to help drive the design of the product. Work with Technical Product Managers to help drive the build planning. Translate business problems and products into data requirements and metrics. Initiate the design, development, and implementation of scientific analysis projects or deliverables. Own the analysis, modelling, system design, and development of data science solutions for products. Write documents and make presentations that explain model/analysis results to the business. Bridge the degree of uncertainty in both problem definition and data scientific solution approaches. Build consensus on data, metrics, and analysis to drive business and system strategy. 40 hours / week, 8:00am-5:00pm, Salary Range: $183,000/year to $247,600/year. Amazon is a total compensation company. Dependent on the position offered, equity, sign-on payments, and other forms of compensation may be provided as part of a total compensation package, in addition to a full range of medical, financial, and/or other benefits. For more information, visit: https://www.aboutamazon.com/workplace/employee-benefits. Amazon.com is an Equal Opportunity-Affirmative Action Employer – Minority / Female / Disability / Veteran / Gender Identity / Sexual Orientation.#0000

Human-Robot Interaction Applied Scientist , Fauna

US, NY, New York

We are seeking a Human-Robot Interaction (HRI) Applied Scientist to develop cutting-edge interactions that make robots feel alive, personal, and fun. In this role, you will focus on verbal and non-verbal conversational systems, social dynamics, memory, and long-term relationship formation between robots, their environments, and the people they interact with. Your contributions will be essential in advancing robotics by enabling expressive, socially intelligent, and trustworthy interactions between robots and humans. Key job responsibilities - Develop interactive systems that leverage large language models, multimodal inputs and outputs, reinforcement learning from human feedback, or other advanced techniques to achieve fluid, engaging, and socially appropriate robot behavior - Design and implement intelligent conversational systems that handle turn-taking, grounding, interruption, and incorporates context drawn from a robot's physical environment and shared history with a user - Integrate perceptual sensor streams including gaze, facial expression, gesture, posture, and more to understand social context and produce coherent, lifelike interactions. - Develop memory and personalization systems that allow robots to form lasting relationships with individual users, learn their environments, and adapt their behavior over weeks and months - Stay updated on advancements in HRI, NLP, multimodal AI, and cognitive and social science to apply cutting-edge techniques to robot interaction challenges - Lead technical projects from conception through production deployment - Mentor junior scientists and engineers - Bridge research initiatives with practical engineering implementation

Applied Scientist, Secure 3P Tools

US, TX, Austin

Amazon Security is seeking an Applied Scientist to work on GenAI acceleration within the Secure Third Party Tools (S3T) organization. The S3T team has bold ambitions to re-imagine security products that serve Amazon's pace of innovation at our global scale. This role will focus on leveraging large language models and agentic AI to transform third-party security risk management, automate complex vendor assessments, streamline controllership processes, and dramatically reduce assessment cycle times. You will drive builder efficiency and deliver bar-raising security engagements across Amazon. Key job responsibilities Own and drive end-to-end technical delivery for scoped science initiatives focused on third-party security risk management, independently defining research agendas, success metrics, and multi-quarter roadmaps with minimal oversight. Understanding approaches to automate third-party security review processes using state-of-the-art large language models, development intelligent systems for vendor assessment document analysis, security questionnaire automation, risk signal extraction, and compliance decision support. Build advanced GenAI and agentic frameworks including multi-agent orchestration, RAG pipelines, and autonomous workflows purpose-built for third-party risk evaluation, security documentation processing, and scalable vendor assessment at enterprise scale. Build ML-powered risk intelligence capabilities that enhance third-party threat detection, vulnerability classification, and continuous monitoring throughout the vendor lifecycle. Coordinate with Software Engineering and Data Engineering to deploy production-grade ML solutions that integrate seamlessly with existing third-party risk management workflows and scale across the organization. About the team Security is central to maintaining customer trust and delivering delightful customer experiences. At Amazon, our Security organization is designed to drive bar-raising security engagements. Our vision is that Builders raise the Amazon security bar when they use our recommended tools and processes, with no overhead to their business. Diverse Experiences Amazon Security values diverse experiences. Even if you do not meet all of the qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying. Why Amazon Security? At Amazon, security is central to maintaining customer trust and delivering delightful customer experiences. Our organization is responsible for creating and maintaining a high bar for security across all of Amazon’s products and services. We offer talented security professionals the chance to accelerate their careers with opportunities to build experience in a wide variety of areas including cloud, devices, retail, entertainment, healthcare, operations, and physical stores. Inclusive Team Culture In Amazon Security, it’s in our nature to learn and be curious. Ongoing DEI events and learning experiences inspire us to continue learning and to embrace our uniqueness. Addressing the toughest security challenges requires that we seek out and celebrate a diversity of ideas, perspectives, and voices. Training & Career Growth We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, training, and other career-advancing resources here to help you develop into a better-rounded professional. Work/Life Balance We value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why flexible work hours and arrangements are part of our culture. When we feel supported in the workplace and at home, there’s nothing we can’t achieve.

Economist, Amazon Customer Service

US, WA, Seattle

Amazon Customer Service (CS) Data Intelligence builds the data and Artificial Intelligence (AI) foundations for CS to ensure Amazon delivers the best customer service possible. CS Economics sits within CS DI and contributes to the CS knowledge base and decision frameworks. CS Economics seeks economists to apply economic methods to solve business problems. The ideal candidate will work with engineers and applied scientists to design models that leverage large scale and unstructured data, design scalable agents for non-tech CS partners to understand the impact of their actions, and propose mechanism designs to robustly match customers to our services. CS Economics is looking for optimistic critical-thinkers who combine a strong technical economic toolbox with a desire to learn from other disciplines, and who know how to execute and deliver on big ideas as part of an interdisciplinary technical team. Ideal candidates enjoy working in a team setting with individuals from diverse disciplines and backgrounds. They will work with teammates to develop scientific models and conduct data analysis, modeling, and experimentation that is necessary for estimating and validating models. They will work closely with engineering teams to develop scalable data resources to support rapid insights, and take successful models and findings into production as new products and services. They will be customer-centric and will communicate scientific approaches and findings to business leaders, listening to and incorporate their feedback, and delivering successful scientific solutions. Key job responsibilities - Design and conduct rigorous evaluations of CS actions - Develop experiments to evaluate product launches - Communicate complex findings to business stakeholders in clear, actionable terms - Work with engineering teams to develop scalable tools that automate and streamline evaluation processes A day in the life Work with teammates to apply economic methods to business problems, e.g., identify the appropriate research question and identification strategy, write code to estimate heterogeneous treatment effects or conduct experiment analysis, write and present a document with findings to business leaders. We collaborate with partner teams within and outside of CS throughout the process, from understanding their challenges, to developing a research agenda that will address those challenges, to help them implement solutions. About the team Amazon Customer Service (CS) Economics provides estimates and measures of the causal impact of CS actions on costs and benefits. We build agents and guide leadership to establish processes to scale valid experimentation, causal inference, and mechanism design.

Senior Applied Scientist, PXT

US, WA, Seattle

Do you want to leverage your expertise in translating innovative science into impactful products to improve the lives and work of over a million people worldwide? If so, People eXperience Technology Core Science team would love to discuss how you can make that a reality. Our team is an interdisciplinary team that uses behavioral science, statistics, and machine learning to identify products, mechanisms, and process improvements that enhance Amazonians' well-being and their ability to deliver value for Amazon's customers. We collaborate with HR teams across Amazon to make Amazon PXT the most scientific human resources organization in the world. In this role, you will spearhead science design and technical implementation innovations across our talent solution science work-streams. You'll enhance existing models and create new ones, empowering leaders throughout Amazon to make data-driven business decisions. You'll collaborate with scientists and engineers to deliver solutions while working closely with business stakeholders to address their specific needs. Your work will span various business domains (corporate, operations, safety) and analysis levels (individual, group, organizational), utilizing a range of modeling approaches (linear, tree-based, deep neural networks, and LLM-based). You'll develop end-to-end ML solutions from problem formulation to deployment, maintaining high scientific standards and technical excellence throughout the process. As an Applied Scientist, you'll also contribute to the team's science strategy, keeping pace with emerging AI/ML trends. You'll mentor junior scientists, fostering their growth by identifying high-impact opportunities. Your guidance will span different analysis levels and modeling approaches, enabling stakeholders to make informed, strategic decisions. If you excel at building advanced scientific solutions and are passionate about developing technologies that drive organizational change in the AI era, join us as we work hard, have fun, and make history. Key job responsibilities Key job responsibilities • Model Development & Innovation: Design and implement novel GenAI/LLM solutions using foundation models (e.g., Claude, GPT) and AWS services including Amazon Bedrock, SageMaker, and other AWS AI/ML tools • Research & Experimentation: Conduct applied research to advance the state-of-the-art in LLM applications, including prompt engineering, few-shot learning, fine-tuning, and model evaluation • Production Deployment: Build scalable, production-ready AI systems that serve millions of requests with high reliability, low latency, and cost efficiency • Cross-Functional Collaboration: Partner with product managers, engineers, and business stakeholders to translate business requirements into technical solutions and drive measurable impact • Technical Leadership: Mentor junior scientists, contribute to technical strategy, and establish best practices for GenAI development across the organization • Evaluation & Metrics: Design rigorous evaluation frameworks to measure model performance, bias, safety, and business impact • Documentation & Influence: Publish technical papers, create documentation, and influence both technical and non-technical audiences About the team The People eXperience and Technology (PXT) Core Science Team uses science, engineering, and customer-obsessed problem solving to proactively identify mechanisms, process improvements, and products that simultaneously improve Amazon and Amazonians' lives, wellbeing, and value of work. As an interdisciplinary team combining talents from machine learning, statistics, economics, behavioral science, engineering, and product development, the Core Science team develops and delivers measurable solutions through innovation and rapid prototyping to accelerate informed, accurate, and reliable decision-making backed by science and data. We are building a talent intelligence layer — fusing natural language understanding, network science, and large-scale predictive modeling into a unified platform that continuously learns from how people work, collaborate, and grow across one of the world's largest and most complex workforces.

Senior Applied Scientist, ASCS AI Lab Team

US, WA, Seattle

We are seeking a Senior Applied Scientist to join our team in developing pioneering AI research, Generative AI, Agentic AI, Large Language Models (LLMs), Diffusion and Flow Models, and other advanced Machine Learning and Deep Learning solutions for Amazon Selection and Catalog Systems, within the AI Lab Team. This role offers a unique opportunity to work on AI research and AI products that will shape the future of online shopping experiences. Our team operates at the forefront of AI research and development, working on challenges that directly impact millions of customers worldwide. We push the boundaries of AI at both the foundational and application layers. As a Senior Applied Scientist, you will have the chance to experiment with LLMs and deep learning techniques, apply your research to solve real-world problems at an unprecedented scale, and collaborate with experienced scientists to contribute to Amazon's scientific innovation. Join us in redefining the future of shopping. Your work will directly influence how customers interact with the world's largest online store. Key job responsibilities - Design and implement novel AI solutions for Amazon catalog of products - Develop and train state-of-the-art LLMs, Diffusion Models, and other Generative AI models - Build and deploy autonomous AI Agents in Amazon production ecosystem - Scale AI models to handle billions of diverse products across multiple languages and geographies - Conduct research in areas such as Autonomous AI Agents, Generative AI, Language Modeling, Multi-modality Computer Vision, Diffusion Models, Reinforcement Learning - Collaborate with cross-functional teams to integrate AI models into Amazon's production ecosystem - Contribute to the scientific community through publications and conference presentations

Sr. Applied Scientist, Pricing Science

US, WA, Seattle

Here's the job description with causal ML woven in: We are looking for a talented, organized, and customer-focused applied researcher to join our Pricing Optimization science group, with a charter to measure, refine, and launch customer-obsessed improvements to our algorithmic pricing and promotion models across all products listed on Amazon. This role requires an individual with exceptional machine learning modeling and architecture expertise — particularly in deep learning, neural networks, and transformer-based architectures applied to price prediction and forecasting problems. Equally important is deep expertise in causal machine learning — including causal inference, treatment-effect estimation, and experimentation methods (e.g., uplift modeling, double/debiased machine learning, instrumental variables, and A/B and quasi-experimental design) — to isolate the true impact of pricing and promotion decisions on customer behavior and business outcomes. The ideal candidate brings a strong foundation in applied statistics and probabilistic modeling, excellent cross-functional collaboration skills, business acumen, and an entrepreneurial spirit. We are looking for an experienced innovator who is a self-starter, comfortable with ambiguity, demonstrates strong attention to detail, and has the ability to work in a fast-paced and ever-changing environment. Key job responsibilities See the big picture. Understand and influence the long-term vision for Amazon's science-based competitive, perception-preserving pricing techniques. Develop and advance price prediction models leveraging deep learning frameworks, transformer architectures, and advanced statistical methods to drive pricing accuracy at scale. Build strong collaborations. Partner with product, engineering, and science teams within Pricing & Promotions to deploy machine learning price estimation and error correction solutions at Amazon scale. Design and implement neural network-based architectures — including sequence models and transformers — for large-scale price prediction and optimization. Stay informed. Establish mechanisms to stay up to date on the latest scientific advancements in deep learning, transformer architectures, applied statistics, neural network design, probabilistic forecasting, and multi-objective optimization techniques. Identify opportunities to apply them to relevant Pricing & Promotions business problems. Keep innovating for our customers. Foster an environment that promotes rapid experimentation, continuous learning, and incremental value delivery. Leverage statistical rigor and modern deep learning approaches to validate hypotheses and drive measurable pricing improvements. Successfully execute & deliver. Apply your exceptional technical machine learning expertise — including deep neural networks, attention-based models, and applied statistical analysis — to incrementally move the needle on some of our hardest pricing problems. A day in the life We are hiring a Sr. Applied Scientist to drive our pricing optimization initiatives. We drive cross-domain and cross-system improvements through: * shape and extend our RL optimization platform - a pricing centric tool that automates the optimization of various system parameters and price inputs. * Error detection and price quality guardrails at scale. * Identifying opportunities to optimally price across systems and contexts (marketplaces, request types, event periods) Price is a highly relevant input into Stores architectures; this role creates the opportunity to drive extremely large impact (measured in Bs not Ms), but demands careful thought and clear communication. About the team The Pricing Optimization science group builds and refines Amazon's algorithmic pricing and promotion models at scale. Our team combines expertise in deep learning, transformer architectures, applied statistics, and probabilistic forecasting to develop price prediction systems that directly impact the customer experience. The team also brings hands-on experience with causal modeling and inference — including uplift modeling and treatment effect estimation — to rigorously measure the impact of pricing decisions on customer behavior and business outcomes. We partner closely with product, engineering, and business teams to take solutions from research through production deployment.

Applied Scientist II, ASCS AI Lab Team

US, CA, Sunnyvale

We are seeking an Applied Scientist II to join our team in developing pioneering AI research, Generative AI, Agentic AI, Large Language Models (LLMs), Diffusion and Flow Models, and other advanced Machine Learning and Deep Learning solutions for Amazon Selection and Catalog Systems, within the AI Lab Team. This role offers a unique opportunity to work on AI research and AI products that will shape the future of online shopping experiences. Our team operates at the forefront of AI research and development, working on challenges that directly impact millions of customers worldwide. We push the boundaries of AI at both the foundational and application layers. As a Applied Scientist, you will have the chance to experiment with LLMs and deep learning techniques, apply your research to solve real-world problems at an unprecedented scale, and collaborate with experienced scientists to contribute to Amazon's scientific innovation. Join us in redefining the future of shopping. Your work will directly influence how customers interact with the world's largest online store. Key job responsibilities - Design and implement novel AI solutions for Amazon catalog of products - Develop and train state-of-the-art LLMs, Diffusion Models, and other Generative AI models - Build and deploy autonomous AI Agents in Amazon production ecosystem - Scale AI models to handle billions of diverse products across multiple languages and geographies - Conduct research in areas such as Autonomous AI Agents, Generative AI, Language Modeling, Multi-modality Computer Vision, Diffusion Models, Reinforcement Learning - Collaborate with cross-functional teams to integrate AI models into Amazon's production ecosystem - Contribute to the scientific community through publications and conference presentations

Sr. Applied Scientist, MAPLE

IN, KA, Bengaluru

Are you excited by the idea of developing personalized experiences for Amazon customers as they shop? Are you looking for new challenges and to solve hard science problems while applying state-of-the-art recommendation system modeling and GenAI techniques? Join us and you'll help millions of customers make informed purchase decisions while also advancing the state of Amazon's science by publishing research! Key job responsibilities - Participate in the design, development, evaluation, deployment and updating of data-driven models for shopping personalization. - Develop and test new signals for improving recommendation models - Use supervised and uplift learning algorithms to improve customer experience - Contribute to production code and science tooling - Design A/B tests and conduct statistical analysis on their results - Work with distributed machine learning and statistical algorithms to harness enormous volumes of data at scale to serve our customers - Work closely with internal stakeholders like the business teams, engineering teams and partner teams and align them with respect to your focus area - Present and publish science research internally and externally, contributing to Amazon's science community - Mentor junior engineers and scientists. About the team Our team's mission is to surface the right payments-related recommendations to customers at the right time, helping create a rewarding and successful shopping experience for Amazon's customers. Our team's culture is highly collaborative, with an emphasis on supporting each other and learning from one another. We dedicate time each week to focus on personal development and expanding our knowledge as a team. We also highly value having a big impact, both for Amazon's business and for our customers.