A quick guide to Amazon’s papers at ICML 2023

Across a range of topics, Amazon research blends the theoretical and the practical.

At this year's International Conference on Machine Learning (ICML), Amazon researchers have several papers on bandit problems and differential privacy, two topics of perennial interest. But they also explore a variety of other subjects, with a mix of theoretical analysis and practical application.

Adaptive neural computation

Adaptive neural computation means tailoring the number of computations performed by a neural model to the input, on the fly. At ICML, Amazon researchers apply this approach to automatic speech recognition.

Lookahead when it matters: Adaptive non-causal transformers for streaming neural transducers
Grant Strimel, Yi Xie, Brian King, Martin Radfar, Ariya Rastrow, Athanasios Mouchtaris

Bandit problems

Bandit problems — which take their name from slot machines, or one-armed bandits — involve the explore-exploit dilemma: an agent interacting with the environment must simultaneously maximize some reward and learn how to maximize that reward. They commonly arise in reinforcement learning.

Delay-adapted policy optimization and improved regret for adversarial MDP with delayed bandit feedback
Tal Lancewick, Aviv Rosenberg, Dmitry Sotnikov

Incentivizing exploration with linear contexts and combinatorial actions
Mark Sellke

Multi-task off-policy learning from bandit feedback
Joey Hong, Branislav Kveton, Manzil Zaheer, Sumeet Katariya, Mohammad Ghavamzadeh

Thompson sampling with diffusion generative prior
Yu-Guan Hsieh, Shiva Kasiviswanathan, Branislav Kveton, Patrick Bloebaum

Differential privacy

Differential privacy is a statistical guarantee of privacy that involves minimizing the probability that an attacker can determine whether a given data item is or is not included in a particular dataset.

DP bookkeeper.png
Forward pass and back-propagation for the lth linear layer in the method proposed in "Differentially private optimization on large model at small cost". Standard training is in black; differentially private training is added in red.

Differentially private optimization on large model at small cost
Zhiqi Bu, Yu-Xiang Wang, Sheng Zha, George Karypis

Fast private kernel density estimation via locality sensitive quantization
Tal Wagner, Yonatan Naamad, Nina Mishra

Distribution shift

Distribution shift is the problem that real-world data may turn out to have a different distribution than the datasets on which a machine learning model is trained. At ICML, Amazon researchers present a new dataset that can help combat this problem.

RLSbench: Domain adaptation under relaxed label shift
Saurabh Garg, Nick Erickson, James Sharpnack, Alex Smola, Sivaraman Balakrishnan, Zachary Lipton

RLS bench.png
Overview of setup considered in "RLSbench: Domain adaptation under relaxed label shift". In existing benchmarks, the label marginal p(y) doesn’t shift, but with RLSbench, p(y) can shift arbitrarily. The class conditionals p(x|y) shift in seemingly natural ways following popular benchmarks.

Ensemble methods

Ensemble methods combine the outputs of several different models to arrive at a final conclusion. At ICML, Amazon researchers prove new theoretical results about stacked-generalization ensembles, in which a higher-level model combines the outputs of lower-level models.

Theoretical guarantees of learning ensembling strategies with applications to time series forecasting
Hilaf Hasson, Danielle Maddix Robinson, Bernie Wang, Youngsuk Park, Gaurav Gupta

Explainable AI

Neural networks’ training methods mean that even their developers may have no idea what computations they’re performing. At ICML, Amazon researchers explore an explainable-AI method called sample-based explanation, which seeks to identify the training examples most responsible for a given model output.

Representer point selection for explaining regularized high-dimensional models
Che-Ping Tsai, Jiong Zhang, Hsiang-Fu Yu, Jyun-Yu Jiang, Eli Chien, Cho-Jui Hsieh, Pradeep Ravikumar

Extreme multilabel classification

Extreme multilabel classification is the problem of classifying data when the space of labels (classification categories) is enormous. At ICML, Amazon researchers explore the use of side information, such as label metadata or instance correlation signals, to improve classifiers’ performance.

PINA: Leveraging side information in eXtreme multi-label classification via predicted instance neighborhood aggregation
Eli Chien, Jiong Zhang, Cho-Jui Hsieh, Jyun-Yu Jiang, Wei-Cheng Chang, Olgica Milenkovic, Hsiang-Fu Yu

Graph neural networks

Graph neural networks produce vector representations of graph nodes that factor in information about the nodes themselves and their neighbors in the graph. At ICML, Amazon researchers investigate techniques for better initializing such networks.

On the initialization of graph neural networks
Jiahang Li, Yakun Song, Xiang Song, David Paul Wipf

Hypergraphs

A hypergraph is a generalization of the graph structure: where an edge in an ordinary graph links exactly two nodes, an edge in a hypergraph can link multiple nodes. At ICML, Amazon researchers present a novel approach to constructing hypergraph neural networks.

From hypergraph energy functions to hypergraph neural networks
Yuxin Wang, Quan Gan, Xipeng Qiu, Xuanjing Huang, David Paul Wipf

Hyperparameter optimization

Hyperparameters are attributes of neural-network models such as the learning rate and the number and width of network layers, and optimizing them is a standard step in model training. At ICML, Amazon researchers propose conformal quantile regression as an alternative to the standard Gaussian processes as a way to model functions during hyperparameter optimization.

Optimizing hyperparameters with conformal quantile regression
David Salinas, Jacek Golebiowski, Aaron Klein, Matthias Seeger, Cédric Archambeau

Function approximation.png
An illustration of the data-sampling method proposed in "Optimizing hyperparameters with conformal quantile regression", together with the acquisition functions for the new method (QR) and the Gaussian-process (GP) method.

Independence testing

Independence testing is a crucial step in many statistical analyses, aiming to determine whether two variables are independent or not. At ICML, Amazon researchers present an approach to independence testing that adjusts the number of samples collected to the difficulty of determining independence.

Sequential kernelized independence testing
Sasha Podkopaev, Patrick Bloebaum, Shiva Kasiviswanathan, Aaditya Ramdas

Model selection

Model selection, which determines the particular model architecture and hyperparameter settings for a given task, typically involves a validation set split off from the model’s training data. At ICML, Amazon researchers propose using synthetic data for validation in situations where training data is sparse.

Synthetic data for model selection
Alon Shoshan, Nadav Bhonker, Igor Kviatkovsky, Matan Fintz, Gérard Medioni

Physical models

Deep-learning methods have shown promise for scientific computing, where they can be used to predict solutions to partial differential equations (PDEs). At ICML, Amazon researchers investigate the problems of adding known physics constraints — such as adherence to conservation laws — to the predictive outputs of machine learning models when computing the solutions to PDEs.

Learning physical models that can respect conservation laws
Derek Hansen, Danielle Maddix Robinson, Shima Alizadeh, Gaurav Gupta, Michael Mahoney

Tabular data

Extending the power of Transformer models to tabular data has been a burgeoning area of research in recent years. At ICML, Amazon researchers show how to improve the generalizability of models trained using tabular data.

XTab: Cross-table pretraining for tabular transformers
Bingzhao Zhu, Xingjian Shi, Nick Erickson, Mu Li, George Karypis, Mahsa Shoaran

Research areas

Related content

IN, KA, Bengaluru
Do you want to join an innovative team of scientists who use machine learning and statistical techniques to create state-of-the-art solutions for providing better value to Amazon’s customers? Do you want to build and deploy advanced algorithmic systems that help optimize millions of transactions every day? Are you excited by the prospect of analyzing and modeling terabytes of data to solve real world problems? Do you like to own end-to-end business problems/metrics and directly impact the profitability of the company? Do you like to innovate and simplify? If yes, then you may be a great fit to join the Machine Learning and Data Sciences team for India Consumer Businesses. If you have an entrepreneurial spirit, know how to deliver, love to work with data, are deeply technical, highly innovative and long for the opportunity to build solutions to challenging problems that directly impact the company's bottom-line, we want to talk to you. Major responsibilities - Use machine learning and analytical techniques to create scalable solutions for business problems - Analyze and extract relevant information from large amounts of Amazon’s historical business data to help automate and optimize key processes - Design, development, evaluate and deploy innovative and highly scalable models for predictive learning - Research and implement novel machine learning and statistical approaches - Work closely with software engineering teams to drive real-time model implementations and new feature creations - Work closely with business owners and operations staff to optimize various business operations - Establish scalable, efficient, automated processes for large scale data analyses, model development, model validation and model implementation - Mentor other scientists and engineers in the use of ML techniques
US, NY, New York
MULTIPLE POSITIONS AVAILABLE Employer: Amazon Development Center U.S., Inc. Offered Position: Applied Scientist III - AMZ007408 Job Location: New York, NY Position Responsibilities: Participate in the design, development, evaluation, deployment, and updating of formal reasoning systems for security, privacy, and data protection applications. Drive technical and scientific innovation in security automation, data protection, and privacy-preserving technologies, with a focus on developing scalable solutions for cloud environments. Develop and/or apply formal verification techniques and automated theorem proving methods for different applications in cloud security and privacy. Collaborate with internal and external users to understand requirements and enhance formal verification and automated reasoning capabilities. Lead research and development efforts in AI security, specifically evaluate emerging threats and opportunities, including securing Generative AI systems and designing robust safeguards. Proactively identify and explore new opportunities for deploying and leveraging formal reasoning solutions across various domains.
GB, London
The Agentic Automated Reasoning Group is building the next generation of software verification tools combining advances in artificial intelligence, the computational capacity of the cloud, and our deep expertise in the domain. Join us if you want to be a part of this transformational endeavor. The Strata team (https://github.com/strata-org) is seeking an applied scientist with broad interest and expertise in model checking, interactive theorem proving, programming language semantics, and generative AI. You will combine your expertise with that of your coworkers to build new tools that solve code analysis problems previously considered beyond reach. Our application areas span all the way from Infrastructure as Code to high-performance cryptography written in assembly code, while our methods span from interactive theorem proving to automated test generation. Each day, hundreds of thousands of developers make billions of transactions worldwide on AWS. They harness the power of the cloud to enable innovative applications, websites, and businesses. Using automated reasoning technology and mathematical proofs, AWS allows customers to answer questions about security, availability, durability, and functional correctness. We call this provable security, absolute assurance in security of the cloud and in the cloud. https://aws.amazon.com/security/provable-security/ Key job responsibilities Work with customer teams to understand the nature of their software and the properties they need to establish of it. Identify tools and methods capable of addressing the verification needs of customers, including any novel analysis capabilities required. Use techniques spanning property-based testing to model checkers, and interactive theorem provers to establish program properties. Explore generative AI techniques to help customers formalize their requirements, find revealing tests, generate required boiler plate for testing and model checking, and find and repair program proofs. About the team The Agentic Automated Reasoning Group at AWS develops and applies state of the art formal methods and automated reasoning techniques to ensure the security, reliability, and correctness of AWS services and customer applications, with a strong focus on AI based agents. Our work innovates tools and services to perform verification at scale and apply them to build safe and secure systems at AWS. We are also pioneering the use of formal verification and automated reasoning to develop agentic systems, ensuring AI agents operate within defined safety boundaries.
US, CA, San Francisco
Join the next revolution in robotics at Amazon's Frontier AI & Robotics team, where you'll work alongside world-renowned AI pioneers to lead key initiatives in robotic intelligence. As a Member of Technical Staff, you'll spearhead the development of breakthrough foundation models that enable robots to perceive, understand, and interact with the world in unprecedented ways. You'll drive technical excellence in areas such as perception, manipulation, science understanding, sim2real transfer, multi-modal foundation models, and multi-task learning, designing novel algorithms that bridge the gap between state-of-the-art research and real-world deployment at Amazon scale. In this role, you'll combine hands-on technical work with scientific leadership, ensuring your team delivers robust solutions for dynamic real-world environments. You'll leverage Amazon's vast computational resources to tackle ambitious problems in areas like very large multi-modal robotic foundation models and efficient, promptable model architectures that can scale across diverse robotic applications. Key job responsibilities - Lead technical initiatives in robotics foundation models, driving breakthrough approaches through hands-on research and development in areas like open-vocabulary panoptic scene understanding, scaling up multi-modal LLMs, sim2real/real2sim techniques, end-to-end vision-language-action models, efficient model inference, video tokenization - Design and implement novel deep learning architectures that push the boundaries of what robots can understand and accomplish - Guide technical direction for specific research initiatives, ensuring robust performance in production environments - Mentor and support fellow scientists while maintaining strong individual technical contributions - Collaborate with engineering teams to optimize and scale models for real-world applications - Influence technical decisions and implementation strategies within your area of focus A day in the life - Develop and implement novel foundation model architectures, working hands-on with our extensive compute infrastructure - Guide and support fellow scientists in solving complex technical challenges, from sim2real transfer to efficient multi-task learning - Lead focused technical initiatives from conception through deployment, ensuring successful integration with production systems - Drive technical discussions within your team and with key stakeholders - Conduct experiments and prototype new ideas using our massive compute cluster - Mentor team members while maintaining significant hands-on contribution to technical solutions Amazon offers a full range of benefits that support you and eligible family members, including domestic partners and their children. Benefits can vary by location, the number of regularly scheduled hours you work, length of employment, and job status such as seasonal or temporary employment. The benefits that generally apply to regular, full-time employees include: 1. Medical, Dental, and Vision Coverage 2. Maternity and Parental Leave Options 3. Paid Time Off (PTO) 4. 401(k) Plan If you are not sure that every qualification on the list above describes you exactly, we'd still love to hear from you! At Amazon, we value people with unique backgrounds, experiences, and skillsets. If you’re passionate about this role and want to make an impact on a global scale, please apply! About the team At Frontier AI & Robotics, we're not just advancing robotics – we're reimagining it from the ground up. Our team is building the future of intelligent robotics through ground breaking foundation models and end-to-end learned systems. We tackle some of the most challenging problems in AI and robotics, from developing sophisticated perception systems to creating adaptive manipulation strategies that work in complex, real-world scenarios. What sets us apart is our unique combination of ambitious research vision and practical impact. We leverage Amazon's massive computational infrastructure and rich real-world datasets to train and deploy state-of-the-art foundation models. Our work spans the full spectrum of robotics intelligence – from multimodal perception using images, videos, and sensor data, to sophisticated manipulation strategies that can handle diverse real-world scenarios. We're building systems that don't just work in the lab, but scale to meet the demands of Amazon's global operations. Join us if you're excited about pushing the boundaries of what's possible in robotics, working with world-class researchers, and seeing your innovations deployed at unprecedented scale.
US, WA, Seattle
Innovators wanted! Are you an entrepreneur? A builder? A dreamer? This role is part of an Amazon Special Projects team that takes the company’s Think Big leadership principle to the limits. If you’re interested in innovating at scale to address big challenges in the world, this is the team for you. As an Applied Scientist on our team, you will focus on building state-of-the-art ML models for biology. Our team rewards curiosity while maintaining a laser-focus in bringing products to market. Competitive candidates are responsive, flexible, and able to succeed within an open, collaborative, entrepreneurial, startup-like environment. At the forefront of both academic and applied research in this product area, you have the opportunity to work together with a diverse and talented team of scientists, engineers, and product managers and collaborate with other teams. Key job responsibilities - Build, adapt and evaluate ML models for life sciences applications - Collaborate with a cross-functional team of ML scientists, biologists, software engineers and product managers
US, WA, Seattle
Join us at the forefront of Amazon's sustainability initiatives to work on environmental and social advancements that support Amazon's long-term worldwide sustainability strategy. At Amazon, we're working to be the most customer-centric company on earth. To get there, we need exceptionally talented, bright, and driven people who are passionate about making a meaningful impact on communities and the environment while helping shape the future of sustainable business practices. Sustainability Science and Innovation (SSI) is a multi-disciplinary team within WW Sustainability combining science, analytics, economics, statistics, machine learning, product development, and engineering expertise. We use data across the sustainability imperatives (carbon, water, waste, biodiversity, environmental risk and more) and these skills and capabilities to identify, develop, experiment, and scale the scientific solutions and innovations necessary for Amazon, customers and partners to help them solve their hardest unmet and evolving sustainability needs and goals. The Worldwide Sustainability (WWS) organization is seeking an exceptional scientific leader to join Amazon's Sustainability Science and Innovation team as a Researcher Scientist for Materials Chemistry Innovation. This role focuses on hands-on experimental research in materials chemistry to accelerate the discovery and validation of sustainable materials through systematic synthesis, characterization, and performance testing. You will lead the design and execution of experimental research campaigns targeting catalysts, functional materials, and sustainability-relevant chemistries across multivariate parameter spaces. You will establish scientific strategy and technical roadmaps for materials discovery while leading research initiatives that tackle complex sustainability challenges in critical industrial sectors. This position requires driving breakthrough solutions in materials synthesis and characterization through internal capabilities and strategic partnerships with universities, industry scientists, and government laboratories. You will mentor junior scientists and engineers while collaborating across Amazon's Innovation Lab Network to translate research into scalable solutions. Your leadership will be essential in developing early-stage, cost-effective materials that address significant technical and economic challenges fundamental to Amazon's operations, requiring you to navigate complex trade-offs between immediate deliverables and long-term environmental impact. You will also shape how emerging automation and AI tools are applied to accelerate materials discovery workflows. The ideal candidate demonstrates extensive experience in materials synthesis, advanced characterization techniques, and systematic experimental design for performance validations. You must possess proven ability to lead cross-functional teams, establish research priorities, and drive scientific innovation from concept to implementation. Deep technical expertise in materials testing methods, combined with strategic vision for translating research into practical applications is essential. Experience with high-throughput and combinatorial experimental approaches to efficiently explore large design spaces is highly valued. Your work will establish new paradigms in sustainable materials discovery through rigorous experimental research and performance testing, directly contributing to Amazon's sustainability goals while creating scalable solutions that extend beyond the company's immediate operations. Key job responsibilities - Develop scientific models that help solve complex and ambiguous sustainability problems, and extract strategic learnings from large datasets. - Work closely with applied scientists and software engineers to implement your scientific models. - Support early-stage strategic sustainability initiatives and effectively learn from, collaborate with, and influence stakeholders to scale-up high-value initiatives. - Support research and development of cross-cutting technologies for industrial decarbonization, including building the data foundation and analytics for new AI models. - Drive innovation in key focus areas including packaging materials, building materials, and alternative fuels. About the team Diverse Experiences: World Wide Sustainability (WWS) values diverse experiences. Even if you do not meet all of the qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying. Inclusive Team Culture: It’s in our nature to learn and be curious. Our employee-led affinity groups foster a culture of inclusion that empower us to be proud of our differences. Ongoing events and learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon conferences, inspire us to never stop embracing our uniqueness. Mentorship & Career Growth: We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.
US, WI, Madison
As a Data Scientist on the Shopbop/Zappos Catalog Tech team, you will design and implement scientific approaches to revolutionize how we manage and enhance our product catalog data for our world-class selection of Shoes, Kids, and Active wear. You will work with Zappos' Senior leadership team to solve complex data challenges through advanced analytics and machine learning - creating innovative solutions and influencing product decisions through data-driven insights. You will lead critical initiatives to reduce catalog errors, accelerate product data capture, and develop state-of-the-art image classification systems for fashion features. You will partner daily with engineering teams and business stakeholders to provide expert guidance on model selection and implementation. As a member of the Zappos technical staff, you will leverage machine learning technologies and have access to industry leaders in AI/ML and E-Commerce to help grow your expertise. You will also routinely collaborate with data science teams across our sister companies at Amazon.com and Shopbop.com. You will push the boundaries of what's possible with applied machine learning and bring innovative solutions to bear for customers (including computer vision, NLP, and advanced ML models). You will think big about how data science can transform our catalog operations and be persistent in delivering robust, scalable solutions. Key job responsibilities Design and implement machine learning approaches to improve catalog data quality. Develop and validate scientific methodologies for automated data capture and classification. Partner with engineering teams to integrate ML models into production systems. Create and present analysis that drives decision-making at the senior leadership level. A day in the life You start the day reviewing model performance metrics, noting some drift in the image classification system that needs investigation. You spend the morning developing a new approach to reduce product attribute errors using recent advances in LLMs. In the afternoon, you meet with engineering teams to advise on model architecture for a new feature, and wrap up by analyzing the results of your latest A/B test on data capture efficiency improvements. About the team Zappos/Shopbop Catalog Tech team owns the software that drives our photostudio, product cataloging, and integration to Amazon's marketplace. We use Amazon's Leadership Principals and Engineering Expertise but have our own fun vibe. We are located in Madison WI, and Las Vegas NV.
US, WA, Seattle
The Alexa Connections team is building the worlds most trusted AI assistant, that helps get things done and creates moments of joy. We build LLM-powered, next-generation communication features that help customers connect within the household with features like announcements and drop-In, and with people they care about with features like calling, emailing and texting. Using GenAI we proactively personalize every single connection moment whether its connecting with grandparents or communicating with kids teachers, and you can do it all hands free with Alexa! We are hiring a mature Sr. Manager of Applied Science who is an entrepreneurial big thinker that invents and then biases towards action to solve highest priority customer problems. Our portfolio initiative range from highly confidential innovations to data driven quality problems that needs sciences solutions. You will be leading a team of scientists and partnering with product, engineering and design organizations to building on the latest AGI/LLM systems. If you are holding out for an opportunity to: - Join a leadership group that moves fast towards big audacious goals - Lead an organization to balance solving towards problems while working towards the long term vision - Be surrounded by super passionate mission driven colleagues who will challenge you to grow every day - Solve difficult challenges using your expertise in science to invest practical solutions - Create applications at a massive scale used by millions of people - Work with AGI/LLM systems to deliver real experiences, not just research And you are experienced with… - Driving applied science projects end-to-end from ideation, analysis, prototyping, development, metrics, and monitoring - Conducting deep analyses on massive user and contextual data sets - Proposing viable modeling ideas to advance optimization or efficiency, with supporting argument, data, or, preferably, preliminary results - Designing, developing, and maintaining scalable ML models and LLM's with automated training, validation, monitoring and reporting - Staying familiar with the field and apply state-of-the-art ML techniques to NLP and related optimization problems - Publishing peer-reviewed scientific papers in top journals and conferences And you constantly look for opportunities to… - Innovate, simplify, reduce waste, and increase efficiency - Use data to make decisions and validate assumptions - Automate processes otherwise performed by humans - Learn from others and help those around you grow ...then lets chat! Key job responsibilities As a senior leader, you will play a critical role in elevating the team’s scientific and technical rigor, identifying and implementing best-in-class algorithms, methodologies, and infrastructure that enable rapid experimentation and scaling. You will establish a long-term vision for continued scientific innovation, setting strategic goals to future-proof the organization’s technical stacks and ML/LLM frameworks to support new and emerging business objectives. Additionally, you will grow talents, fostering a culture of excellence and continuous learning to enhance the organization’s ability to solve complex problems.
US, TX, Dallas
Amazon Web Services (AWS) Applied AI Solutions (AAIS) is on a mission to make AI real for enterprises. We build and deploy production AI solutions that drive measurable business outcomes at scale, bringing together applied scientists, AI architects, business development professionals, and GTM specialists to help customers move from AI experimentation to production impact. Within AAIS, the GTM Acceleration team activates the field, measures impact, and scales what works. We are the connective tissue between AAIS product and science teams and the worldwide field organization, ensuring our AI solutions reach customers effectively, that we quantify the value we deliver, and that we build repeatable motions that scale globally. We are looking for an Applied Scientist who will serve as a force multiplier across our customer engagement teams, building the analytical foundations, predictive models, and reusable tooling that power our go-to-market strategy. You will work at the intersection of data science, machine learning, and business strategy, building models that quantify our value proposition, and creating scalable analytical assets that accelerate every engagement. This is a highly visible, high-impact role where your work directly influences how we demonstrate and measure the value of AWS AI solutions for enterprise customers. You will operate with significant autonomy, owning the scientific direction of your projects while collaborating with software engineers, product managers, and business stakeholders. You will identify the right methodology for each problem, whether that is a classical statistical approach, a modern deep learning technique, or a novel combination, and communicate your findings clearly to both technical and non-technical audiences. This role spans Connect Customer initiatives and across the Applied AI solution portfolio, offering the opportunity to pioneer data science approaches that scale intelligent analytics worldwide. If you thrive at the intersection of rigorous science and customer-facing impact and are energized by translating complex model outputs into business decisions, we want to talk to you. Key job responsibilities Design, develop, and deploy statistical models and machine learning pipelines to drive product improvements, business decisions, and customer outcomes Work directly with customers during production pilots to build and deploy AI solutions that demonstrate measurable business value Design and execute A/B experiments and causal inference analyses to measure the impact of new features and model changes Build ROI models, business case tools, and forecasting systems for demand prediction, capacity planning, workforce optimization, and value quantification Apply NLP and generative AI techniques to extract insights from structured and unstructured data at scale, and partner with software engineers to productionize models with reliability, monitoring, and operational excellence Build and own customer analytics capabilities including segmentation (by size tier, AI adoption, product penetration, entitlement), usage trend analysis, propensity modeling, and foundational datasets combining service usage with sales data Create self-service analytics platforms and automated insight delivery mechanisms that enable leadership to pull strategic intelligence on demand Enable field teams with reusable analytical assets, diagnostic notebooks, benchmarking studies, and scalable tooling that accelerate customer engagements Own success metrics and create mechanisms to measure model performance, adoption, and business impact across customer cohorts Define strategic frameworks and GTM recommendations by segment, translating data patterns and market signals into actionable go-to-market motions and investment priorities Communicate findings and technical trade-offs to senior leadership and customer executives through written documents (6-pagers, science reviews) and presentations, operating as a shared resource across 2-3 teams simultaneously About the team Diverse Experiences AWS values diverse experiences. Even if you do not meet all of the preferred qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying. Why AWS? Amazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating — that’s why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses. Inclusive Team Culture AWS values curiosity and connection. Our employee-led and company-sponsored affinity groups promote inclusion and empower our people to take pride in what makes us unique. Our inclusion events foster stronger, more collaborative teams. Our continual innovation is fueled by the bold ideas, fresh perspectives, and passionate voices our teams bring to everything we do. Mentorship & Career Growth We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional. Work/Life Balance We value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why we strive for flexibility as part of our working culture. When we feel supported in the workplace and at home, there’s nothing we can’t achieve.
US, WA, Seattle
The AWS Compliance & Security Assurance Engineering team builds tools and services that scale AWS's ability to exceed security and compliance expectations for our regulators, auditors, and customers globally. We create efficiencies for our teams while maintaining transparency for our customers. As a Data Scientist on our data team, you'll have a unique opportunity to build solutions that scale security assurance capabilities through AI-driven approaches. You'll leverage advanced analytics, machine learning techniques, and state-of-the-art AI technologies on top of highly complex datasets to transform our security assurance operations. You'll also help shape our data science roadmap, fostering data-driven decision-making and delivering significant business impact through innovative methodologies. The ideal candidate brings a strong background in automation by leveraging GenAI Large Language Models (LLMs), combined with experience analyzing complex datasets and a proven ability to transform business needs into deliverables. If you're passionate about applying AI to security challenges in a dynamic environment that offers startup-like autonomy with enterprise impact, we want to hear from you. Key job responsibilities * Analyze and extract hidden insights from complex, large-scale datasets using advanced statistical and AI techniques, using hands-on expertise in data wrangling and manipulation. * Identify high-impact business improvement opportunities, lead the design and execution of data science initiatives to address them. * Apply statistical and ML knowledge to specific business problems and data. * Leverage Vision Language Models (VLMs) and Large Language Models (LLMs) to analyze document/image structure, perform content understanding and metadata extraction. * Rapidly prototype and test AI solutions, while iterating quickly based on data and feedback. * Collaborate cross-functionally to deeply understand business requirements, customer needs, and technical constraints; transforming discovered information into data-driven solutions and actionable recommendations. * Communicate complex technical concepts to technical and non-technical stakeholders; present compelling insights and evidence-based recommendations to leadership. * Stay up-to-date on the latest advancements in AI and identify opportunities to apply emerging techniques to the space. * Collaborate with Business Intelligence, Data Engineers and SDEs to drive the collection of new data and refinement of existing data sources to continually improve data quality. * Actively participate in team design reviews, data modeling discussions, brainstorming activities while mentoring colleagues in state-of-the-art AI tools and methodologies. About the team Diverse Experiences Amazon Security values diverse experiences. Even if you do not meet all of the qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying. Why Amazon Security? At Amazon, security is central to maintaining customer trust and delivering delightful customer experiences. Our organization is responsible for creating and maintaining a high bar for security across all of Amazon’s products and services. We offer talented security professionals the chance to accelerate their careers with opportunities to build experience in a wide variety of areas including cloud, devices, retail, entertainment, healthcare, operations, and physical stores. Inclusive Team Culture In Amazon Security, it’s in our nature to learn and be curious. Ongoing DEI events and learning experiences inspire us to continue learning and to embrace our uniqueness. Addressing the toughest security challenges requires that we seek out and celebrate a diversity of ideas, perspectives, and voices. Training & Career Growth We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, training, and other career-advancing resources here to help you develop into a better-rounded professional. Work/Life Balance We value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why flexible work hours and arrangements are part of our culture. When we feel supported in the workplace and at home, there’s nothing we can’t achieve.