Are We Strategically Naive or Guided by Trust and Trustworthiness in Cheap-Talk Communication.png
Are We Strategically Naive or Guided by Trust and Trustworthiness in Cheap-Talk Communication?” was published in Management Science — the flagship journal of the Institute for Operations Research and the Management Sciences (INFORMS) in April 2021.
Glynis Condon

3 questions with Özalp Özer: How to build trust in business relationships

Özer’s paper published in INFORMS’ Management Science 2021 explores the dynamics behind “cheap-talk” communications.

Trust and trustworthiness are important in both our personal and business relationships. How then can we build environments that foster increased trust, trustworthiness and cooperation?

In the first edition of a new series that focuses on research papers published by scientists within the Amazon Supply Chain Optimization Technologies (SCOT) organization, we interview Özalp Özer, coauthor of “Are We Strategically Naive or Guided by Trust and Trustworthiness in Cheap-Talk Communication?”. The paper was published in Management Science — the flagship journal of the Institute for Operations Research and the Management Sciences (INFORMS) in April 2021.

Özalp Özer profile image
Özalp Özer is a senior principal scientist at Amazon, and George and Fonsa Brody Professor of Management Science at The University of Texas at Dallas.

Özer is a senior principal scientist at Amazon, and George and Fonsa Brody Professor of Management Science at The University of Texas at Dallas (UTD). He earned a PhD in operations research from Columbia University, before going on to serve on the faculty at Stanford and Columbia. Özer has published extensively on a diverse range of topics, from supply chain management, capacity and inventory management to pricing and revenue management.

Özer says that a guiding principle behind his research is to focus on solving problems that have a real-world impact at scale. At Stanford and then UTD, Özer found himself drawn to the field of behavioral and experimental economics — particularly the field of game theory and understanding how to model actions and emotions in scenarios involving multiple decision makers in dynamic environments.

Driven by his interest in tackling real-world business problems, Özer remained engaged with industry during his tenure as an academic. While working on a project focused on designing effective procurement contracts, he observed the important role that trust played in establishing and fostering business relationships.

In many cases, the interests of the parties engaging in a negotiation are not aligned. To give one example, suppliers can use product forecast information from a buyer to make capacity, inventory and other manufacturing-related decisions. However, buyers might often provide suppliers with overly optimistic forecasts to ensure an abundant supply. If the demand for the product turns out to be lower than anticipated, the supplier bears the excess investment risk.

Özer says that this scenario represents an example of “cheap talk communications.” He outlines three characteristics that are common to all cheap talk communications: they are costless (they are devoid of monetary penalties), they are non-binding (a buyer can provide a forecast without committing to it), and they are non-verifiable (no forecast can be completely accurate in the light of market uncertainty). To complicate matters, the objective functions that each party is trying to maximize are at odds (or not perfectly aligned) with each other.

Standard game theory suggests that each party in a business transaction will move toward an equilibrium that maximizes their own payoff. In a cheap-talk setting, where the information is costless, non-binding and non-verifiable, the theory suggests that each party will disregard the information supplied by the other.

However, Özer finds that people involved in business (as well as personal) transactions frequently factor into their decision-making information supplied by the other party, even when their incentives are not perfectly aligned and even when the information or recommendation may be perceived as “cheap”. They do this by taking the business context and the related relationship into account. Doing so results in higher returns for both parties involved. For example, third-party sellers are more likely to act on price reduction or replenishment recommendations from Amazon, if they find that these recommendations have previously resulted in an uptick in sales and profits.

Ozer says that “cheap talk” communications have the unfortunate emphasis on being “cheap” and less emphasis on how they are informative and can align incentives. In a series of publications, Ozer shows why, when, and how such communications and recommendations turn out to be informative, and how they help align business objectives, resulting in both parties making better decisions.   

In this interview, Özer talks about findings from the recently published INFORMS paper and discusses the implications of these findings for companies like Amazon.

Q. What are the two models that can be used to explain how cheap talk communications work between decision makers?

As our paper suggests, there are two contrasting economic theories that can be used to analyze cheap-talk communications.

The trust-embedded model — which takes a more optimistic view of humanity — suggests that decision makers are motivated by non-monetary motives to be trusting and trustworthy, besides the monetary incentives such as maximizing cash flow.   

Here, we define trust as instances of decision makers behaving voluntarily in a way that put themselves in vulnerable engagement due to the uncertain behavior of the other party (the trustee), based upon the expectation of a positive outcome from that engagement. Trustworthiness flips the perspective to that of the trustee. We define trustworthiness as an instance of a decision maker behaving voluntarily in a way not to take advantage of the trustor’s vulnerable position – even when faced with a self-serving decision that conflicts with the trustor’s objectives.

Humans use non-Bayesian, trust-based belief systems to update their rules governing interactions with other parties. In short, people involved in a business transaction are willing to be vulnerable and take risk.
Özalp Özer

The trust-embedded model suggests that when engaging with others, decision makers are averse to manipulating information in economic interactions. They incur disutility from lying. As a result, they assess the trustworthiness of the counterparty, and they form a trust factor towards them. This trust factor governs how decision makers interpret and use the information they receive from others.

In other words, humans use non-Bayesian, trust-based belief systems to update their rules governing interactions with other parties. In short, people involved in a business transaction are willing to be vulnerable and take risk. Because they assess — even sometimes incorrectly — that doing so yields positive outcomes, they engage in and cultivate behaviors conducive to enabling these outcomes.

The trust embedded model suggests that individuals are guided by more than self-interest or pecuniary motives as they engage in transactions. For example, senders of information are guided by factors such as fairness and tenets that are central to their company. As a result, they share more information and resources than strictly necessary.

In contrast to the trust-embedded model, the level-k model — the second model discussed in the paper — suggests that decision makers are limited in their ability to think strategically. Receivers of information cannot anticipate the extent to which the sender might have distorted the message. On the flip-side, senders cannot account for just how much receivers might discount their message. Consequently, senders share more than necessary, because they take a dim view of the receiver’s ability to discount their message.

It’s important to note that even the level-k model can sometimes explain why senders and receivers tend to overshare information in a cheap-talk setting, which contrasts with the outcome standard game theory models would predict. It’s just that their motivations are different – with the level-k model, oversharing is driven by a limited ability to think strategically, rather than by the willingness to be trusting and trustworthy.

Overall, our paper that analyzed existing cheap-talk experiment data, found more support for the trust-embedded model, suggesting that individuals are also driven by non-monetary incentives when conducting transactions.

Q. Why do you think that trust-embedded models do a better job of explaining cheap-talk communications? What are the implications for organizations engaging in relationships with businesses and partners?

During the internet age, we’ve seen e-commerce, hospitality and ride-sharing companies grow precisely because they’ve been able to create policies and tools that encourage trust.
Özalp Özer

The answer to your first question is relatively simple — human beings are far more sophisticated than the level-k model gives them credit for. For example, there are many sellers on Amazon’s website who are proficient in using a variety of tools they have developed to make decisions related to pricing and inventory.

As a result, if we want the tools we provide to earn sellers’ trust, we need to think of the system more holistically at both an architecture and policy level to truly understand what builds trust and what is a trust-buster.

During the internet age, we’ve seen e-commerce, hospitality and ride-sharing companies grow precisely because they’ve been able to create policies and tools that encourage trust. Product reviews, the ability to get refunds for a vacation rental because hosts might not have lived up to their promises, or the price for a ride being set in advance — these are some of the mechanisms that let you buy a product or rent a home from people you don’t know.

Q. How are the findings in your paper applicable to your work at Amazon?

We are leveraging the insights from this stream of research as well as others to augment our understanding of seller trust, particularly in relation to how sellers interact with our inventory management tools, and how fidelity of recommendations impact sellers’ trust.

There is no interaction at Amazon that I can think of that doesn’t have an element of trust.
Özalp Özer

We are designing our related processes to reduce barriers for trusting and trustworthy engagements among the participants of our stores; for example, by making specific investments to support seller growth in areas that benefit sellers and customers the most; by reducing perceived vulnerabilities in carrying excess inventory; by looking into ways in which we stabilize our policies; by creating visibility to the reasons for our recommendations; by looking into ways in which we can build interactive communication channels among participants in our stores; and by building reputation and feedback systems that foster trusting and trustworthy engagements and on and on.

Using large-scale data, scientific methods like causal machine learning to optimization, as well as continual engagement with selling partners and customers, we aim to identify at the extent to which sellers trust evolves — so we can identify and invest in processes that foster trust and as a result growth and economic prosperity.  

There is no interaction at Amazon that I can think of that doesn’t have an element of trust. Jeff Bezos has said, “You can’t ask for trust, you just have to do it the hard way, one step at a time.” In my time at the company, I have been struck by the tireless efforts of so many people to gain seller and customer trust. At Amazon, it is just part of everything we do.

Related content

ES, B, Barcelona
Are you interested in defining the science strategy that enables Amazon to market to millions of customers based on their lifecycle needs rather than one-size-fits-all campaigns? We are seeking a Applied Scientist to lead the science strategy for our Lifecycle Marketing Experimentation roadmap within the PRIMAS (Prime & Marketing analytics and science) team. The position is open to candidates in Amsterdam and Barcelona. In this role, you will own the end-to-end science approach that enables EU marketing to shift from broad, generic campaigns to targeted, cohort-based marketing that changes customer behavior. This is a high-ambiguity, high-impact role where you will define what problems are worth solving, build the science foundation from scratch, and influence senior business leaders on marketing strategy. You will work directly with Business Directors and channel leaders to solve critical business problems: how do we win back customers lost to competitors, convert Young Adults to Prime, and optimize marketing spend by de-averaging across customer cohorts. Key job responsibilities Science Strategy & Leadership: 1. Own the end-to-end science strategy for lifecycle marketing, defining the roadmap across audience targeting, behavioral modeling, and measurement 2. Navigate high ambiguity in defining customer journey frameworks and behavioral models – our most challenging science problem with no established playbook 3. Lead strategic discussions with business leaders translating business needs into science solutions and building trust across business and tech partners 4. Mentor and guide a team of 2-3 scientists and BIEs on technical execution while contributing hands-on to the hardest problems Advanced Customer Behavior Modeling: 1. Build sophisticated propensity models identifying customer cohorts based on lifecycle stage and complex behavioral patterns (e.g., Bargain hunters, Young adults Prime prospects) 2. Define customer journey frameworks using advanced techniques (Hidden Markov Models, sequential decision-making) to model how customers transition across lifecycle stages 3. Identify which customer behaviors and triggers drive lifecycle progression and what messaging/levers are most effective for each cohort 4. Integrate 1P behavioral data with 2P survey insights to create rich, actionable audience definitions Measurement & Cross-Workstream Integration: 1. Partner with measurement scientist to design experiments (RCTs) that isolate audience targeting effects from creative effects 2. Ensure audience definitions, journey models, and measurement frameworks work coherently across Meta, LiveRamp, and owned channels 3. Establish feedback loops connecting measurement insights back to model improvements About the team The PRIMAS (Prime & Marketing Analytics and Science) is the team that support the science & analytics needs of the EU Prime and Marketing organization, an org that supports the Prime and Marketing programs in European marketplaces and comprises 250-300 employees. The PRIMAS team, is part of a larger tech tech team of 100+ people called WIMSI (WW Integrated Marketing Systems and Intelligence). WIMSI core mission is to accelerate marketing technology capabilities that enable de-averaged customer experiences across the marketing funnel: awareness, consideration, and conversion.
IN, KA, Bengaluru
Do you want to join an innovative team of scientists who use machine learning and statistical techniques to create state-of-the-art solutions for providing better value to Amazon’s customers? Do you want to build and deploy advanced algorithmic systems that help optimize millions of transactions every day? Are you excited by the prospect of analyzing and modeling terabytes of data to solve real world problems? Do you like to own end-to-end business problems/metrics and directly impact the profitability of the company? Do you like to innovate and simplify? If yes, then you may be a great fit to join the Machine Learning and Data Sciences team for India Consumer Businesses. If you have an entrepreneurial spirit, know how to deliver, love to work with data, are deeply technical, highly innovative and long for the opportunity to build solutions to challenging problems that directly impact the company's bottom-line, we want to talk to you. Major responsibilities - Use machine learning and analytical techniques to create scalable solutions for business problems - Analyze and extract relevant information from large amounts of Amazon’s historical business data to help automate and optimize key processes - Design, development, evaluate and deploy innovative and highly scalable models for predictive learning - Research and implement novel machine learning and statistical approaches - Work closely with software engineering teams to drive real-time model implementations and new feature creations - Work closely with business owners and operations staff to optimize various business operations - Establish scalable, efficient, automated processes for large scale data analyses, model development, model validation and model implementation - Mentor other scientists and engineers in the use of ML techniques
ES, M, Madrid
At Amazon, we are committed to being the Earth's most customer-centric company. The European International Technology group (EU INTech) owns the enhancement and delivery of Amazon's engineering to all the varied customers and cultures of the world. We do this through a combination of partnerships with other Amazon technical teams and our own innovative new projects. You will be joining the Tamale team to work on Haul. As part of EU INTech and Haul, Tamale strives to create a discovery-driven shopping experience using challenging machine learning and ranking solutions. You will be exposed to large-scale recommendation systems, multi-objective optimization, and state-of-the-art deep learning architectures, and you'll be part of a key effort to improve our customers' browsing experience by building next-generation ranking models for Amazon Haul's endless scroll experience. We are looking for a passionate, talented, and inventive Scientist with a strong machine learning background to help build industry-leading ranking solutions. We strongly value your hard work and obsession to solve complex problems on behalf of Amazon customers. Key job responsibilities We look for applied scientists who possess a wide variety of skills. As the successful applicant for this role, you will work closely with your business partners to identify opportunities for innovation. You will apply machine learning solutions to optimize multi-objective ranking, improve discovery engagement through contextual signals, and scale ranking systems across multiple marketplaces. You will work with business leaders, scientists, and product managers to translate business and functional requirements into concrete deliverables, including the design, development, testing, and deployment of highly scalable distributed ranking services. You will be part of a team of scientists and engineers working on solving ranking and personalization challenges at scale. You will be able to influence the scientific roadmap of the team, setting the standards for scientific excellence. You will be working with state-of-the-art architectures and real-time feature serving systems. Your work will improve the experience of millions of daily customers using Amazon Haul worldwide. You will have the chance to have great customer impact and continue growing in one of the most innovative companies in the world. You will learn a huge amount - and have a lot of fun - in the process!
IN, HR, Gurugram
Do you want to join an innovative team of scientists who use machine learning and statistical techniques to create state-of-the-art solutions for providing better value to Amazon’s customers? Do you want to build and deploy advanced ML systems that help optimize millions of transactions every day? Are you excited by the prospect of analyzing and modeling terabytes of data to solve real-world problems? Do you like to own end-to-end business problems/metrics and directly impact the profitability of the company? Do you like to innovate and simplify? If yes, then you may be a great fit to join the Machine Learning team for International Emerging Stores (IES). Machine Learning, Big Data and related quantitative sciences have been strategic to Amazon from the early years. Amazon has been a pioneer in areas such as recommendation engines, ecommerce fraud detection and large-scale optimization of fulfillment center operations. As Amazon has rapidly grown and diversified, the opportunity for applying machine learning has exploded. We have a very broad collection of practical problems where machine learning systems can dramatically improve the customer experience, reduce cost, and drive speed and automation. These include product bundle recommendations for millions of products, safeguarding financial transactions across by building the risk models, improving catalog quality via extracting product attribute values from structured/unstructured data for millions of products, enhancing address quality by powering customer suggestions We are developing state-of-the-art machine learning solutions to accelerate the Amazon India growth story. Amazon is an exciting place to be at for a machine learning practitioner. We have the eagerness of a fresh startup to absorb machine learning solutions, and the scale of a mature firm to help support their development at the same time. As part of the International Machine Learning team, you will get to work alongside brilliant minds motivated to solve real-world machine learning problems that make a difference to millions of our customers. We encourage thought leadership and blue ocean thinking in ML. Key job responsibilities Use machine learning and analytical techniques to create scalable solutions for business problems Analyze and extract relevant information from large amounts of Amazon’s historical business data to help automate and optimize key processes Design, develop, evaluate and deploy, innovative and highly scalable ML models Work closely with software engineering teams to drive real-time model implementations Work closely with business partners to identify problems and propose machine learning solutions Establish scalable, efficient, automated processes for large scale data analyses, model development, model validation and model maintenance Work proactively with engineering teams and product managers to evangelize new algorithms and drive the implementation of large-scale complex ML models in production Leading projects and mentoring other scientists, engineers in the use of ML techniques About the team International Machine Learning Team is responsible for building novel ML solutions across International Emerging Store (India, MENA, Far-East, LatAm) problems and impact the bottom-line and top-line of India business. Learn more about our team from https://www.amazon.science/working-at-amazon/how-rajeev-rastogis-machine-learning-team-in-india-develops-innovations-for-customers-worldwide
US, MA, Boston
The Artificial General Intelligence (AGI) team is seeking a dedicated, skilled, and innovative Applied Scientist with a robust background in machine learning, statistics, quality assurance, auditing methodologies, and automated evaluation systems to ensure the highest standards of data quality, to build industry-leading technology with Large Language Models (LLMs) and multimodal systems. Key job responsibilities As part of the AGI team, an Applied Scientist will collaborate closely with core scientist team developing Amazon Nova models. They will lead the development of comprehensive quality strategies and auditing frameworks that safeguard the integrity of data collection workflows. This includes designing auditing strategies with detailed SOPs, quality metrics, and sampling methodologies that help Nova improve performances on benchmarks. The Applied Scientist will perform expert-level manual audits, conduct meta-audits to evaluate auditor performance, and provide targeted coaching to uplift overall quality capabilities. A critical aspect of this role involves developing and maintaining LLM-as-a-Judge systems, including designing judge architectures, creating evaluation rubrics, and building machine learning models for automated quality assessment. The Applied Scientist will also set up the configuration of data collection workflows and communicate quality feedback to stakeholders. An Applied Scientist will also have a direct impact on enhancing customer experiences through high-quality training and evaluation data that powers state-of-the-art LLM products and services. A day in the life An Applied Scientist with the AGI team will support quality solution design, conduct root cause analysis on data quality issues, research new auditing methodologies, and find innovative ways of optimizing data quality while setting examples for the team on quality assurance best practices and standards. Besides theoretical analysis and quality framework development, an Applied Scientist will also work closely with talented engineers, domain experts, and vendor teams to put quality strategies and automated judging systems into practice.
US, MA, Boston
The Artificial General Intelligence (AGI) team is seeking a dedicated, skilled, and innovative Applied Scientist with a robust background in machine learning, statistics, quality assurance, auditing methodologies, and automated evaluation systems to ensure the highest standards of data quality, to build industry-leading technology with Large Language Models (LLMs) and multimodal systems. Key job responsibilities As part of the AGI team, an Applied Scientist will collaborate closely with core scientist team developing Amazon Nova models. They will lead the development of comprehensive quality strategies and auditing frameworks that safeguard the integrity of data collection workflows. This includes designing auditing strategies with detailed SOPs, quality metrics, and sampling methodologies that help Nova improve performances on benchmarks. The Applied Scientist will perform expert-level manual audits, conduct meta-audits to evaluate auditor performance, and provide targeted coaching to uplift overall quality capabilities. A critical aspect of this role involves developing and maintaining LLM-as-a-Judge systems, including designing judge architectures, creating evaluation rubrics, and building machine learning models for automated quality assessment. The Applied Scientist will also set up the configuration of data collection workflows and communicate quality feedback to stakeholders. An Applied Scientist will also have a direct impact on enhancing customer experiences through high-quality training and evaluation data that powers state-of-the-art LLM products and services. A day in the life An Applied Scientist with the AGI team will support quality solution design, conduct root cause analysis on data quality issues, research new auditing methodologies, and find innovative ways of optimizing data quality while setting examples for the team on quality assurance best practices and standards. Besides theoretical analysis and quality framework development, an Applied Scientist will also work closely with talented engineers, domain experts, and vendor teams to put quality strategies and automated judging systems into practice.
US, MA, Boston
The Artificial General Intelligence (AGI) team is seeking a dedicated, skilled, and innovative Applied Scientist with a robust background in machine learning, statistics, quality assurance, auditing methodologies, and automated evaluation systems to ensure the highest standards of data quality, to build industry-leading technology with Large Language Models (LLMs) and multimodal systems. Key job responsibilities As part of the AGI team, an Applied Scientist will collaborate closely with core scientist team developing Amazon Nova models. They will lead the development of comprehensive quality strategies and auditing frameworks that safeguard the integrity of data collection workflows. This includes designing auditing strategies with detailed SOPs, quality metrics, and sampling methodologies that help Nova improve performances on benchmarks. The Applied Scientist will perform expert-level manual audits, conduct meta-audits to evaluate auditor performance, and provide targeted coaching to uplift overall quality capabilities. A critical aspect of this role involves developing and maintaining LLM-as-a-Judge systems, including designing judge architectures, creating evaluation rubrics, and building machine learning models for automated quality assessment. The Applied Scientist will also set up the configuration of data collection workflows and communicate quality feedback to stakeholders. An Applied Scientist will also have a direct impact on enhancing customer experiences through high-quality training and evaluation data that powers state-of-the-art LLM products and services. A day in the life An Applied Scientist with the AGI team will support quality solution design, conduct root cause analysis on data quality issues, research new auditing methodologies, and find innovative ways of optimizing data quality while setting examples for the team on quality assurance best practices and standards. Besides theoretical analysis and quality framework development, an Applied Scientist will also work closely with talented engineers, domain experts, and vendor teams to put quality strategies and automated judging systems into practice.
US, WA, Bellevue
The Artificial General Intelligence (AGI) team is seeking a dedicated, skilled, and innovative Applied Scientist with a robust background in machine learning, statistics, quality assurance, auditing methodologies, and automated evaluation systems to ensure the highest standards of data quality, to build industry-leading technology with Large Language Models (LLMs) and multimodal systems. Key job responsibilities As part of the AGI team, an Applied Scientist will collaborate closely with core scientist team developing Amazon Nova models. They will lead the development of comprehensive quality strategies and auditing frameworks that safeguard the integrity of data collection workflows. This includes designing auditing strategies with detailed SOPs, quality metrics, and sampling methodologies that help Nova improve performances on benchmarks. The Applied Scientist will perform expert-level manual audits, conduct meta-audits to evaluate auditor performance, and provide targeted coaching to uplift overall quality capabilities. A critical aspect of this role involves developing and maintaining LLM-as-a-Judge systems, including designing judge architectures, creating evaluation rubrics, and building machine learning models for automated quality assessment. The Applied Scientist will also set up the configuration of data collection workflows and communicate quality feedback to stakeholders. An Applied Scientist will also have a direct impact on enhancing customer experiences through high-quality training and evaluation data that powers state-of-the-art LLM products and services. A day in the life An Applied Scientist with the AGI team will support quality solution design, conduct root cause analysis on data quality issues, research new auditing methodologies, and find innovative ways of optimizing data quality while setting examples for the team on quality assurance best practices and standards. Besides theoretical analysis and quality framework development, an Applied Scientist will also work closely with talented engineers, domain experts, and vendor teams to put quality strategies and automated judging systems into practice.
US, MA, Boston
The Artificial General Intelligence (AGI) team is seeking a dedicated, skilled, and innovative Applied Scientist with a robust background in machine learning, statistics, quality assurance, auditing methodologies, and automated evaluation systems to ensure the highest standards of data quality, to build industry-leading technology with Large Language Models (LLMs) and multimodal systems. Key job responsibilities As part of the AGI team, an Applied Scientist will collaborate closely with core scientist team developing Amazon Nova models. They will lead the development of comprehensive quality strategies and auditing frameworks that safeguard the integrity of data collection workflows. This includes designing auditing strategies with detailed SOPs, quality metrics, and sampling methodologies that help Nova improve performances on benchmarks. The Applied Scientist will perform expert-level manual audits, conduct meta-audits to evaluate auditor performance, and provide targeted coaching to uplift overall quality capabilities. A critical aspect of this role involves developing and maintaining LLM-as-a-Judge systems, including designing judge architectures, creating evaluation rubrics, and building machine learning models for automated quality assessment. The Applied Scientist will also set up the configuration of data collection workflows and communicate quality feedback to stakeholders. An Applied Scientist will also have a direct impact on enhancing customer experiences through high-quality training and evaluation data that powers state-of-the-art LLM products and services. A day in the life An Applied Scientist with the AGI team will support quality solution design, conduct root cause analysis on data quality issues, research new auditing methodologies, and find innovative ways of optimizing data quality while setting examples for the team on quality assurance best practices and standards. Besides theoretical analysis and quality framework development, an Applied Scientist will also work closely with talented engineers, domain experts, and vendor teams to put quality strategies and automated judging systems into practice.
US, MA, Boston
The Artificial General Intelligence (AGI) team is seeking a dedicated, skilled, and innovative Applied Scientist with a robust background in machine learning, statistics, quality assurance, auditing methodologies, and automated evaluation systems to ensure the highest standards of data quality, to build industry-leading technology with Large Language Models (LLMs) and multimodal systems. Key job responsibilities As part of the AGI team, an Applied Scientist will collaborate closely with core scientist team developing Amazon Nova models. They will lead the development of comprehensive quality strategies and auditing frameworks that safeguard the integrity of data collection workflows. This includes designing auditing strategies with detailed SOPs, quality metrics, and sampling methodologies that help Nova improve performances on benchmarks. The Applied Scientist will perform expert-level manual audits, conduct meta-audits to evaluate auditor performance, and provide targeted coaching to uplift overall quality capabilities. A critical aspect of this role involves developing and maintaining LLM-as-a-Judge systems, including designing judge architectures, creating evaluation rubrics, and building machine learning models for automated quality assessment. The Applied Scientist will also set up the configuration of data collection workflows and communicate quality feedback to stakeholders. An Applied Scientist will also have a direct impact on enhancing customer experiences through high-quality training and evaluation data that powers state-of-the-art LLM products and services. A day in the life An Applied Scientist with the AGI team will support quality solution design, conduct root cause analysis on data quality issues, research new auditing methodologies, and find innovative ways of optimizing data quality while setting examples for the team on quality assurance best practices and standards. Besides theoretical analysis and quality framework development, an Applied Scientist will also work closely with talented engineers, domain experts, and vendor teams to put quality strategies and automated judging systems into practice.