Do large language models understand the world?

In addition to its practical implications, recent work on “meaning representations” could shed light on some old philosophical questions.

For centuries, theories of meaning have been of interest almost exclusively to philosophers, debated in seminar rooms and at conferences for small specialty audiences.

But the advent of large language models (LLMs) and other “foundation models” has changed that. Suddenly, mainstream media are alive with speculation about whether models trained only to predict the next word in a sequence can truly understand the world.

Trager:Soatto.png
Applied scientist Matthew Trager (left) and vice president and distinguished scientist Stefano Soatto (right).

Skepticism naturally arises. How can a machine that generates language in such a mechanical way grasp words’ meanings? Simply processing text, however fluently, would not seem to imply any sort of deeper understanding.

This kind of skepticism has a long history. In 1980, the philosopher John Searle proposed a thought experiment known as the Chinese room, in which a person who does not know Chinese follows a set of rules to manipulate Chinese characters, producing Chinese responses to Chinese questions. The experiment is meant to show that, since the person in the room never understands the language, symbolic manipulation alone cannot lead to semantic understanding.

Similarly, today’s critics often argue that since LLMs are able only to process “form” — symbols or words — they cannot in principle achieve understanding. Meaning depends on relations between form (linguistic expressions, or sequences of tokens in a language model) and something external, these critics argue, and models trained only on form learn nothing about those relations.

But is that true? In this essay, we will argue that language models not only can but do represent meanings.

Probability space

At Amazon Web Services (AWS), we have been investigating concrete ways to characterize meaning as represented by LLMs. The first challenge with these models is that there is no clear candidate for “where” meanings could reside. Today’s LLMs are usually decoder-only models; unlike encoder-only or encoder-decoder models, they do not use a vector space to represent data. Instead, they represent words in a distributed way, across the many layers and attention heads of a transformer model. How should we think of meaning representation in such models?

Related content
Novel architectures and carefully prepared training data enable state-of-the-art performance.

In our paper “Meaning representations from trajectories in autoregressive models”, we propose an answer to this question. For a given sentence, we consider the probability distribution over all possible sequences of tokens that can follow it, and the set of all such distributions defines a representational space.

To the extent that two sentences have similar continuation probabilities — or trajectories — they’re closer together in the representational space; to the extent that their probability distributions differ, they’re farther apart. Sentences that produce the same distribution of continuations are “equivalent”, and together, they define an equivalence class. A sentence’s meaning representation is then the equivalence class that it belongs to.

Trajectory likelihood distributions.png
Sentences with similar meanings produce similar score distributions over their continuations (top), while sentences with different meanings produce different score distributions over their continuations (bottom).

In the field of natural-language processing (NLP), it is widely recognized that the distribution of words in language is closely related to their meaning. This idea is known as the “distributional hypothesis” and is often invoked in the context of methods like word2vec embeddings, which build meaning representations from statistics on word co-occurrence. But we believe we are the first to use the distributions themselves as the primary way to represent meaning. This is possible since LLMs offer a way to evaluate these distributions computationally.

Related content
Two papers from Amazon Web Services AI present algorithms that alleviate the intensive hyperparameter search and fine-tuning required by privacy-preserving deep learning at very large scales.

Of course, the possible continuations of a single sentence are effectively infinite, so even using an LLM we can never completely describe their distribution. But this impossibility reflects the fundamental indeterminacy of meaning, which holds for people and AI models alike. Meanings are not directly observed: they are encoded in the billions of synapses in a brain or the billions of activations of a trained model, which can be used to produce expressions. Any finite number of expressions may be compatible with multiple (indeed, infinitely many) meanings; which meaning the human — or the language model — intends to convey can never be known for sure.

What is surprising, however, is that, despite the large dimensionality of today’s models, we do not need to sample billions or trillions of trajectories in order to characterize a meaning. A handful — say, 10 or 20 — is sufficient. Again, this is consistent with human linguistic practice. A teacher asked what a particular statement means will typically rephrase it in a few ways, in what could be described as an attempt to identify the equivalence class to which the statement belongs.

In experiments reported in our paper, we showed that a measure of sentence similarity that uses off-the-shelf LLMs to sample token trajectories largely agrees with human annotations. In fact, our strategy outperforms all competing approaches on zero-shot benchmarks for semantic textual similarity (STS).

Form and content

Does this suggest that our paper’s definition of meaning — a distribution over possible trajectories — reflects what humans do when they ascribe meaning? Again, skeptics would say that it couldn’t possibly: text continuations are based only on “form” and lack the external grounding necessary for meaning.

But probabilities over continuations may capture something deeper about how we interpret the world. Consider a sentence that begins “On top of the dresser stood … ” and the probabilities of three possible continuations of that sentence: (1) “a photo”; (2) “an Oscar statuette”; and (3) “an ingot of plutonium”. Don’t those probabilities tell you something about what, in fact, you can expect to find on top of someone’s dresser? The probabilities over all possible sentence continuations might be a good guide to the likelihood of finding different objects on the tops of dressers; in that case, the “formal” patterns encoded by the LLM would tell you something particular about the world.

Related content
Novel “checkpointing” scheme that uses CPU memory reduces the time wasted on failure recovery by more than 92%.

The skeptic might reply, however, that it’s the mapping of words to objects that gives the words meaning, and the mapping isn’t intrinsic to the words themselves; it requires human interpretation or some other mechanism external to the LLM.

But how do humans do that mapping? What happens inside you when you read the phrase “the objects on top of the dresser”? Maybe you envision something that feels somehow indefinite — a superposition of the dresser viewed from multiple angles or heights, say, with abstract objects in a certain range of sizes and colors on top. Maybe you also envision the possible locations of the dresser in the room, the room’s other furnishings, the feel of the wood of the dresser, the scent of the dresser or of the objects on top of it, and so on.

All of those possibilities can be captured by probability distributions, over data in multiple sensory modalities and in multiple conceptual schemas. So maybe meaning for humans involves probabilities over continuations, too, but in a multisensory space instead of a textual space. And on that view, when an LLM computes continuations of token sequences, it’s accessing meaning in a way that resembles what humans do, just in a more limited space.

Skeptics might argue that the passage from the multisensory realm to written language is a bottleneck that meaning can’t squeeze through. But that passage could also be interpreted as a simple projection, similar to the projection from a three-dimensional scene down to a two-dimensional image. The two-dimensional image provides only partial information, but in many situations, the scene remains quite understandable. And since language is our main tool for communicating our multisensory experiences, the projection into text might not be that "lossy" after all.

Multimodal projection.png
The passage from the multisensory realm to written language could be interpreted as a simple projection, similar to the projection from a three-dimensional scene down to a two-dimensional image.

This is not to say that today’s LLMs grasp meanings in the same way that humans do. Our work shows only that large language models develop internal representations with semantic value. We’ve also found evidence that such representations are composed of discrete entities, which relate to each other in complex ways — not just proximity but directionality, entailment, and containment.

But those structural relationships may differ from the structural relationships in the languages used to train the models. That would remain true even if we trained the model on sensory signals: we cannot directly see what meaning subtends a particular expression, for a model any more than for a human.

Related content
Finding that 70% of attention heads and 20% of feed-forward networks can be excised with minimal effect on in-context learning suggests that large language models are undertrained.

If the model and human have been exposed to similar data, however, and if they have shared enough experiences (today, annotation is the medium of sharing), then there is a basis on which to communicate. Alignment can then be seen as the process of translating between the model’s emergent “inner language” — we call it “neuralese” — and natural language.

How faithful can that alignment be? As we continue to improve these models, we will need to face the fact that even humans lack a stable, universal system of shared meanings. LLMs, with their distinct approach to processing information, may simply be another voice in a diverse chorus of interpretations.

In one form or another, questions about the relationship between the world and its representation have been central to philosophy for at least 400 years, and no definitive answers have emerged. As we move toward a future in which LLMs are likely to play a larger and larger role, we should not dismiss ideas based only on our intuitions but continue to ask these difficult questions. The apparent limitations of LLMs might be only a reflection of our poor understanding of what meaning actually is.

Research areas

Related content

US, WA, Bellevue
Conversational AI ModEling and Learning (CAMEL) team is part of Amazon Devices organization where our mission is to build a best-in-class Conversational AI that is intuitive, intelligent, and responsive, by developing superior Large Language Models (LLM) solutions and services which increase the capabilities built into the model and which enable utilizing thousands of APIs and external knowledge sources to provide the best experience for each request across millions of customers and endpoints. We are looking for a passionate, talented, and resourceful Senior Applied Scientist in the field of LLM, Artificial Intelligence (AI), Natural Language Processing (NLP), Recommender Systems and/or Information Retrieval, to invent and build scalable solutions for a state-of-the-art context-aware conversational AI. A successful candidate will have strong machine learning background and a desire to push the envelope in one or more of the above areas. The ideal candidate would also have hands-on experiences in building Generative AI solutions with LLMs, enjoy operating in dynamic environments, be self-motivated to take on challenging problems to deliver big customer impact, moving fast to ship solutions and then iterating on user feedback and interactions. Key job responsibilities As a Senior Applied Scientist, you will leverage your technical expertise and experience to demonstrate leadership in tackling large complex problems, setting the direction and collaborating with other talented applied scientists and engineers to research and develop LLM modeling and engineering techniques to reduce friction and enable natural and contextual conversations. You will analyze, understand and improve user experiences by leveraging Amazon’s heterogeneous data sources and large-scale computing resources to accelerate advances in artificial intelligence. You will work on core LLM technologies, including Prompt Engineering, Model Fine-Tuning, Reinforcement Learning from Human Feedback (RLHF), Evaluation, etc. Your work will directly impact our customers in the form of novel products and services .
US, CA, Pasadena
The Amazon Web Services (AWS) Center for Quantum Computing (CQC) is a multi-disciplinary team of scientists, engineers, and technicians, on a mission to develop a fault-tolerant quantum computer. We are looking to hire a Research Scientist with fabrication and data analysis experience working on all elements of a superconducting circuit. The position is on-site at our lab, located on the in Pasadena, CA. The ideal candidate will have had prior experience building software tools for data analysis and visualization to enable deep diving into fabrication details, electrical test data. We are looking for candidates with strong engineering principles, resourcefulness and data science experience. Organization and communication skills are essential. Key job responsibilities * Develop and automate data pipeline pertinent to superconducting device fabrication. * Develop analytical tools to uncover new information about established and new processes. * Develop new or contribute to modifying existing data visualization tools. * Utilize machine learning to enable better deeper dives into fabrication and related data. * Interface with various software, design, fabrication and electrical test teams to enable new functionalities. A day in the life The role will be vital to the fabrication team and quantum computing device integration mechanism. The candidate will develop software based analytical tools to enable data driven decisions across projects related to fabrication and supporting infrastructure. Each fabrication run delivers additional data. The candidate will stay close to the details of fabrication providing data analysis and quick feedback to key stakeholders. At the end of fabrication runs custom and standardized reports will be generated by the candidate to provide insights into data generated from the run. This position may require occasional weekend work. About the team AWS values diverse experiences. Even if you do not meet all of the qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying. Why AWS? Amazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating — that’s why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses. Inclusive Team Culture Here at AWS, it’s in our nature to learn and be curious. Our employee-led affinity groups foster a culture of inclusion that empower us to be proud of our differences. Ongoing events and learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon (gender diversity) conferences, inspire us to never stop embracing our uniqueness. Mentorship & Career Growth We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional. Work/Life Balance We value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why we strive for flexibility as part of our working culture. When we feel supported in the workplace and at home, there’s nothing we can’t achieve in the cloud. Hybrid Work We value innovation and recognize this sometimes requires uninterrupted time to focus on a build. We also value in-person collaboration and time spent face-to-face. Our team affords employees options to work in the office every day or in a flexible, hybrid work model near one of our U.S. Amazon offices.
CA, ON, Toronto
Amazon Advertising is one of Amazon's fastest growing and most profitable businesses, responsible for defining and delivering a collection of advertising products that drive discovery and sales. Our products and solutions are strategically important to enable our Retail and Marketplace businesses to drive long-term growth. We deliver billions of ad impressions and millions of clicks and break fresh ground in product and technical innovations every day! As an Applied Scientist on this team, you will: - Drive end-to-end Machine Learning projects that have a high degree of ambiguity, scale, complexity. - Perform hands-on analysis and modeling of enormous data sets to develop insights that increase traffic monetization and merchandise sales, without compromising the shopper experience. - Build machine learning models, perform proof-of-concept, experiment, optimize, and deploy your models into production; work closely with software engineers to assist in productionizing your ML models. - Run A/B experiments, gather data, and perform statistical analysis. - Establish scalable, efficient, automated processes for large-scale data analysis, machine-learning model development, model validation and serving. - Research new and innovative machine learning approaches. - Recruit Applied Scientists to the team and provide mentorship. Why you will love this opportunity: Amazon is investing heavily in building a world-class advertising business. This team defines and delivers a collection of advertising products that drive discovery and sales. Our solutions generate billions in revenue and drive long-term growth for Amazon’s Retail and Marketplace businesses. We deliver billions of ad impressions, millions of clicks daily, and break fresh ground to create world-class products. We are a highly motivated, collaborative, and fun-loving team with an entrepreneurial spirit - with a broad mandate to experiment and innovate. Impact and Career Growth: You will invent new experiences and influence customer-facing shopping experiences to help suppliers grow their retail business and the auction dynamics that leverage native advertising; this is your opportunity to work within the fastest-growing businesses across all of Amazon! Define a long-term science vision for our advertising business, driven from our customers' needs, translating that direction into specific plans for research and applied scientists, as well as engineering and product teams. This role combines science leadership, organizational ability, technical strength, product focus, and business understanding. Team video https://youtu.be/zD_6Lzw8raE
US, WA, Seattle
Amazon Advertising is one of Amazon's fastest growing and most profitable businesses, responsible for defining and delivering a collection of advertising products that drive discovery and sales. Our products and solutions are strategically important to enable our Retail and Marketplace businesses to drive long-term growth. We deliver billions of ad impressions and millions of clicks and break fresh ground in product and technical innovations every day! As the Data Science Manager on this team, you will: - Lead of team of scientists, business intelligence engineers, etc., on solving science problems with a high degree of complexity and ambiguity. - Develop science roadmaps, run annual planning, and foster cross-team collaboration to execute complex projects. - Perform hands-on data analysis, build machine-learning models, run regular A/B tests, and communicate the impact to senior management. - Hire and develop top talent, provide technical and career development guidance to scientists and engineers in the organization. - Analyze historical data to identify trends and support optimal decision making. - Apply statistical and machine learning knowledge to specific business problems and data. - Formalize assumptions about how our systems should work, create statistical definitions of outliers, and develop methods to systematically identify outliers. Work out why such examples are outliers and define if any actions needed. - Given anecdotes about anomalies or generate automatic scripts to define anomalies, deep dive to explain why they happen, and identify fixes. - Build decision-making models and propose effective solutions for the business problems you define. - Conduct written and verbal presentations to share insights to audiences of varying levels of technical sophistication. Why you will love this opportunity: Amazon has invested heavily in building a world-class advertising business. This team defines and delivers a collection of advertising products that drive discovery and sales. Our solutions generate billions in revenue and drive long-term growth for Amazon’s Retail and Marketplace businesses. We deliver billions of ad impressions, millions of clicks daily, and break fresh ground to create world-class products. We are a highly motivated, collaborative, and fun-loving team with an entrepreneurial spirit - with a broad mandate to experiment and innovate. Impact and Career Growth: You will invent new experiences and influence customer-facing shopping experiences to help suppliers grow their retail business and the auction dynamics that leverage native advertising; this is your opportunity to work within the fastest-growing businesses across all of Amazon! Define a long-term science vision for our advertising business, driven from our customers' needs, translating that direction into specific plans for research and applied scientists, as well as engineering and product teams. This role combines science leadership, organizational ability, technical strength, product focus, and business understanding. Team video ~ https://youtu.be/zD_6Lzw8raE
US, WA, Seattle
Amazon Advertising is one of Amazon's fastest growing and most profitable businesses, responsible for defining and delivering a collection of advertising products that drive discovery and sales. Our products and solutions are strategically important to enable our Retail and Marketplace businesses to drive long-term growth. We deliver billions of ad impressions and millions of clicks and break fresh ground in product and technical innovations every day! As an Applied Science Manager in Machine Learning, you will: - Directly manage and lead a cross-functional team of Applied Scientists, Data Scientists, Economists, and Business Intelligence Engineers. - Develop and manage a research agenda that balances short term deliverables with measurable business impact as well as long term investments. - Lead marketplace design and development based on economic theory and data analysis. - Provide technical and scientific guidance to team members. - Rapidly design, prototype and test many possible hypotheses in a high-ambiguity environment, making use of both quantitative and business judgment - Advance the team's engineering craftsmanship and drive continued scientific innovation as a thought leader and practitioner. - Develop science and engineering roadmaps, run annual planning, and foster cross-team collaboration to execute complex projects. - Perform hands-on data analysis, build machine-learning models, run regular A/B tests, and communicate the impact to senior management. - Collaborate with business and software teams across Amazon Ads. - Stay up to date with recent scientific publications relevant to the team. - Hire and develop top talent, provide technical and career development guidance to scientists and engineers within and across the organization. Why you will love this opportunity: Amazon is investing heavily in building a world-class advertising business. This team defines and delivers a collection of advertising products that drive discovery and sales. Our solutions generate billions in revenue and drive long-term growth for Amazon’s Retail and Marketplace businesses. We deliver billions of ad impressions, millions of clicks daily, and break fresh ground to create world-class products. We are a highly motivated, collaborative, and fun-loving team with an entrepreneurial spirit - with a broad mandate to experiment and innovate. Impact and Career Growth: You will invent new experiences and influence customer-facing shopping experiences to help suppliers grow their retail business and the auction dynamics that leverage native advertising; this is your opportunity to work within the fastest-growing businesses across all of Amazon! Define a long-term science vision for our advertising business, driven from our customers' needs, translating that direction into specific plans for research and applied scientists, as well as engineering and product teams. This role combines science leadership, organizational ability, technical strength, product focus, and business understanding. Team video ~ https://youtu.be/zD_6Lzw8raE
US, NJ, Newark
At Audible, we believe stories have the power to transform lives. It’s why we work with some of the world’s leading creators to produce and share audio storytelling with our millions of global listeners. We are dreamers and inventors who come from a wide range of backgrounds and experiences to empower and inspire each other. Imagine your future with us. ABOUT THIS ROLE As Senior Data Scientist, you will build scalable solutions and models to support our business functions (Marketing, Product, Content). Leveraging a range of methods including machine learning and simulation, you will explain, quantify, predict and prescribe in support of informing critical business decisions. You will translate business goals into agile, insightful analytics. You will seek to create value for both stakeholders and customers and inform findings in a clear, actionable way to managers and senior leaders. ABOUT THE TEAM Audible data science team partners with marketing, content, product, and technology teams to solve business and technology problems using scientific approaches to build product and services that surprise and delight our customers. We employ scalable cutting-edge machine learning (ML), causal inference (CI) and GenAI / Natural Language Processing (NLP) knowledge to better target customers and prospects, understand and personalize the content, and context needed to optimize their book-listening experience. We operate in an agile environment in which we own and collaborate on the life cycle of research, design, and model development of relevant projects. ABOUT YOU We are looking for a motivated, results-oriented Data Scientist with strong rigor and demonstrable skills in ML, CI, NLP, data mining and/or large-scale distributed computation. As a Senior Data Scientist, you will... - Develop and validate models to optimize the Who, When, Where and How of all our interactions with customers - Develop Amazon-scale data engineering pipelines - Imagine and invent before the business asks, and create groundbreaking applications using cutting-edge approaches - Develop compelling data visualizations - Work closely with other data scientists, ML experts, engineers as well as business across globe, and on cross-disciplinary efforts with other scientists within Amazon - Contribute to the growth of the Audible Data Science team by sharing your ideas, intellectual property and learning from others ABOUT AUDIBLE Audible is the leading producer and provider of audio storytelling. We spark listeners’ imaginations, offering immersive, cinematic experiences full of inspiration and insight to enrich our customers daily lives. Our Hub+Home hybrid workplace model gives employees the flexibility between gathering in a common office space (work from hub) and remote work (work from home). For more information, please visit adbl.co/hybrid
US, CA, Sunnyvale
The Amazon Artificial General Intelligence (AGI) Personalization team is looking for a passionate, highly skilled and inventive Applied Scientist with strong machine learning background to build state-of-the-art ML systems for personalizing large-scale, high-quality conversational assistant systems. As a Applied Scientist, you will play a critical role in driving the development of personalization techniques enabling conversational systems, in particular those based on large language models, information retrieval, recommender systems and knowledge graph, to be tailored to customer needs. You will handle Amazon-scale use cases with significant impact on our customers' experiences. Key job responsibilities - Use deep learning, ML and NLP techniques to create scalable solutions for creation and development of language model centric solutions for building personalized assistant systems based on a rich set of structured and unstructured contextual signals - Innovate new methods for contextual knowledge extraction and information retrieval, using language models in combination with other learning techniques, that allows effective grounding in context providers when considering memory, compute, latency and quality - Research in advanced customer understanding and behavior modeling techniques - Collaborate with cross-functional teams of scientists, engineers, and product managers to identify and solve complex problems in personal knowledge aggregation, processing, modeling, and verification - Design and execute experiments to evaluate the performance of state-of-the-art algorithms and models, and iterate quickly to improve results - Think Big on conversational assistant system personalization over a multi-year horizon, and identify new opportunities to apply these technologies to solve real-world problems - Communicate results and insights to both technical and non-technical audiences, including through presentations and written reports About the team The AGI Personalization org uses various contextual signals to personalize Large Language Model output for our customers while maintaining privacy and security of customer data. We work across multiple Amazon products, including Alexa, to enhance the user experience by bringing more personal context and relevance to customer interactions.
US, NY, New York
Amazon Advertising is one of Amazon's fastest growing and most profitable businesses. As a core product offering within our advertising portfolio, Sponsored Products (SP) helps merchants, retail vendors, and brand owners succeed via native advertising, which grows incremental sales of their products sold through Amazon. The SP team's primary goals are to help shoppers discover new products they love, be the most efficient way for advertisers to meet their business objectives, and build a sustainable business that continuously innovates on behalf of customers. Our products and solutions are strategically important to enable our Retail and Marketplace businesses to drive long-term growth. We deliver billions of ad impressions and millions of clicks and break fresh ground in product and technical innovations every day! Why you love this opportunity Amazon is investing heavily in building a world-class advertising business. This team is responsible for defining and delivering a collection of advertising products that drive discovery and sales. Our solutions generate billions in revenue and drive long-term growth for Amazon’s Retail and Marketplace businesses. We deliver billions of ad impressions, millions of clicks daily, and break fresh ground to create world-class products. We are highly motivated, collaborative, and fun-loving team with an entrepreneurial spirit - with a broad mandate to experiment and innovate. Impact and Career Growth You will invent new experiences and influence customer-facing shopping experiences to help suppliers grow their retail business and the auction dynamics that leverage native advertising; this is your opportunity to work within the fastest-growing businesses across all of Amazon! Define a long-term science vision for our advertising business, driven fundamentally from our customers' needs, translating that direction into specific plans for research and applied scientists, as well as engineering and product teams. This role combines science leadership, organizational ability, technical strength, product focus, and business understanding. Key job responsibilities Key job responsibilities As an Applied Scientist III on this team you will: * Lead complex and ambiguous projects to deliver bidding recommendation products to advertisers. * Build machine learning models and utilize data analysis to deliver scalable solutions to business problems. * Perform hands-on analysis and modeling with very large data sets to develop insights that increase traffic monetization and merchandise sales without compromising shopper experience. * Work closely with software engineers on detailed requirements, technical designs and implementation of end-to-end solutions in production. * Design and run A/B experiments that affect hundreds of millions of customers, evaluate the impact of your optimizations and communicate your results to various business stakeholders. * Work with scientists and economists to model the interaction between organic sales and sponsored content and to further evolve Amazon's marketplace. * Establish scalable, efficient, automated processes for large-scale data analysis, machine-learning model development, model validation and serving. * Research new predictive learning approaches for the sponsored products business. * Write production code to bring models into production. * Mentor junior scientists and engineer in the team.
CA, ON, Toronto
Amazon Advertising is one of Amazon's fastest growing and most profitable businesses. As a core product offering within our advertising portfolio, Sponsored Products (SP) helps merchants, retail vendors, and brand owners succeed via native advertising, which grows incremental sales of their products sold through Amazon. The SP team's primary goals are to help shoppers discover new products they love, be the most efficient way for advertisers to meet their business objectives, and build a sustainable business that continuously innovates on behalf of customers. Our products and solutions are strategically important to enable our Retail and Marketplace businesses to drive long-term growth. We deliver billions of ad impressions and millions of clicks and break fresh ground in product and technical innovations every day! Why you love this opportunity Amazon is investing heavily in building a world-class advertising business. This team is responsible for defining and delivering a collection of advertising products that drive discovery and sales. Our solutions generate billions in revenue and drive long-term growth for Amazon’s Retail and Marketplace businesses. We deliver billions of ad impressions, millions of clicks daily, and break fresh ground to create world-class products. We are highly motivated, collaborative, and fun-loving team with an entrepreneurial spirit - with a broad mandate to experiment and innovate. Impact and Career Growth You will invent new experiences and influence customer-facing shopping experiences to help suppliers grow their retail business and the auction dynamics that leverage native advertising; this is your opportunity to work within the fastest-growing businesses across all of Amazon! Define a long-term science vision for our advertising business, driven fundamentally from our customers' needs, translating that direction into specific plans for research and applied scientists, as well as engineering and product teams. This role combines science leadership, organizational ability, technical strength, product focus, and business understanding. Key job responsibilities As an Applied Scientist on this team you will: * Build machine learning models and utilize data analysis to deliver scalable solutions to business problems. * Perform hands-on analysis and modeling with very large data sets to develop insights that increase traffic monetization and merchandise sales without compromising shopper experience. * Work closely with software engineers on detailed requirements, technical designs and implementation of end-to-end solutions in production. * Design and run A/B experiments that affect hundreds of millions of customers, evaluate the impact of your optimizations and communicate your results to various business stakeholders. * Work with scientists and economists to model the interaction between organic sales and sponsored content and to further evolve Amazon's marketplace. * Establish scalable, efficient, automated processes for large-scale data analysis, machine-learning model development, model validation and serving. * Research new predictive learning approaches for the sponsored products business. * Write production code to bring models into production.
US, WA, Seattle
Are you excited by the idea of developing algorithms to improve the shopping experience for Amazon customers? Are you looking for new challenges and to solve hard science problems while applying state-of-the-art modeling techniques? Join us and you'll help make the shopping experience better for millions of customers while also advancing the state of Amazon's science through publishing research! Key job responsibilities - Develop and apply new machine learning algorithms - Use expertise in supervised learning and causal inference to improve ML performance - Scale optimization techniques to drive business value - Design A/B tests and conduct statistical analysis on their results - Work with distributed machine learning and statistical algorithms to harness enormous volumes of data at scale to serve our customers - Present and publish science research, contributing to Amazon's science community - Mentor junior engineers and scientists. - Work closely with internal stakeholders like the business teams, engineering teams and partner teams and align them with respect to your focus area About the team Our team's mission is to surface the right payments-related recommendations to customers at the right time, helping create a rewarding and successful shopping experience for Amazon's customers. Our team's culture is highly collaborative, with an emphasis on supporting each other and learning from one another. We dedicate time each week to focus on personal development and expanding our knowledge as a team. We also highly value having a big impact, both for Amazon's business and for our customers.