The science behind Amazon Prime

Amazon’s scientists have developed a variety of scientific models to help customers get the most out of their membership.

In his 2020 shareholder letter, Jeff Bezos, executive chair of Amazon’s board of directors, shared that more than 200 million people around the world have a Prime membership — along with its attendant benefits.

Those include delivery benefits (like free one- and two-day delivery), digital benefits (such as Prime Video and Amazon Music), and shopping benefits (including Prime Day member deals). Prime members can also download thousands of e-books, magazines, and comics for free, get unlimited photo storage, order groceries online, and more.

Amazon is continually expanding and evolving its selection of Prime benefits to enhance the value for members. As Bezos wrote in an earlier shareholder letter: “We want Prime to be such a good value, you'd be irresponsible not to be a member.”

To help deliver more value to Prime members, scientists within Amazon’s Prime organization develop methods to help consumers discover and utilize Prime benefits. Using techniques derived from machine learning, structural econometrics, and other disciplines, they also help Amazon decide how to evolve Prime benefit offerings around the world.

Surface the most relevant Prime benefits to customers

When shoppers visit the Amazon Store, they are presented with a variety of Prime callouts with relevant benefits and related product information. Callouts for non-Prime members might outline the wide variety of benefits available, while Prime members might see more options to utilize their Prime benefits. For example, a Prime member visiting the detail page for the movie Jane Eyre might see a callout saying that the title is available for free on Prime Reading.

“We utilize recommender systems to engage shoppers with information about Prime benefits that they would find most interesting,” says Houssam Nassif, a principal applied scientist within Amazon’s consumer organization.

To predict which callout will most excite customers, the system maps item attributes (like brand, color, price, title, and category) to how often customers select those items. The models embedded in the system use Bayesian recommenders to decide which content is most relevant to surface. Bayesian inference makes predictions about future events by updating prior hypotheses as more information becomes available.
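As a rough illustration of what "updating prior hypotheses" means, the sketch below tracks a belief about one callout's click probability with a Beta distribution and updates it from observed clicks. This is a textbook Beta-Bernoulli model, not the Prime team's system; the class name, the uniform Beta(1, 1) prior, and the click counts are all assumptions made for the example.

```python
from dataclasses import dataclass


@dataclass
class BetaPosterior:
    """Beta(alpha, beta) belief about a callout's click probability."""

    alpha: float = 1.0  # prior pseudo-count of clicks (assumed uniform prior)
    beta: float = 1.0   # prior pseudo-count of non-clicks

    def update(self, clicks: int, impressions: int) -> "BetaPosterior":
        # Conjugate update: add observed clicks and non-clicks to the prior counts.
        return BetaPosterior(self.alpha + clicks, self.beta + impressions - clicks)

    @property
    def mean(self) -> float:
        # Expected click probability under the current belief.
        return self.alpha / (self.alpha + self.beta)


prior = BetaPosterior()
posterior = prior.update(clicks=3, impressions=20)  # made-up observations
print(prior.mean, posterior.mean)
```

Before any data arrives, the belief sits at the prior mean; each batch of impressions pulls it toward the observed click rate, and the pull weakens as evidence accumulates.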

However, there are limits to this approach. Relying exclusively on Bayesian methods to measure customer selections can bias results toward more popular items. For example, shoppers interested in Jane Eyre might also want to read new romance novels. The challenge: newer items have untrained model weights, which can cause the system to underestimate their true click probability.

“This experience would be similar to going to a music recommendation engine, and seeing only the chart toppers in your favorite categories,” Nassif explains. “To improve the diversity of recommendations, we have to overcome the classic exploitation-exploration dilemma by including relevant and popular items [exploitation] along with newer or long-tail items that scored higher than their expected value [exploration].”

To do this, the Prime ML team utilizes methods that allow the algorithm to update the “click-probability” score by using delayed feedback from customers.
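One common way to balance exploitation and exploration is Thompson sampling, sketched below. The article does not say which method the Prime team uses, so treat this as a generic illustration: the function names, item names, counts, and the Beta(1, 1) prior are hypothetical.

```python
import random


def thompson_pick(stats: dict[str, tuple[int, int]], rng: random.Random) -> str:
    """Pick the item whose sampled click probability is highest.

    stats maps item -> (clicks, impressions); a Beta(1, 1) prior is assumed.
    """
    draws = {
        item: rng.betavariate(1 + clicks, 1 + impressions - clicks)
        for item, (clicks, impressions) in stats.items()
    }
    return max(draws, key=draws.get)


def record_feedback(stats: dict[str, tuple[int, int]], item: str, clicked: bool) -> None:
    """Fold one observed outcome into the counts, whenever it arrives."""
    clicks, impressions = stats[item]
    stats[item] = (clicks + int(clicked), impressions + 1)


rng = random.Random(0)
# A popular item with a long history and a new item with no history (made-up counts).
stats = {"chart_topper": (90, 1000), "new_title": (0, 0)}
picks = [thompson_pick(stats, rng) for _ in range(500)]
share_new = picks.count("new_title") / len(picks)
print(f"new_title shown in {share_new:.0%} of picks before any feedback")

# Feedback that arrives later is applied as soon as it becomes available.
record_feedback(stats, "new_title", clicked=False)
```

Because the new item's click probability is still highly uncertain, its sampled value is often high, so it gets shown and earns the data needed to score it properly. As feedback accumulates, the samples concentrate around the item's true click rate.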

“Adaptive systems allow us to focus the diversity of recommendations even further,” says Nassif.

Prime’s adaptive systems respond continually to evolving preferences across all Amazon customers. For example, classic-literature enthusiasts who read Jane Eyre will not see callouts for romance novels or romantic comedy movies unless they express some interest in other romance novels. Some of those recommender systems are captured in the paper "Bayesian meta-prior learning using Empirical Bayes".

Recommending content that customers love

Determining the most relevant Prime benefits to present to users is the first step. Prime’s scientists have also developed algorithms to determine which formats are most likely to appeal to customers.

“Every callout has multiple dimensions, which in turn presents a large number of decisions,” says Nassif. “Do customers like to see their name? Should the callout feature a single particular product? Or even a grouping of products? To make these decisions, we have to develop an accurate understanding of customer preferences.”

Callouts comprise multiple components: a headline, body copy, and one or more images. They can also include other elements, like customer reviews. Testing multiple variables is a combinatorial problem that can cover a large decision space, which limits the speed of experiments designed to arrive at the layout customers prefer most.

To avoid the combinatorial explosion that results from considering every possible combination, the models score a small subset of combinations and then extrapolate what they learn to the larger universe of layouts that can be presented to customers. Conditioned on prior observations, the models select the layout with the highest probability of delivering the highest customer value.
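To make the idea of scoring a subset and extrapolating concrete, here is a minimal sketch under a strong assumption: each component contributes to the score independently (an additive model with no interactions). The article does not describe the actual model, and the component names, option values, and `measure` function below are synthetic placeholders.

```python
from itertools import product

# Hypothetical callout components; the option names are placeholders.
slots = {
    "headline": ["h0", "h1", "h2", "h3"],
    "body": ["b0", "b1", "b2", "b3"],
    "image": ["i0", "i1", "i2", "i3"],
}
layouts = list(product(*slots.values()))  # every combination: 4 * 4 * 4 = 64

# Synthetic stand-in for measured customer response (additive by construction).
option_value = {
    "h0": 0.0, "h1": 0.3, "h2": 0.1, "h3": -0.2,
    "b0": 0.0, "b1": -0.1, "b2": 0.4, "b3": 0.2,
    "i0": 0.0, "i1": 0.5, "i2": 0.2, "i3": -0.3,
}


def measure(layout):
    return 1.0 + sum(option_value[option] for option in layout)


# Score only the baseline layout plus layouts that change one component at a time.
baseline = tuple(options[0] for options in slots.values())
scored = {baseline: measure(baseline)}
for pos, options in enumerate(slots.values()):
    for option in options[1:]:
        variant = baseline[:pos] + (option,) + baseline[pos + 1:]
        scored[variant] = measure(variant)


def predict(layout):
    # Additive extrapolation: baseline score plus each option's measured lift.
    lift = 0.0
    for pos, option in enumerate(layout):
        variant = baseline[:pos] + (option,) + baseline[pos + 1:]
        lift += scored[variant] - scored[baseline]
    return scored[baseline] + lift


best = max(layouts, key=predict)
print(len(layouts), "layouts,", len(scored), "scored, predicted best:", best)
```

Here 10 measurements stand in for 64. Real responses are noisy and components can interact (a headline may only work with a particular image), so a production model would need to account for both; this sketch only shows why a structured subset can be enough when the structure assumption holds.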

Evolving the selection of Prime benefits

In addition to informing how customers receive recommendations about Prime as it exists today, scientists also influence how Prime will evolve as a membership. This work involves scientists from multiple disciplines collaborating closely to determine the best selection of benefits: from determining how best to improve shipping speeds for Prime (including which items are eligible for the fastest speeds) to recommending which new podcasts Amazon Music should release.

Charlie Manzanares is a senior manager on the team that specializes in simulating how customers benefit from expansion of Prime benefits. Manzanares’s team comprises economists, applied scientists, research scientists, and business intelligence engineers who partner closely with product managers and software and data engineers.

“Our team works at the scientific intersection of structural econometrics, machine learning, and causal inference,” says Manzanares. “Building these tools often involves inventing new science, by involving scientists and engineers from a variety of backgrounds. We then utilize these tools to create scientific software at engineering scale. What’s exhilarating about this space is not just solving these scientific and technical challenges, but using these tools to make Prime better for members around the world. Moreover, the company relies on this information to make high-stakes investments. This adds an interesting layer of strategic management consulting to our work.”

Manzanares points to a recent innovation from Prime scientists that made modeling dynamic customer decisions easier.

“Prime members make ‘dynamic’ choices over whether, and when, to become and remain Prime members. Dynamic customer choices often involve tradeoffs between value and flexibility,” he explains. “For example, in the US, most customers choose between joining Prime’s annual or monthly plans, or ending their membership or not joining Prime at all. Over time, this tradeoff results in many possible permutations of choices. For example, a member might choose monthly Prime for two months, then join annual Prime. Or they might choose monthly Prime for two months, remain non-Prime for three, then join monthly Prime for five more months, etc.”

Modeling the impact of these choice permutations in a way that is useful for counterfactual simulation is theoretically and computationally challenging.

The theoretical challenge is an “identification” problem, Manzanares explains. The identification problem makes it hard for scientists to determine which Prime feature caused members to make a particular choice.

“For example, did a member who engaged with Prime shipping and Prime Video choose to renew because they valued Prime shipping highly, but Prime Video less, or Prime Video highly, and Prime shipping less?” asks Manzanares. “This problem is common to both dynamic and ‘static’ choice problems (i.e., choice problems where choice values are not influenced by past choices). The computational problem — which is pervasive in dynamic choice settings — is generated by the sheer number of possible choices, which is labeled the ‘curse of dimensionality’ in dynamic programming literature.”
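The scale of the computational problem is easy to see with a back-of-the-envelope count. The sketch below assumes, as a simplification, that a customer can freely pick one of three options every month (monthly, annual, or no membership); it ignores the fact that an annual plan commits the member for a year, and the function name is illustrative.

```python
from itertools import product

OPTIONS = ("monthly", "annual", "none")  # simplified per-month choice set


def count_paths(periods: int, options: int = len(OPTIONS)) -> int:
    """Number of distinct choice sequences over the given number of periods."""
    return options ** periods


# Enumerating paths explicitly is only feasible for very short horizons.
three_month_paths = list(product(OPTIONS, repeat=3))
assert ("monthly", "monthly", "annual") in three_month_paths

for months in (3, 6, 12, 24):
    print(f"{months:>2} months: {count_paths(months):,} possible paths")
```

The count grows exponentially with the horizon, which is why methods that evaluate every path become impractical and approximation techniques are needed.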

To overcome these challenges, the team combined new techniques from inverse reinforcement learning with an old assumption from structural econometrics. Inverse reinforcement learning is a machine learning paradigm popularized in the late 1990s and early 2000s.

As opposed to reinforcement learning, which learns behavioral “policies” through active experimentation, inverse reinforcement learning learns “reward” or “utility” functions from actual customer behavior. It then uses estimated utility functions to make choices in new settings. Structural econometrics is an older paradigm with a rich literature and has been used for these types of exercises since the 1940s.
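To give a feel for learning utilities from observed behavior, the sketch below uses the simplest possible case: a static multinomial logit model, where choice shares alone reveal utilities once one action's utility is fixed at zero. This is a standard discrete-choice normalization, not the dynamic method the team developed; the function name and the choice data are made up, and how closely this normalization matches the paper's anchor-action assumption is not established by this article.

```python
import math
from collections import Counter


def utilities_from_choices(choices, anchor):
    """Recover logit utilities relative to an anchor action with utility 0."""
    counts = Counter(choices)
    total = len(choices)
    shares = {action: count / total for action, count in counts.items()}
    # Under a logit model, P(a) is proportional to exp(u_a), so
    # u_a - u_anchor = log(P(a) / P(anchor)). Fixing u_anchor = 0 pins the level;
    # without it, adding a constant to every utility gives the same shares.
    return {action: math.log(p / shares[anchor]) for action, p in shares.items()}


# Synthetic observed behavior: 50 annual, 30 monthly, 20 non-members.
observed = ["annual"] * 50 + ["monthly"] * 30 + ["none"] * 20
utilities = utilities_from_choices(observed, anchor="none")
for action, value in sorted(utilities.items()):
    print(f"{action:>8}: {value:+.3f}")
```

The need to fix one action's utility is a small instance of an identification problem: the data pin down only differences between utilities, so an extra assumption is required before the estimates can be used for counterfactual simulation.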

“On the one hand, inverse reinforcement learning draws upon modern machine learning techniques. These techniques allow for rich approximations in complex settings,” says Manzanares. “On the other hand, structural econometrics has already solved many complex theoretical issues related to counterfactual simulation. These solutions often predate the development of modern machine learning and computation. This dichotomy creates opportunities for intellectual arbitrage between literatures.”

The team’s approach to the challenge is described in the paper “Deep PQR: Solving Inverse Reinforcement Learning using Anchor Actions,” which was published at the 2020 International Conference on Machine Learning.

“The findings presented in the paper are applicable across multiple fields,” says Manzanares. “That’s not surprising since the paper’s insights were made possible by collaboration across multiple disciplines.”

Prime scientists use inverse-reinforcement models to develop insights. These insights show how Prime should evolve to meet customer needs. For example, how should Prime evolve to best meet the needs of Gen Z, who engage more heavily with digital benefits (video, music, gaming)? How can grocery delivery and pickup maximize customer convenience?

These questions multiply as Prime expands globally. In international marketplaces, especially emerging ones, customer needs vary widely. For example, how might Prime serve customers in a marketplace like India, where rural and urban needs might be very different? Experimentation, Manzanares notes, becomes critical.

“The process of discovering what customers want across the world is a lot of fun,” he says. “Combine that with building cutting-edge science in partnership with extremely talented science, engineering, and business professionals, and this makes Prime a really rewarding place to be a scientist.”
