Aerial photo of the San Diego waterfront on an overcast day
Aerial photo of the San Diego waterfront on an overcast day
Credit: Jerry Uomala / Getty Images / iStockphoto

Amazon at AEA: The crossroads of economics and AI

Pat Bajari, VP and chief economist for Amazon's Core AI group, on his team's new research and what it says about economists' role at Amazon.

The 2020 meeting of the American Economic Association begins on January 3 in San Diego, and among the Amazon economists attending will be Pat Bajari, VP and chief economist for Amazon’s Core AI group, who is a coauthor on two papers accepted to the conference.

Pat Bajari
Pat Bajari, Amazon vice president and chief economist
Carl Clark, Amazon Imaging Studio

Economic research at Amazon, Bajari explains, is distinctive in the way it crosses disciplinary boundaries. “These disciplines are like their own worlds,” he says. “It’s easy to get siloed doing engineering, machine learning, natural-language processing, computer vision, stats, operational research, economics, and so on. But when these disciplines interact, you get more interesting and useful results.”

Apples to apples

One of Bajari’s two papers at AEA is a case in point. Titled “New Goods, Productivity and the Measurement of Inflation: Using Machine Learning to Improve Quality Adjustments,” it applies new AI techniques to an old problem in the calculation of inflation rates.

“If you look at a product line, over the course of a year, 80% of the products might vanish,” Bajari explains. “When you calculate the rate of inflation, you’re usually doing an annual measure of price changes. But if 80% of products are gone, that measurement can be inaccurate.”

A famous example, Bajari explains, is personal computers in the late ’90s. At the time, he says, 95% of computers would sell out in the course of a year. The computers on the shelves one January could have very different technical specifications from those on the shelves a year later, making direct price comparison misleading.

Consequently, the standard method of calculating inflation indicated little change in the price of personal computers, even though the price of computational power was plummeting. The classical solution to this problem is so-called hedonic pricing, in which the price of a product is factored into several components, which can be compared independently.

So, for instance, late-’90s computers could be compared according to their price per megahertz of processing speed, per megabyte of random-access memory, per megabyte of storage, and so on. Bajari’s first AEA paper updates hedonic pricing for the age of deep learning. On the paper, he joins Victor Chernozhukov, a professor of economics at MIT and a senior principal economist in Amazon’s Core AI group; Ramon Huerta, a research scientist at the University of California, San Diego, and a principal applied scientist in the Amazon North American Consumer group; George Monokroussos, a former senior economist at Amazon; and three other members of Core AI: Zhihao Cen, a senior applied scientist Junbo Li; a senior software engineer; and Manoj Manukonda, a senior data engineer.

Instead of factorizing product prices themselves, the researchers trained a machine learning model to identify correlations between product features and prices. If the model is trained on data from one year but fed descriptions of products on the shelves a year later, it will spit out the products’ prices according to the earlier valuation. Comparing the predicted and actual prices provides a measure of inflation.

Hedonic-pricing model
To predict a product's price, a new machine learning model factors in numeric data such as number of reviews and average star rating, textual data such as product descriptions and titles, and even visual data such as product shots.
Stacy Reilly

Internally, Amazon can use this type of model to analyze business trends. But if central bankers applied a similar model to products representative of the economy as a whole, they could observe inflation rate variations in real time.

“If central bankers have a view with a one-day latency, it could give them signals about whether monetary policy is too loose or too tight,” Bajari explains.

Feedback loops

Bajari’s other AEA paper examines the design of randomized experiments. It reports work done in collaboration with Guido Imbens, a member of Core AI and the Applied Econometrics Professor and professor of economics at Stanford Business School; Thomas Richardson, a professor of statistics at the University of Washington and an Amazon Scholar; Brian Burdick, the director of Core AI; Ido Rosen, a principal software engineer in Burdick’s group; and James McQueen, a senior applied scientist in Amazon’s Customer Behavior Analytics group.

The most familiar example of a randomized experiment is a drug trial, where some subjects receive an experimental drug, some receive a placebo, and their outcomes are compared. But randomized experiments are also common in industry.

Suppose, for instance, that Amazon researchers develop a new algorithm for calculating how much of a product to restock at a fulfillment center as a function of recent sales rates and supply on hand. In simulations, the algorithm promises more reliable delivery and greater customer satisfaction, but there’s a question about whether those theoretical gains will translate into practice.

Amazon might conduct a randomized experiment in which some fulfillment centers use the new algorithm, some use the old algorithm, and the average results are compared. Such experiments, however, are liable to so-called spillover effects, where the “treatment” — in this case, the deployment of the new algorithm — ends up having consequences for the control group — in this case, the fulfillment centers using the old algorithm.

Suppose that the treatment results in faster delivery of certain products, and consequently, those products grow in popularity. Amazon’s recommendation engine begins recommending those products more frequently, even to customers served by fulfillment centers using the old restock algorithm. Demand for the products spikes, and the control group starts selling through its stocks — a negative outcome, in terms of the experimental design. When the results of the experiment are tallied, the control group’s performance is artificially depreciated because of the treatment.

“This type of spillover does not happen in standard medical-drug trials, because one individual taking the new drug does not affect the outcome for another individual taking the placebo,” Imbens says. “But it is a feature of many experiments at Amazon and similar companies, where we have complex feedback loops.”

Exerting controls

One way to identify such spillover effects would be to ensure that, for every product that receives the treatment, there’s a related product that doesn’t, regardless of where it’s stored. That would make it possible to determine whether demand spikes are affecting product classes as a whole or are limited to treated products. But it complicates the experimental design.

The researchers’ paper presents an ambitious blueprint for performing such complex experiments. It describes how to simultaneously measure average effects and identify spillovers within a single experiment — by, for instance, systematically varying the treatment’s application to pairs of fulfillment centers and products. It also presents statistical techniques for analyzing the results of such experiments.

The researchers’ blueprint could be applied in a host of different contexts — movie recommendations, rideshare services, short-term-property-rental sites, homebuying sites, retail sites, job search sites, and the like. It also generalizes from double randomization — a given product can receive different treatments at different fulfillment centers, and a given fulfillment center can treat some products and not others — to higher-dimensional randomization — varying treatment according to season, delivery destination, vendor, and so on.

“When people do these kinds of experiments, they usually randomize only one variable at a time,” Bajari explains. “We want to go further with this idea, where we use multiple randomizations to learn supply responses, demand responses, equilibria — all with the goal to keep improving the customer experience.”

Helping identify the causal relationships that underlie the data, Bajari says, is one of the ways in which the economic perspective is useful. But another is in deciding what to measure, across what time frame.

“Usually, ML and AI are tools for making decisions,” Bajari says. “If you have a particular product, how much should you stock of it? You want to make that decision in a present-value-maximizing way. You don’t want to sacrifice long-term success for short-term gains. If you only looked at short-term numbers, we would cut safety stock by half. Then customers would be more apt to find products out of stock, which means they might be less likely to shop on Amazon, which in turn could hurt growth.

“If you want to use ML and AI to make decisions in a rational way, you need a way to trade off long-term and short-term results. This is a place where economists help. What should a firm rationally optimize for? That’s just squarely in economics. That’s what we do.”

Amazon's involvement at AEA/ASSA

Paper and presentation schedule

Friday, 1/3 | 2:30 pm - 4:30 pm | Marriott Marquee San Diego | San Diego Ballroom A

"GDPR and the Home Bias of Venture Investment"

Jian Jia (Illinois Institute of Technology) · Ginger Jin (University of Maryland/Amazon Scholar) · Liad Wagman (Illinois Institute of Technology)

"New Goods, Productivity and the Measurement of Inflation: Using Machine Learning to Improve Quality Adjustments"

Pat Bajari (Amazon) · Zhihao Cen (Amazon) · Victor Chernozhukov (MIT/Amazon) · Ramon Huerta (UCSD/Amazon) · Junbo Li (Amazon) · Manoj Manukonda (Amazon) · George Monokroussos (Wayfair)

"Double Randomized Online Experiments"

Pat Bajari (Amazon) · Brian Burdick (Amazon) · Guido Imbens (Stanford Graduate School of Business/Amazon) · James McQueen (Amazon) · Thomas Richardson (University of Washington/Amazon Scholar) · Ido Rosen (Amazon)

Saturday, 1/4 | 2:30 pm - 4:30 pm | Marriott Marquis San Diego | Del Mar

"Sustained Credit Card Borrowing"

Sergei Koulayev (Amazon) · Daniel Grodzicki (Pennsylvania State University/Consumer Financial Protection Bureau)

Workshops

Econometrica Session: New Developments in Econometrics

Chair: Guido Imbens

Research areas

Related content

US, CA, Palo Alto
The Amazon Search team creates powerful, customer-focused search and advertising solutions and technologies. Whenever a customer visits an Amazon site worldwide and types in a query or browses through product categories, the Amazon Search services go to work. We design, develop, and deploy high performance, fault-tolerant distributed search systems used by millions of Amazon customers every day. Our team works to maximize the quality and effectiveness of the search experience for visitors to Amazon websites worldwide.
JP, Tokyo
The Amazon Logistics (AMZL) Team is responsible for the acquisition, design, construction, and management of all facilities in the Amazon Delivery Station Network. AMZL is looking for a talented and passionate Data Scientist to help shape its Last Mile business with technical strategies and solutions, by processing, analyzing and interpreting huge data sets. You should be comfortable with ambiguity, problem solving and enjoy working in a fast-paced, diverse and dynamic environment. Using analytical rigor and statistical methods, you mine through data to identify opportunities for Amazon and our delivery channels. And you collaborate with other scientists, engineers, Product and Program Managers to deploy new products and solutions. [More Information] Last Mile Department Data Analyst/BI Engineer Tokyo Office *Amazon is committed to a diverse and inclusive workplace. Amazon is an equal opportunity employer and does not discriminate on the basis of race, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or other legally protected status. For individuals with disabilities who would like to request an accommodation, visit https://www.amazon.jobs/disability/jp Key job responsibilities Creating a roadmap of the most challenging business questions and use data to articulate possible root cause analysis and solutions Managing and executing entire projects or components of large projects from start to finish including project management, data gathering and manipulation, synthesis and modeling, problem solving, and communication of insights Partnering with Product, Program and Engineering teams to design and run models, research new algorithms, and prove incrementality and drive growth Understanding drivers, impacts, and key influences on seller growth dynamics Developing and scaling end-to-end ML Models and solutions Automating feedback loops for algorithms in production Utilizing Amazon systems and tools to effectively work with terabytes of data About the team Last Mile Execution Analytics (LMEA) team of JP works as an integral part of Amazon Logistics to ensure that its business intelligence, analytics, tools and planning needs are met. By providing information, insight, and decision support, we strive to enable success of all parts of AMZL. Our customer set includes senior management, station operations, external vendors, long-term planning, Ops technology (Voice of the Delivery Station, Voice of the Customer), network planning, and pretty much every BI and Ops teams. Voice of Employee [Work Life Harmony] We believe, it is important to spend private time such as spending time with your family or doing anything you like to spur innovation. Amazon promotes a fulfilling and flexible work style according to the work volume and lifestyle of each employee.
US, CA, San Francisco
About Twitch Launched in 2011, Twitch is a global community that comes together each day to create multiplayer entertainment: unique, live, unpredictable experiences created by the interactions of millions. We bring the joy of co-op to everything, from casual gaming to world-class esports to anime marathons, music, and art streams. Twitch also hosts TwitchCon, where we bring everyone together to celebrate, learn, and grow their personal interests and passions. We’re always live at Twitch. Stay up to date on all things Twitch on Linkedin, Twitter and on our Blog. About the role: Twitch builds data-driven machine learning solutions across several rich problem spaces: Natural Language Processing (NLP), Recommendations, Semantic Search, Classification/Categorization, Anomaly Detection, Forecasting, Safety, and HCI/Social Computing/Computational Social Science. As an Intern, you will work with a dedicated Mentor and Manager on a project in one of these problem areas. You will also be supported by an Advisor and participate in cohort activities such as research teach backs and leadership talks. This position can also be located in San Francisco, CA or virtual. You Will: Solve large-scale data problems. Design solutions for Twitch's problem spaces Explore ML and data research
US, CA, San Francisco
About Twitch Launched in 2011, Twitch is a global community that comes together each day to create multiplayer entertainment: unique, live, unpredictable experiences created by the interactions of millions. We bring the joy of co-op to everything, from casual gaming to world-class esports to anime marathons, music, and art streams. Twitch also hosts TwitchCon, where we bring everyone together to celebrate, learn, and grow their personal interests and passions. We’re always live at Twitch. Stay up to date on all things Twitch on Linkedin, Twitter and on our Blog. About the role: Twitch builds data-driven machine learning solutions across several rich problem spaces: Natural Language Processing (NLP), Recommendations, Semantic Search, Classification/Categorization, Anomaly Detection, Forecasting, Safety, and HCI/Social Computing/Computational Social Science. As an Intern, you will work with a dedicated Mentor and Manager on a project in one of these problem areas. You will also be supported by an Advisor and participate in cohort activities such as research teach backs and leadership talks. This position can also be located in San Francisco, CA or virtual. You Will: Solve large-scale data problems. Design solutions for Twitch's problem spaces Explore ML and data research
LU, Luxembourg
Are you a talented and inventive scientist with a strong passion about modern data technologies and interested to improve business processes, extracting value from the data? Would you like to be a part of an organization that is aiming to use self-learning technology to process data in order to support the management of the procurement function? The Global Procurement Technology, as a part of Global Procurement Operations, is seeking a skilled Data Scientist to help build its future data intelligence in business ecosystem, working with large distributed systems of data and providing Machine Learning (ML) and Predictive Modeling expertise. You will be a member of the Data Engineering and ML Team, joining a fast-growing global organization, with a great vision to transform the Procurement field, and become the role model in the market. This team plays a strategic role supporting the core Procurement business domains as well as it is the cornerstone of any transformation and innovation initiative. Our mission is to provide a high-quality data environment to facilitate process optimization and business digitalization, on a global scale. We are supporting business initiatives, including but not limited to, strategic supplier sourcing (e.g. contracting, negotiation, spend analysis, market research, etc.), order management, supplier performance, etc. We are seeking an individual who can thrive in a fast-paced work environment, be collaborative and share knowledge and experience with his colleagues. You are expected to deliver results, but at the same time have fun with your teammates and enjoy working in the company. In Amazon, you will find all the resources required to learn new skills, grow your career, and become a better professional. You will connect with world leaders in your field and you will be tackling Data Science challenges to ensure business continuity, by taking the right decisions for your customers. As a Data Scientist in the team, you will: -be the subject matter expert to support team strategies that will take Global Procurement Operations towards world-class predictive maintenance practices and processes, driving more effective procurement functions, e.g. supplier segmentation, negotiations, shipping supplies volume forecast, spend management, etc. -have strong analytical skills and excel in the design, creation, management, and enterprise use of large data sets, combining raw data from different sources -provide technical expertise to support the development of ML models to facilitate intelligent digital services, such as Contract Lifecycle Management (CLM) and Negotiations platform -cooperate closely with different groups of stakeholders, e.g. data/software engineers, product/program managers, analysts, senior leadership, etc. to evaluate business needs and objectives to set up the best data management environment -create and share with audiences of varying levels technical papers and presentations -deal with ambiguity, prioritizing needs, and delivering results in a dynamic environment Basic qualifications -Master’s Degree in Computer Science/Engineering, Informatics, Mathematics, or a related technical discipline -3+ years of industry experience in data engineering/science, business intelligence or related field -3+ years experience in algorithm design, engineering and implementation for very-large scale applications to solve real problems -Very good knowledge of data modeling and evaluation -Very good understanding of regression modeling, forecasting techniques, time series analysis, machine-learning concepts such as supervised and unsupervised learning, classification, random forest, etc. -SQL and query performance tuning skills Preferred qualifications -2+ years of proficiency in using R, Python, Scala, Java or any modern language for data processing and statistical analysis -Experience with various RDBMS, such as PostgreSQL, MS SQL Server, MySQL, etc. -Experience architecting Big Data and ML solutions with AWS products (Redshift, DynamoDB, Lambda, S3, EMR, SageMaker, Lex, Kendra, Forecast etc.) -Experience articulating business questions and using quantitative techniques to arrive at a solution using available data -Experience with agile/scrum methodologies and its benefits of managing projects efficiently and delivering results iteratively -Excellent written and verbal communication skills including data visualization, especially in regards to quantitative topics discussed with non-technical colleagues
US, CA, San Francisco
About Twitch Launched in 2011, Twitch is a global community that comes together each day to create multiplayer entertainment: unique, live, unpredictable experiences created by the interactions of millions. We bring the joy of co-op to everything, from casual gaming to world-class esports to anime marathons, music, and art streams. Twitch also hosts TwitchCon, where we bring everyone together to celebrate, learn, and grow their personal interests and passions. We’re always live at Twitch. Stay up to date on all things Twitch on Linkedin, Twitter and on our Blog. About the role: Twitch builds data-driven machine learning solutions across several rich problem spaces: Natural Language Processing (NLP), Recommendations, Semantic Search, Classification/Categorization, Anomaly Detection, Forecasting, Safety, and HCI/Social Computing/Computational Social Science. As an Intern, you will work with a dedicated Mentor and Manager on a project in one of these problem areas. You will also be supported by an Advisor and participate in cohort activities such as research teach backs and leadership talks. This position can also be located in San Francisco, CA or virtual. You Will: Solve large-scale data problems. Design solutions for Twitch's problem spaces Explore ML and data research
US, CA, San Francisco
About Twitch Launched in 2011, Twitch is a global community that comes together each day to create multiplayer entertainment: unique, live, unpredictable experiences created by the interactions of millions. We bring the joy of co-op to everything, from casual gaming to world-class esports to anime marathons, music, and art streams. Twitch also hosts TwitchCon, where we bring everyone together to celebrate, learn, and grow their personal interests and passions. We’re always live at Twitch. Stay up to date on all things Twitch on Linkedin, Twitter and on our Blog. About the role: Twitch builds data-driven machine learning solutions across several rich problem spaces: Natural Language Processing (NLP), Recommendations, Semantic Search, Classification/Categorization, Anomaly Detection, Forecasting, Safety, and HCI/Social Computing/Computational Social Science. As an Intern, you will work with a dedicated Mentor and Manager on a project in one of these problem areas. You will also be supported by an Advisor and participate in cohort activities such as research teach backs and leadership talks. This position can also be located in San Francisco, CA or virtual. You Will: Solve large-scale data problems. Design solutions for Twitch's problem spaces Explore ML and data research
US, CA, San Francisco
About Twitch Launched in 2011, Twitch is a global community that comes together each day to create multiplayer entertainment: unique, live, unpredictable experiences created by the interactions of millions. We bring the joy of co-op to everything, from casual gaming to world-class esports to anime marathons, music, and art streams. Twitch also hosts TwitchCon, where we bring everyone together to celebrate, learn, and grow their personal interests and passions. We’re always live at Twitch. Stay up to date on all things Twitch on Linkedin, Twitter and on our Blog. About the role: Twitch builds data-driven machine learning solutions across several rich problem spaces: Natural Language Processing (NLP), Recommendations, Semantic Search, Classification/Categorization, Anomaly Detection, Forecasting, Safety, and HCI/Social Computing/Computational Social Science. As an Intern, you will work with a dedicated Mentor and Manager on a project in one of these problem areas. You will also be supported by an Advisor and participate in cohort activities such as research teach backs and leadership talks. This position can also be located in San Francisco, CA or virtual. You Will: Solve large-scale data problems. Design solutions for Twitch's problem spaces Explore ML and data research
US, CA, San Francisco
About Twitch Launched in 2011, Twitch is a global community that comes together each day to create multiplayer entertainment: unique, live, unpredictable experiences created by the interactions of millions. We bring the joy of co-op to everything, from casual gaming to world-class esports to anime marathons, music, and art streams. Twitch also hosts TwitchCon, where we bring everyone together to celebrate, learn, and grow their personal interests and passions. We’re always live at Twitch. Stay up to date on all things Twitch on Linkedin, Twitter and on our Blog. About the role: Twitch builds data-driven machine learning solutions across several rich problem spaces: Natural Language Processing (NLP), Recommendations, Semantic Search, Classification/Categorization, Anomaly Detection, Forecasting, Safety, and HCI/Social Computing/Computational Social Science. As an Intern, you will work with a dedicated Mentor and Manager on a project in one of these problem areas. You will also be supported by an Advisor and participate in cohort activities such as research teach backs and leadership talks. This position can also be located in San Francisco, CA or virtual. You Will: Solve large-scale data problems. Design solutions for Twitch's problem spaces Explore ML and data research
US, WA, Seattle
We are a team of doers working passionately to apply cutting-edge advances in deep learning in the life sciences to solve real-world problems. As a Senior Applied Science Manager you will participate in developing exciting products for customers. Our team rewards curiosity while maintaining a laser-focus in bringing products to market. Competitive candidates are responsive, flexible, and able to succeed within an open, collaborative, entrepreneurial, startup-like environment. At the leading edge of both academic and applied research in this product area, you have the opportunity to work together with a diverse and talented team of scientists, engineers, and product managers and collaborate with others teams. Location is in Seattle, US Embrace Diversity Here at Amazon, we embrace our differences. We are committed to furthering our culture of inclusion. We have ten employee-led affinity groups, reaching 40,000 employees in over 190 chapters globally. We have innovative benefit offerings, and host annual and ongoing learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon (gender diversity) conferences. Amazon’s culture of inclusion is reinforced within our 14 Leadership Principles, which remind team members to seek diverse perspectives, learn and be curious, and earn trust Balance Work and Life Our team puts a high value on work-life balance. It isn’t about how many hours you spend at home or at work; it’s about the flow you establish that brings energy to both parts of your life. We believe striking the right balance between your personal and professional life is critical to life-long happiness and fulfillment. We offer flexibility in working hours and encourage you to find your own balance between your work and personal lives Mentor & Grow Careers Our team is dedicated to supporting new members. We have a broad mix of experience levels and tenures, and we’re building an environment that celebrates knowledge sharing and mentorship. Our senior members enjoy one-on-one mentoring and thorough, but kind, code reviews. We care about your career growth and strive to assign projects based on what will help each team member develop into a better-rounded engineer and enable them to take on more complex tasks in the future. Key job responsibilities • Manage high performing engineering and science teams • Hire and develop top-performing engineers, scientists, and other managers • Develop and execute on project plans and delivery commitments • Work with business, data science, software engineer, biological, and product leaders to help define product requirements and with managers, scientists, and engineers to execute on them • Build and maintain world-class customer experience and operational excellence for your deliverables