An overhead shot inside an Amazon fulfillment center shows hundreds of boxes on conveyor belts along with people monitoring the flow of those packages
Amazon's scale makes picking the right package for each product a challenge. Fortunately, machine learning approaches — particularly deep learning — thrive on big data and massive scale. These tools have helped Amazon reduce per-shipment packaging weight by 36% and eliminate more than a million tons of packaging.

How pioneering deep learning is reducing Amazon’s packaging waste

A combination of deep learning, natural language processing, and computer vision enables Amazon to hone in on the right amount of packaging for each product.

Finding the right amount of packaging to ship an item can be challenging — and at Amazon, an ever-changing catalog of hundreds of millions of products makes it an ongoing challenge. In addition, Amazon’s scale also means it is impossible to solve this challenge using manual inspection to choose packaging for each and every item. For the same reason, general packaging rules and run-of-the-mill logic just won’t cut it. What’s required is a cutting-edge-smart automated mechanism that can adapt on the fly to changing circumstances.

Prasanth Meiyappan, top right, an applied scientist, and Matthew Bales, a research science manager, authored "Reducing Amazon’s packaging waste using multimodal deep learning". Their position paper was one of the 10 most read research papers on Amazon Science in 2021.

Fortunately, machine learning approaches — particularly deep learning — thrive on big data and massive scale, and a pioneering combination of natural language processing and computer vision is enabling Amazon to hone in on using the right amount of packaging. These tools have helped Amazon drive change over the past six years, reducing per-shipment packaging weight by 36% and eliminating more than a million tons of packaging, equivalent to more than 2 billion shipping boxes.

“When I started at Amazon in 2017, we had a lot of physical testing of products going on, but not a scalable mechanism that could assess hundreds of millions of products to identify the optimal packaging type for each product,” says research science manager Matthew Bales. Bales, who is also a physicist, heads up machine learning within Amazon’s Customer Packaging Experience team.

“Statistical tests were the first piece, but they are essentially only useful when products have already been shipped in more than one package type. We wanted the capability to predict how a product would fare in a less-protective, lighter, and more sustainable package type. And once you're in that predictive space, you need machine learning,” Bales explains.

The power of customer feedback

To make a prediction about whether a given product could be safely shipped in a particular package type, Bales and his colleagues built a ML model based largely on the text-based data that customers find on the Amazon Store — the item name, description, price, package dimensions, and so on.

Related content
As office buildings become smarter, it is easier to configure them with sustainability management in mind.

The model was trained on millions of examples of products successfully delivered in various packaging types, and on examples of products that arrived damaged in given packaging types. Amazon has access to almost real-time feedback when a product is not sufficiently protected by its packaging, because customers report it via the Online Returns Center and other forms of feedback, including product reviews.

“Customer feedback is paramount,” says Bales. “It powers all of our statistical testing.”

The model learned that certain keywords were particularly important when making packaging decisions. For example, keywords that indicated that a padded mailer would not be the right packaging included “ceramic”, “grocery”, “mug” and “glass”. These products were better shipped in a box. Keywords that suggested mailers were the right choice included “multipack” and “bag.” Those indicated the product might already have some form of protective packaging.

“The portion of the model that's learning from the Amazon Store has learned really well what the product is, and about its dimensions,” says Bales.

Reducing Amazon’s packaging waste using multimodal deep learning

It’s an important step in the journey, but automatically learning what a product is represents only half the battle. Equally important is how the vendor packaged the product before sending it to a fulfillment center. For example, a ceramic mug may be packaged in clear plastic bag, or in a sturdy box.

To identify product packaging at scale, computer vision needed to be deployed. The ML team already knew that the product images on the Amazon Store weren’t helpful when selecting packaging. For example, a multipack of LED bulbs might be illustrated by a picture of a single, unpacked bulb, suggesting it is fragile, yet the multipack is, in fact, safely packaged by the vendor and doesn’t require additional packaging. It is best shipped in its own container.

Bales’s team addressed this challenge by using Amazon’s own image data. When products are delivered to fulfillment centers, many are sent via conveyor belt through special computer-vision tunnels equipped with cameras that capture images of the products from multiple angles. These tunnels are used for many things, including ascertaining product dimensions and spotting defects.

Prasanth Meiyappan, an Amazon applied scientist, expanded the training of the team’s ML model to include these standardized product images in addition to the text classifiers from the catalog — a multimodal approach.

Our model detects the packaging edges to determine shape, identifies a perforation, a bag around the product, or light shining through a glass bottle.
Prasanth Meiyappan

“Our model detects the packaging edges to determine shape, identifies a perforation, a bag around the product, or light shining through a glass bottle.” Meiyappan explains. But to some extent, how the model makes its judgement about what it detects in images is hard for a human to discern, because the product features identified and weighted by the model tend to be complex.

“The important thing,” Bales notes, “is that the packaging decisions generated by the model are empirically accurate.”

Incorporating both text-based and visual data improved the ML model’s performance by as much as 30%, compared with using text-based data alone. Bales and Meiyappan have produced a position paper describing their work.

“When the model is certain of the best package type for a given product, we allow it to auto-certify it for that pack type,” says Bales. “When the model is less certain, it flags a product and its packaging for testing by a human.” The technology is currently being applied to product lines across North America, Europe, and Japan — automatically reducing waste at a growing scale.

“It’s a triple win,” says Bales. “Reduced waste, increased customer satisfaction, and lower costs.”

Balancing act

To arrive at this triple win, though, the team also had to take on a thorny challenge encountered frequently in the ML domain: class imbalance. In a nutshell, the problem is this: if you want an ML model to learn effectively, you ideally provide it with as many examples of failures as successes, so it can learn to differentiate effectively between the two.

The data used to train the model had many millions of examples of product/package pairings, yet depending on the package type, as little as 1% of those examples were for packages that turned out to be unsuitable in some way for the product within.

The machine learning literature to do with packaging is pretty sparse. Not many people deal with the kind of datasets we are dealing with in the packaging domain.
Prasanth Meiyappan

“Prior to implementing ML, we’ve shipped some product in envelopes and mailers for some time,” says Bales. “So, we had loads of examples of things that were good in mailers, but didn't have a lot of examples of things that were bad in mailers. ML models have problems with this kind of overwhelming imbalance.”

“The machine learning literature to do with packaging is pretty sparse,” Meiyappan says. “Not many people deal with the kind of datasets we are dealing with in the packaging domain. How effective a technique is in dealing with dataset imbalance is both domain and dataset specific.”

Thus the team’s approach to the class imbalance problem was primarily experimental. And of the six approaches they applied — four data based, two algorithm based — the clear winner produced a marked improvement in model accuracy. That was a data-based approach called two-phase learning with random under sampling which focuses the model on the minority class in the first phase of training and then on all of the data in the second. “In our position paper we share that knowledge with the ML community,” says Bales, “so that anyone who encounters a similar problem might choose to try this approach for themselves, to see if it also works in their problem space.”

What’s next

The team said they are eager to expand the use of this tool by training the model to understand all Amazon’s customers languages while also incorporating the unique aspects of fulfilment in each country.

Read the Amazon Sustainability Report

Amazon is committed to building a sustainable business for customers and the planet. Learn more about Amazon's goals, strategies, and policies in the Amazon Sustainability Report.

While Amazon scientists continue to research other ways to utilize machine learning to eliminate waste, the company is also working to reduce packaging waste throughout the e-commerce supply chain. Amazon is, for example, increasingly incentivizing its vendors to create optimized e-commerce packaging for themselves that saves space and materials without compromising product protection.

The company’s Shipment Zero goal is to deliver 50% of shipments with net-zero carbon by 2030, which from a packaging perspective means shipping products without added Amazon packaging or in carbon-neutral packaging. This is part of the Amazon’s wider Climate Pledge — a commitment to reach net-zero carbon by 2040, a decade earlier than the 2050 emissions target of the Paris Agreement.

View from space of a connected network around planet Earth representing the Internet of Things.
Sign up for our newsletter

Related content

US, WA, Seattle
Innovators wanted! Are you an entrepreneur? A builder? A dreamer? This role is part of an Amazon Special Projects team that takes the company’s Think Big leadership principle to the extreme. We focus on creating entirely new products and services with a goal of positively impacting the lives of our customers. No industries or subject areas are out of bounds. If you’re interested in innovating at scale to address big challenges in the world, this is the team for you. Here at Amazon, we embrace our differences. We are committed to furthering our culture of inclusion. We have thirteen employee-led affinity groups, reaching 40,000 employees in over 190 chapters globally. We are constantly learning through programs that are local, regional, and global. Amazon’s culture of inclusion is reinforced within our 16 Leadership Principles, which remind team members to seek diverse perspectives, learn and be curious, and earn trust. Our team highly values work-life balance, mentorship and career growth. We believe striking the right balance between your personal and professional life is critical to life-long happiness and fulfillment. We care about your career growth and strive to assign projects and offer training that will challenge you to become your best. Key job responsibilities • Develop automated laboratory workflows. • Perform data QC, document results, and communicate to stakeholders. • Maintain updated understanding and knowledge of methods. • Identify and escalate equipment malfunctions; troubleshoot common errors. • Participate in the updating of protocols and database to accurately reflect the current practices. • Maintain equipment and instruments in good operating condition • Adapt to unexpected schedule changes and respond to emergency situations, as needed. We are open to hiring candidates to work out of one of the following locations: Seattle, WA, USA
US, VA, Arlington
Amazon’s mission is to be the most customer centric company in the world. The Workforce Staffing (WFS) organization is on the front line of that mission by hiring the hourly fulfillment associates who make that mission a reality. To drive the necessary growth and continued scale of Amazon’s associate needs within a constrained employment environment, Amazon has created the Workforce Intelligence (WFI) team. This team will (re)invent how Amazon attracts, communicates with, and ultimately hires its hourly associates. This team owns multi-layered research and program implementation to drive deep learning, process improvements, and strategic recommendations to global leadership. Are you passionate about data? Do you enjoy questioning the status quo? Do complex and difficult challenges excite you? If yes, this may be the team for you. The Data Scientist will be responsible for creating cutting edge algorithms, predictive and prescriptive models as well as required data models to facilitate WFS at-scale warehouse associate hiring. This role acts as an internal consultant to the marketing, biz ops and candidate experience teams covering responsibilities such as at-scale hiring process improvement, analyzing large scale candidate/associate data and being strategic to providing best candidate hiring experience to WFS warehouse associate candidates. We are open to hiring candidates to work out of one of the following locations: Arlington, VA, USA
US, WA, Seattle
Innovators wanted! Are you an entrepreneur? A builder? A dreamer? This role is part of an Amazon Special Projects team that takes the company’s Think Big leadership principle to the extreme. We focus on creating entirely new products and services with a goal of positively impacting the lives of our customers. No industries or subject areas are out of bounds. If you’re interested in innovating at scale to address big challenges in the world, this is the team for you. Here at Amazon, we embrace our differences. We are committed to furthering our culture of inclusion. We have thirteen employee-led affinity groups, reaching 40,000 employees in over 190 chapters globally. We are constantly learning through programs that are local, regional, and global. Amazon’s culture of inclusion is reinforced within our 16 Leadership Principles, which remind team members to seek diverse perspectives, learn and be curious, and earn trust. Our team highly values work-life balance, mentorship and career growth. We believe striking the right balance between your personal and professional life is critical to life-long happiness and fulfillment. We care about your career growth and strive to assign projects and offer training that will challenge you to become your best. We are open to hiring candidates to work out of one of the following locations: Seattle, WA, USA
US, WA, Seattle
Are you excited about developing generative AI and foundation models to revolutionize automation, robotics and computer vision? Are you looking for opportunities to build and deploy them on real problems at truly vast scale? At Amazon Fulfillment Technologies and Robotics we are on a mission to build high-performance autonomous systems that perceive and act to further improve our world-class customer experience - at Amazon scale. We are looking for scientists, engineers and program managers for a variety of roles. The Amazon Robotics software team is seeking a Applied Scientist to focus on large vision and manipulation machine learning models. This includes building multi-viewpoint and time-series computer vision systems. It includes using machine learning to drive hardware movement. It includes building large-scale models using data from many different tasks and scenes. This work spans from basic research such as cross domain training, to experimenting on prototype in the lab, to running wide-scale A/B tests on robots in our facilities. Key job responsibilities * Research vision - Where should we be focusing our efforts * Research delivery – Proving/dis-proving strategies in offline data or in the lab * Production studies - Insights from production data or ad-hoc experimentation. About the team This team invents and runs robots focused on grasping and packing items. These are typically 6-dof style robotic arms. Our work ranges from the long-term-research on basic science to deploying/supporting large production fleets handling billions of items per year. We are open to hiring candidates to work out of one of the following locations: Seattle, WA, USA
US, VA, Arlington
Amazon launched the Generative AI (GenAI) Innovation Center (GAIIC) in Jun 2023 to help AWS customers accelerate enterprise innovation and success with Generative AI (https://press.aboutamazon.com/2023/6/aws-announces-generative-ai-innovation-center). Customers such as Highspot, Lonely Planet, Ryanair, and Twilio are engaging with the GAI Innovation Center to explore developing generative solutions. GAIIC provides opportunities to innovate in a fast-paced organization that contributes to game-changing projects and technologies that get deployed on devices and in the cloud. As a data scientist at GAIIC, you are proficient in designing and developing advanced Generative AI based solutions to solve diverse customer problems. You will be working with terabytes of text, images, and other types of data to solve real-world problems through Gen AI. You will be working closely with account teams and ML strategists to define the use case, and with other scientists and ML engineers on the team to design experiments, and find new ways to deliver value to the customer. The successful candidate will possess both technical and customer-facing skills that will allow you to be the technical “face” of AWS within our solution providers’ ecosystem/environment as well as directly to end customers. You will be able to drive discussions with senior technical and management personnel within customers and partners. This position requires that the candidate selected be a US Citizen and currently possess and maintain an active Top Secret security clearance. About the team Work/Life Balance Our team puts a high value on work-life balance. It isn’t about how many hours you spend at home or at work; it’s about the flow you establish that brings energy to both parts of your life. We believe striking the right balance between your personal and professional life is critical to life-long happiness and fulfillment. We offer flexibility in working hours and encourage you to find your own balance between your work and personal lives. Mentorship & Career Growth Our team is dedicated to supporting new members. We have a broad mix of experience levels and tenures, and we’re building an environment that celebrates knowledge sharing and mentorship. Our senior members enjoy one-on-one mentoring and thorough, but kind, code reviews. We care about your career growth and strive to assign projects based on what will help each team member develop into a better-rounded engineer and enable them to take on more complex tasks in the future. We are open to hiring candidates to work out of one of the following locations: Arlington, VA, USA | Denver, CO, USA
US, CA, Sunnyvale
At Amazon Fashion, we are obsessed with making Amazon Fashion the most loved fashion destinations globally. We're searching for Computer Vision pioneers who are passionate about technology, innovation, and customer experience, and who are enthusiastic about making a lasting impact on the industry. You'll be working with talented scientists, engineers, and product managers to innovate on behalf of our customers. If you're fired up about being part of a dynamic, driven team, then this is your moment to join us on this exciting journey and change the world of eCommerce forever Key job responsibilities As a Applied Scientist, you will be at the forefront to define, own and drive the science that span multiple machine learning models and enabling multiple product/engineering teams and organizations. You will partner with product management and technical leadership to identify opportunities to innovate customer facing experiences. You will identify new areas of investment and work to align product roadmaps to deliver on these opportunities. As a science leader, you will not only develop unique scientific solutions, but more importantly influence strategy and outcomes across different Amazon organizations such as Search, Personalization and more. This role is inherently cross-functional and requires a strong ability to communicate, influence and earn the trust of software engineers, technical and business leadership. We are open to hiring candidates to work out of one of the following locations: Sunnyvale, CA, USA
US, CA, Sunnyvale
Are you passionate about solving unique customer-facing problems in the Amazon scale? Are you excited about utilizing statistical analysis, machine learning, data mining and leverage tons of Amazon data to learn and infer customer shopping patterns? Do you enjoy working with a diversity of engineers, machine learning scientists, product managers and user-experience designers? If so, you have found the right match! Fashion is extremely fast-moving, visual, subjective, and it presents numerous unique problem domains such as product recommendations, product discovery and evaluation. The vision for Amazon Fashion is to make Amazon the number one online shopping destination for Fashion customers by providing large selections, inspiring and accurate recommendations and customer experience. The mission of Fit science team as part of Fashion Tech is to innovate and develop scalable ML solutions to provide personalized fit and size recommendation when Amazon Fashion customers evaluate apparels or shoes online. The team is hiring a Data Scientist who has a solid background in Statistical Analysis, Machine Learning and Data Mining and a proven record of effectively analyzing large complex heterogeneous datasets, and is motivated to grow professionally as a Data Scientist. Key job responsibilities - You will work on our Science team and partner closely with applied scientists, data engineers as well as product managers, UX designers, and business partners to answer complex problems via data analysis. Outputs from your analysis will directly help improve the performance of the ML based recommendation systems thereby enhancing the customer experience as well as inform the roadmap for science and the product. - You can effectively analyze complex and disparate datasets collected from diverse sources to derive key insights. - You have excellent communication skills to be able to work with cross-functional team members to understand key questions and earn the trust of senior leaders. - You are able to multi-task between different tasks such as gap analysis of algorithm results, integrating multiple disparate datasets, doing business intelligence, analyzing engagement metrics or presenting to stakeholders. - You thrive in an agile and fast-paced environment on highly visible projects and initiatives. We are open to hiring candidates to work out of one of the following locations: Sunnyvale, CA, USA
US, CA, Sunnyvale
Are you passionate about solving unique customer-facing problem in the Amazon scale? Are you excited by developing and productizing machine learning, deep learning algorithms and leverage tons of Amazon data to learn and infer customer shopping patterns? Do you enjoy working with a diversity of engineers, machine learning scientists, product managers and user-experience designers? If so, you have found the right match! Fashion is extremely fast-moving, visual, subjective, and it presents numerous unique problem domains such as product recommendations, product discovery and evaluation. The vision for Amazon Fashion is to make Amazon the number one online shopping destination for Fashion customers by providing large selections, inspiring and accurate recommendations and customer experience. The mission of Fit science team as part of Fashion Tech is to innovate and develop scalable ML solutions to provide personalized fit and size recommendation when Amazon Fashion customers evaluate apparels or shoes online. The team is hiring Applied Scientist who has a solid background in applied Machine Learning and a proven record of solving customer-facing problems via scalable ML solutions, and is motivated to grow professionally as an ML scientist. Key job responsibilities - Tackle ambiguous problems in Machine Learning and drive full life-cycle Machine Learning projects. - Build machine learning models, perform proof-of-concept, experiment, optimize, and deploy your models into production. - Run A/B experiments, gather data, and perform statistical tests. - Establish scalable, efficient, automated processes for large-scale data mining, machine-learning model development, model validation and serving. - Work closely with software engineers and product managers to assist in productizing your ML models. We are open to hiring candidates to work out of one of the following locations: Sunnyvale, CA, USA
US, WA, Bellevue
Have you ever wondered how Amazon predicts when your order will arrive and how we ensure that it actually arrives on at the promised date/time? Have you wondered where all those Amazon semi-trucks on the road are headed? Are you passionate about increasing efficiency and reducing carbon footprint? Does the idea of having worldwide impact on Amazon's logistics network including our planes, trucks, and vans sound exciting to you? If so, then we want to talk with you! The Network Planning and Fulfillment Execution team owns and operates OR/ML and simulation systems that continually optimize the distribution of tens of millions of products across Amazon’s warehouses in the most cost-effective manner, utilizing large scale optimization techniques and distributed computing in trying to reduce overall transportation costs while improving the customer experience. We are focused on saving hundreds of millions of dollars using big data technologies, cutting edge science, machine learning, and scalable distributed software on the cloud that automates and optimizes inventory and shipments to customers under the uncertainty of demand, pricing and supply. We’re looking for a passionate, results-oriented, and inventive Research Scientist who can create and improve OR/ML models for our outbound transportation planning systems. In addition, you will be working on design, development and evaluation of highly innovative OR and ML models for solving complex business problems in the area of outbound transportation planning systems. More specifically, you will be developing a Mathematical Optimization model towards short term Origin-Destination flows that are inventory aware and adhere to facility capacities given destination demand. This will also require you to build machine learning models to predict inventory N weeks out (N<13 Weeks) and ML models to calibrate inventory bounds and math model errors. You will work closely with our product managers and software engineers to disambiguate complex supply chain problems and create ML solutions to solve those problems at scale. You will directly impact our direct customers, and even play with big data and incredible scale in the background. Watch http://bit.ly/amazon-scot to get the big picture. Key job responsibilities As part of your daily work you will: * Design, development and evaluation of highly innovative OR/ML models for solving complex business problems. * Analyze and extract relevant information from large amounts of data to help automate and optimize key processes. * Research and apply the latest ML techniques and best practices from both academia and industry. * Think about customers and how to improve the customer delivery experience. * Use and analytical techniques to create scalable solutions for business problems. * Work closely with data & software engineering teams to build model implementations and integrate successful models and algorithms in production systems at very large scale. * Technically lead and mentor other scientists in team. * Establish scalable, efficient, automated processes for large scale data analyses, model development, model validation and model implementation. A day in the life This is a great role for someone who likes to learn new things. You will have the opportunity to learn all about how Amazon plans for and executes within it's logistics network including Fulfillment Centers, Sort Centers, Delivery Stations, and more. In this role, you will be a design and develop Optimization and Machine Learning models with significant scope, impact, and high visibility. Your solutions will impact business segments worth many-billions-of-dollars and geographies spanning multiple countries and markets. From day one, you will be working with bar raising scientists, engineers, and designers. You will also collaborate with the broader science community in Amazon to broaden the horizon of your work. Successful candidates must thrive in fast-paced environments, which encourage collaborative and creative problem solving, be able to measure and estimate risks, constructively critique peer research, and align research focuses with the Amazon's strategic needs. We look for individuals who know how to deliver results and show a desire to develop themselves, their colleagues, and their career. About the team Network Planning and Fulfillment Execution Science team contains a group of scientists with different technical backgrounds including Machine Learning and Operations Research, who will collaborate closely with you on your projects. Our team directly supports multiple functional areas across Fulfillment Optimization and the research needs of the corresponding product and engineering teams. We tackle some of the most mathematically complex challenges in facility and transportation planning to improve Amazon's operational efficiency worldwide and at a scale that is unique to Amazon. We often seek the opportunity of applying hybrid techniques in the space of Operations Research and Machine Learning to tackle some of our biggest technical challenges. We disambiguate complex supply chain problems and create ML and optimization solutions to solve those problems at scale. We are open to hiring candidates to work out of one of the following locations: Bellevue, WA, USA
US, WA, Seattle
Amazon is continuing to invest in its Advertising business to tap into the growing online advertising market. The Publisher Technologies team builds and operates extensible services that empower 1P Publishers to improve the monetization of their customer experiences, along with the experiences themselves. We bias toward standards-based and flexible designs that allow Publishers the ability to invent on top of our solutions and to interoperate well with other advertising technology providers; both internal and external. The Publisher Technology Data, Insights, and Analytics team enables faster data-driven decision making for Publishers and Monetization teams by providing them with near real time data, data management tools, actionable insights, and an easy-to-use reporting experience. Our data products provide Publishers and Monetization teams with the capabilities necessary to better understand the performance of their Advertising products along with supporting machine learning at scale. In this role, you will join a team whose data products and services empower hundreds of teams across Amazon with near real time data to support big data analytics, insights, and machine learning at scale. You will collaborate with cross-functional teams to design, develop, and implement advanced data tools, predictive models, and machine learning algorithms to support Advertising strategies and optimize revenue streams. You will analyze large-scale data to identify patterns and trends, and design and run A/B experiments to improve Publisher and advertiser experiences. Key job responsibilities - Design and lead large projects and experiments from beginning to end, and drive solutions to complex or ambiguous problems - Create tools and solve challenges using statistical modeling, machine learning, optimization, and/or other approaches for quantifiable impact on the business - Use broad expertise to recommend the right strategies, methodologies, and best practices, teaching and mentoring others - Key influencer of your team’s business strategy and of related teams’ strategies - Communication and documentation of methodologies, insights, and recommendations for senior leaders with various levels of technical knowledge We are open to hiring candidates to work out of one of the following locations: Seattle, WA, USA