Pinch-grasping robot handles items with precision

Preliminary tests show a prototype pinch-grasping robot achieved a 10-fold reduction in damage on items such as books and boxes.

For humans, finding and fetching a bottle of ketchup from a cluttered refrigerator without toppling the milk carton is a routine task. For robots, this remains a challenge of epic complexity.

At Amazon, scientists are addressing this challenge by teaching robots to understand cluttered environments in three dimensions, locate specific items, and safely retrieve them using a move called the pinch grasp — that unique thumb-and-finger hold that many people take for granted.

The research is part of an ongoing effort in the field of item-specific manipulation to develop robots that can handle millions of items across the kaleidoscope of shapes and sizes that are shipped to customers every day from Amazon fulfillment centers.

Watch the pinch grasping arm sort through items

We humans find and retrieve specific items with hands that are loaded with nerves connected to the brain for signal processing, hand-eye coordination, and motion control.

“In robotics, we don’t have the mechanical ability of a five-finger dexterous hand,” said Aaron Parness, a senior manager for applied science at Amazon Robotics AI. “But we are starting to get some of the ability to reason and think about how to grasp. We’re starting to catch up. Where pinch-grasping is really interesting is taking something mechanically simple and making it highly functional.”

Related content
Three of Amazon’s leading roboticists — Sidd Srinivasa, Tye Brady, and Philipp Michel — discuss the challenges of building robotic systems that interact with human beings in real-world settings.

This catching up is powered by breakthrough machine learning capabilities aimed at understanding the three-dimensional geometry of cluttered environments and how to navigate in them, according to Siddhartha Srinivasa, director of Amazon Robotics AI.

“Not only are we able to build robust three-dimensional models of the scene, we’re able to identify a specific item in the scene and use machine learning to know how best to pick it up and to move it quickly and without damage,” he said.

From suction to pinching

Today, vacuum-like suction is the default technology for robots tasked to pick up and move items of different shapes and sizes. These robots typically have elastic suction cups that form to the surface of the item to be lifted, creating a tight seal that provides control. The process works well across a range of items, from gift cards to cylindrical poster tubes.

Watch the Robin robotic arm deftly handling packages

Challenges occur if a vacuum seal breaks prematurely, which can happen when the angle of attachment changes during motion.

“If you are moving really fast from one location to another, objects can swing out and then just fly away,” said Can Erdogan, a senior applied scientist at Amazon Robotics AI. “All of the sudden, there are items on the ground.”

Increased suction to prevent premature detachment can also cause damage such as blistered or ripped packaging.

Related content
New statistical model reduces shipment damage by 24% while cutting shipping costs by 5%.

In other instances, the item to be moved requires contact on more than one surface. Books, for example, flop open if lifted from only the front or back cover. Another challenge is to get a tight seal on bags filled with granular items such as marbles or sand.

Pinch-grasping mimics the firm grip of a hand, enabling the robot to safely move the item from one place to the next without dropping it or causing damage.

“We are not just interested in picking up an item. We want to move the item,” Erdogan noted. “To do that, you need to be able to control it.”

Getting a grip on the scene

People who are sighted can estimate the shape of an item they intend to move, even when part of it is obscured from view. Take the ketchup bottle in the refrigerator: Even if only the top of it can be seen, experience and context allow people to imagine the full shape. We automatically create a mental model of it and a plan to grasp and move it without spilling the milk.

One of our big investments was making sure we can visualize the scene from multiple cameras and fuse all of that information as fast as possible so that we can get the full shape of the objects.
Can Erdogan

“Our robots are not quite there yet, but to be able to grasp this item from the front and back, we need to understand this whole shape,” Erdogan said. “So, one of our big investments was making sure we can visualize the scene from multiple cameras and fuse all of that information as fast as possible so that we can get the full shape of the objects.”

This 3D scene understanding is generated by multiple camera angles along with machine learning models trained to recognize and estimate the shape of individual items that help the robot compute how to grasp the item on two surfaces.

A set of motion algorithms take this understanding of the scene and item identification and combine it with the known dynamics of the robot — such as arm and hand weight — to calculate how to move the object from one place to another. The fusion of these models allows the robot to execute a pinch grasp and move something without bumping into other items.

In addition, multiple cameras provide a set of eyes on the scene — also known as continuous perception — to monitor the grasp and movement of an item so that the robot can adjust its plan of motion as necessary.

That’s an advance for robots, which typically “look at the scene, make a decision of what to do, and then do it. It’s almost like they close their eyes after they decide what to do, which is quite a shame because there are things going on in the scene while you’re doing it. Most of the damage to items happens in those moments,” Erdogan said.

Move fast, don’t break things

An advantage of suction is speed. That’s because contact is on a single surface. This allows a robot to quickly pick and move items such as chocolate bars from a shelf to a box. Grasping an item on two surfaces is more complicated, and thus takes longer, Erdogan noted. To make up for the extra time spent on a pinch grasp, the team optimized the robot arm to move faster.

“If you have a better grasp on the item, you can move faster. Moving faster also means you can take your time to achieve these good grasps,” he said. “We are lucky we have collaborators on our team who are focusing on motion, and we did this nice optimization where we made both the grasp and the motion faster.”

In preliminary tests, the team’s prototype pinch-grasping robot achieved a 10-fold reduction in damage on certain items, such as books, without a loss of speed when compared to robots that use suction.

“They not only showed they could grip a lot of objects, but they did it really fast — they got to 1,000 units per hour,” said Parness, who oversees the project.

The ability to grasp a diversity of items and move them quickly without damage makes pinch-grasping well suited for eventual deployment in an Amazon fulfillment center.

“What’s interesting about e-commerce, as opposed to manufacturing, is it’s much more dynamic,” Parness explained. “It’s a pen, and then it’s a teddy bear, and then it’s a light bulb, and then it’s a t-shirt, and then it’s a book.”

Fulfillment automation

For deployment in an Amazon fulfillment center, a key challenge is to generalize the robot’s item specific manipulation capability to all items available in the Amazon Store, noted Srinivasa.

Related content
By managing and automating many of the steps involved in continual learning, Janus is helping Amazon’s latest robots adapt to a changing environment.

“A majority of the items the robot is going to encounter in production it’s probably never seen before, so it needs to be able to generalize effectively to previously unseen items,” he explained. “Humans do this, too. When we see something novel, we try to map it to the nearest thing that we have encountered before and then try to use that experience from that task and modify it for a new situation.”

Another challenge is to gear the robot so that it can effectively manipulate the vast range of items available in the Amazon Store. For now, the robot uses an off-the-shelf hand to manipulate items that weigh less than two pounds, about half of the items available for purchase.

We can get to the questions that are relevant for the world of robotics in a very data-driven way. Once you have those questions, answering them is a joy. And when you answer them, you know how impactful they can be.
Siddhartha Srinivasa

Going forward, the team will need to design a hand — and associated tools — from scratch that can handle the full range of available items, Erdogan said.

What’s more, while pinch-grasping is superior to suction for some items, suction is better for others, especially flat items such as cards and rulers. A robot optimized for deployment in a fulfillment center may require suction and pinching, along with a machine learning algorithm that’s trained to decide which technique to use for any given situation, Parness said.

“As a person, you pick up a book differently than if you pick up a coin or a t-shirt,” he explained. “We need robots to be intelligent about the items they’re manipulating. If I’m picking up a hammer to hammer a nail in, I want to grasp it in a certain way. But if I’m picking up a hammer to go put it in a box to ship it to you, I want to grasp it in a different way. That’s the future of item intelligence.”

Amazon’s size, scale, and mission enable this level of robotics research, Srinivasa said, and it also enhances the effect it can have in the real world. For example, working within Amazon provides scientists with access to data on current item damage rates and models that show the improvements required to justify the investment in robotics. This provides a focus for his team’s scientists and engineers.

“We can get to the questions that are relevant for the world of robotics in a very data-driven way. Once you have those questions, answering them is a joy,” he said. “And when you answer them, you know how impactful they can be.”

Research areas

Related content

GB, Cambridge
Amazon Devices is an inventive research and development company that designs and engineer high-profile devices like Echo, Fire Tablets, Fire TV, and other consumer devices. We are looking for exceptional scientists to join our Applied Science team to advance the state-of-the-art in developing efficient multimodal language models across our product portfolio. Through close hardware-software integration, we design and train models for resource efficiency across the hardware and software tech stack. The Silicon and Solutions Group Edge AI team is looking for a talented Sr. Applied Scientist who will lead our efforts on inventing evaluation methods for multimodal language models and agents for new devices, including audio and vision experiences. Key job responsibilities - Collaborate with cross-functional engineers and scientists to advance the state of the art in multimodal model evaluations for devices, including audio, images, and videos - Invent and validate reliability for novel automated evaluation methods for perception tasks, such as fine-tuned LLM-as-judge - Develop and extend our evaluation framework(s) to support expanding capabilities for multimodal language models - Analyze large offline and online datasets to understand model gaps, develop methods to interpret model failures, and collaborate with training teams to enhance model capabilities for product use cases - Work closely with scientists, compiler engineers, data collection, and product teams to advance evaluation methods - Mentor less experienced Applied Scientists A day in the life As a Scientist with the Silicon and Solutions Group Edge AI team, you'll contribute to innovative methods for evaluating new product experiences and discover ways to enhance our model capabilities and enrich our customer experiences. You'll research new methods for reliably assessing perception capabilities for audio-visual tasks in multimodal language models, design and implement new metrics, and develop our evaluation framework. You'll collaborate across teams of engineers and scientists to identify and root cause issues in models and their system integration to continuously enhance the end-to-end experience. About the team Our Edge AI science team brings together our unique skills and experiences to deliver state-of-the-art multimodal AI models that enable new experiences on Amazon devices. We work at the intersection of hardware, software, and science to build models designed for our custom silicon.
US, CA, Sunnyvale
The Artificial General Intelligence (AGI) team is looking for a passionate, talented, and inventive Applied Scientist with a strong deep learning background, to help build industry-leading technology with generative AI (GenAI) and multi-modal systems. Key job responsibilities As an Applied Scientist with the AGI team, you will work with talented peers to develop algorithms and modeling techniques to advance the state of the art with multi-modal systems. Your work will directly impact our customers in the form of products and services that make use of vision and language technology. You will leverage Amazon’s large-scale computing resources to accelerate development with multi-modal Large Language Models (LLMs) and GenAI in Computer Vision. About the team The AGI team has a mission to push the envelope with multimodal LLMs and GenAI in Computer Vision, in order to provide the best-possible experience for our customers.
US, MA, Boston
The Artificial General Intelligence (AGI) team is looking for a passionate, talented, and inventive Applied Scientist with a strong deep learning background, to build industry-leading Generative Artificial Intelligence (GenAI) technology with Large Language Models (LLMs) and multimodal systems. Key job responsibilities As an Applied Scientist with the AGI team, you will work with talented peers to lead the development of algorithms and modeling techniques, to advance the state of the art with LLMs. Your work will directly impact our customers in the form of products and services that make use of GenAI technology. You will leverage Amazon’s heterogeneous data sources and large-scale computing resources to accelerate advances in LLMs. About the team The AGI team has a mission to push the envelope in GenAI with LLMs and multimodal systems, in order to provide the best-possible experience for our customers.
US, WA, Seattle
Prime Video is a first-stop entertainment destination offering customers a vast collection of premium programming in one app available across thousands of devices. Prime members can customize their viewing experience and find their favorite movies, series, documentaries, and live sports – including Amazon MGM Studios-produced series and movies; licensed fan favorites; and programming from Prime Video add-on subscriptions such as Apple TV+, Max, Crunchyroll and MGM+. All customers, regardless of whether they have a Prime membership or not, can rent or buy titles via the Prime Video Store, and can enjoy even more content for free with ads. Are you interested in shaping the future of entertainment? Prime Video's technology teams are creating best-in-class digital video experience. As a Prime Video technologist, you’ll have end-to-end ownership of the product, user experience, design, and technology required to deliver state-of-the-art experiences for our customers. You’ll get to work on projects that are fast-paced, challenging, and varied. You’ll also be able to experiment with new possibilities, take risks, and collaborate with remarkable people. We’ll look for you to bring your diverse perspectives, ideas, and skill-sets to make Prime Video even better for our customers. With global opportunities for talented technologists, you can decide where a career Prime Video Tech takes you! Key job responsibilities Develop ML models for various recommendation & search systems using deep learning, online learning, and optimization methods Work closely with other scientists, engineers and product managers to expand the depth of our product insights with data, create a variety of experiments to determine the high impact projects to include in planning roadmaps Stay up-to-date with advancements and the latest modeling techniques in the field Publish your research findings in top conferences and journals A day in the life We're using advanced approaches such as foundation models to connect information about our videos and customers from a variety of information sources, acquiring and processing data sets on a scale that only a few companies in the world can match. This will enable us to recommend titles effectively, even when we don't have a large behavioral signal (to tackle the cold-start title problem). It will also allow us to find our customer's niche interests, helping them discover groups of titles that they didn't even know existed. We are looking for creative & customer obsessed machine learning scientists who can apply the latest research, state of the art algorithms and ML to build highly scalable page personalization solutions. You'll be a research leader in the space and a hands-on ML practitioner, guiding and collaborating with talented teams of engineers and scientists and senior leaders in the Prime Video organization. You will also have the opportunity to publish your research at internal and external conferences.
US, CA, Palo Alto
About Sponsored Products and Brands The Sponsored Products and Brands (SPB) team at Amazon Ads is re-imagining the advertising landscape through generative AI technologies, revolutionizing how millions of customers discover products and engage with brands across Amazon.com and beyond. We are at the forefront of re-inventing advertising experiences, bridging human creativity with artificial intelligence to transform every aspect of the advertising lifecycle from ad creation and optimization to performance analysis and customer insights. We are a passionate group of innovators dedicated to developing responsible and intelligent AI technologies that balance the needs of advertisers, enhance the shopping experience, and strengthen the marketplace. If you're energized by solving complex challenges and pushing the boundaries of what's possible with AI, join us in shaping the future of advertising. About our team SPB Ad Response Prediction team is your choice, if you want to join a highly motivated, collaborative, and fun-loving team with a strong entrepreneurial spirit and bias for action. We are seeking an experienced and motivated Applied Scientist with machine learning engineering background who loves to innovate at the intersection of customer experience, deep learning, and high-scale machine learning systems. We are looking for a talented Applied Scientist with a strong background in machine learning engineering to join our team and help us grow the business. In this role, you will partner with a team of engineers and scientists to build advanced machine learning models and infrastructure, from training to inference, including emerging LLM-based systems, that deliver highly relevant ads to shoppers across all Amazon platforms and surfaces worldwide. Key job responsibilities As an Applied Scientist, you will: * Develop scalable and effective machine learning models and optimization strategies to solve business problems. * Conduct research on new machine learning modeling to optimize all aspects of Sponsored Products business. * Enhance the scalability, automation, and efficiency of large-scale training and real-time inference systems. * Pioneer the development of LLM inference infrastructure to support next-generation GenAI workloads at Amazon Ads scale.
US, WA, Seattle
The Economics Science team in the Amazon Manager Experience (AMX) organization builds science models supporting employee career-related experiences such as their evaluation, learning and development, onboarding, and promotion. Additionally, the team conducts experiments for a wide range of employee and talent-related product features, and measures the impact of product and program initiatives in enhancing our employees' career experiences at Amazon. The team is looking for an Economist who specializes in the field of macroeconomics and time series forecasting. This role combines traditional macroeconomic analysis with modern data science techniques to enhance understanding and forecasting of workforce dynamics at scale. Key job responsibilities The economists within ALX focus on enhancing causal evaluation, measurement, and experimentation tasks to ensure various science integrations and interventions achieve their goals in building more rewarding careers for our employees. The economists develop and implement complex randomization designs that address the nuances of experimentation in complex settings where multiple populations interact. Additionally, they engage in building a range of econometric models that surface various proactive and reactive inspection signals, aiming toward better alignment in the implementation of talent processes. The economists closely collaborate with scientists from diverse backgrounds, as well as program and product leaders, to implement and assess science solutions in our products.
GB, London
Are you looking to work at the forefront of Machine Learning and AI? Would you be excited to apply Generative AI algorithms to solve real world problems with significant impact? The Generative AI Innovation Center helps AWS customers implement Generative AI solutions and realize transformational business opportunities. This is a team of strategists, scientists, engineers, and architects working step-by-step with customers to build bespoke solutions that harness the power of generative AI. The team helps customers imagine and scope the use cases that will create the greatest value for their businesses, define paths to navigate technical or business challenges, develop proof-of-concepts, and make plans for launching solutions at scale. The GenAI Innovation Center team provides guidance on best practices for applying generative AI responsibly and cost efficiently. You will work directly with customers and innovate in a fast-paced organization that contributes to game-changing projects and technologies. You will design and run experiments, research new algorithms, and find new ways of optimizing risk, profitability, and customer experience. We’re looking for Data Scientists capable of using GenAI and other techniques to design, evangelize, and implement state-of-the-art solutions for never-before-solved problems. Key job responsibilities As a Data Scientist, you will • Collaborate with AI/ML scientists and architects to research, design, develop, and evaluate generative AI solutions to address real-world challenges • Interact with customers directly to understand their business problems, aid them in implementation of generative AI solutions, brief customers and guide them on adoption patterns and paths to production • Create and deliver best practice recommendations, tutorials, blog posts, sample code, and presentations adapted to technical, business, and executive stakeholder • Provide customer and market feedback to product and engineering teams to help define product direction About the team Diverse Experiences AWS values diverse experiences. Even if you do not meet all of the qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying. Why AWS? Amazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating — that’s why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses. Inclusive Team Culture Here at AWS, it’s in our nature to learn and be curious. Our employee-led affinity groups foster a culture of inclusion that empower us to be proud of our differences. Ongoing events and learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon (gender diversity) conferences, inspire us to never stop embracing our uniqueness. Mentorship & Career Growth We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional. Work/Life Balance We value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why we strive for flexibility as part of our working culture. When we feel supported in the workplace and at home, there’s nothing we can’t achieve in the cloud.
IN, KA, Bengaluru
The Amazon Alexa AI team in India is seeking a talented, self-driven Applied Scientist to work on prototyping, optimizing, and deploying ML algorithms within the realm of Generative AI. The Applied Scientist will be in a team of exceptional scientists to develop novel algorithms and modeling techniques to advance the state of the art in Natural Language Processing (NLP) or Computer Vision (CV) related tasks. They will work in a hybrid, fast-paced organization where scientists, engineers, and product managers work together to build customer facing experiences. They will collaborate with and mentor other scientists to raise the bar of scientific research in Amazon. Their work will directly impact our customers in the form of products and services that make use of speech, language, and computer vision technologies. Key job responsibilities - Research, experiment and build Proof Of Concepts advancing the state of the art in AI & ML for GenAI. - Collaborate with cross-functional teams to architect and execute technically rigorous AI projects. - Thrive in dynamic environments, adapting quickly to evolving technical requirements and deadlines. - Engage in effective technical communication (written & spoken) with coordination across teams. - Conduct thorough documentation of algorithms, methodologies, and findings for transparency and reproducibility. - Publish research papers in internal and external venues of repute - Support on-call activities for critical issues A day in the life You will be working with a group of talented scientists on researching algorithm and running experiments to test scientific proposal/solutions to improve solutions powering customer experience on Alexa+. This will involve collaboration with partner teams including engineering, PMs, data annotators, and other scientists to discuss data quality, policy, and model development. You will mentor other scientists, review and guide their work, help develop roadmaps for the team. You work closely with partner teams across Alexa to deliver platform features that require cross-team leadership.
US, CA, Sunnyvale
As a Principal Scientist within the Artificial General Intelligence (AGI) organization, you are a trusted part of the technical leadership. You bring business and industry context to science and technology decisions. You set the standard for scientific excellence and make decisions that affect the way we build and integrate algorithms. You solicit differing views across the organization and are willing to change your mind as you learn more. Your artifacts are exemplary and often used as reference across organization. You are a hands-on scientific leader. Your solutions are exemplary in terms of algorithm design, clarity, model structure, efficiency, and extensibility. You tackle intrinsically hard problems, acquiring expertise as needed. You decompose complex problems into straightforward solutions. You amplify your impact by leading scientific reviews within your organization or at your location. You scrutinize and review experimental design, modeling, verification and other research procedures. You probe assumptions, illuminate pitfalls, and foster shared understanding. You align teams toward coherent strategies. You educate, keeping the scientific community up to date on advanced techniques, state of the art approaches, the latest technologies, and trends. You help managers guide the career growth of other scientists by mentoring and play a significant role in hiring and developing scientists and leads. You will play a critical role in driving the development of Generative AI (GenAI) technologies that can handle Amazon-scale use cases and have a significant impact on our customers' experiences. Key job responsibilities You will be responsible for defining key research directions, adopting or inventing new machine learning techniques, conducting rigorous experiments, publishing results, and ensuring that research is translated into practice. You will develop long-term strategies, persuade teams to adopt those strategies, propose goals and deliver on them. You will also participate in organizational planning, hiring, mentorship and leadership development. You will be technically fearless and with a passion for building scalable science and engineering solutions. You will serve as a key scientific resource in full-cycle development (conception, design, implementation, testing to documentation, delivery, and maintenance).
US, CA, Mountain View
Amazon launched the Generative AI Innovation Center (GAIIC) in Jun 2023 to help AWS customers accelerate the use of Generative AI to solve business and operational problems and promote innovation in their organization (https://press.aboutamazon.com/2023/6/aws-announces-generative-ai-innovation-center). GAIIC provides opportunities to innovate in a fast-paced organization that contributes to game-changing projects and technologies that get deployed on devices and in the cloud. As an Applied Science Manager in GAIIC, you'll partner with technology and business teams to build new GenAI solutions that delight our customers. You will be responsible for directing a team of data/research/applied scientists, deep learning architects, and ML engineers to build generative AI models and pipelines, and deliver state-of-the-art solutions to customer’s business and mission problems. Your team will be working with terabytes of text, images, and other types of data to address real-world problems. The successful candidate will possess both technical and customer-facing skills that will allow you to be the technical “face” of AWS within our solution providers’ ecosystem/environment as well as directly to end customers. You will be able to drive discussions with senior technical and management personnel within customers and partners, as well as the technical background that enables them to interact with and give guidance to data/research/applied scientists and software developers. The ideal candidate will also have a demonstrated ability to think strategically about business, product, and technical issues. Finally, and of critical importance, the candidate will be an excellent technical team manager, someone who knows how to hire, develop, and retain high quality technical talent. About the team About the team Diverse Experiences AWS values diverse experiences. Even if you do not meet all of the preferred qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying. Why AWS? Amazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating — that’s why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses. Inclusive Team Culture AWS values curiosity and connection. Our employee-led and company-sponsored affinity groups promote inclusion and empower our people to take pride in what makes us unique. Our inclusion events foster stronger, more collaborative teams. Our continual innovation is fueled by the bold ideas, fresh perspectives, and passionate voices our teams bring to everything we do. Mentorship & Career Growth We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional. Work/Life Balance We value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why we strive for flexibility as part of our working culture. When we feel supported in the workplace and at home, there’s nothing we can’t achieve.