iNaturalist opens up a wealth of nature data — and computer vision challenges

Amazon Machine Learning Research Award recipient utilizes a combination of people and machine learning models to illuminate the planet's incredible biodiversity.

On a hike in the woods, you spot a colorful little bird. You're pretty sure it's a finch — but what kind? The iNaturalist app was made for this kind of scenario: people all over the world use it to record and identify what they're seeing outside. Increasingly, artificial intelligence enabled by Amazon Web Services (AWS) is playing a role in classifying those observations.

iNaturalist launched about 10 years ago, evolving from a master's project from three students at the University of California, Berkeley. Since then, the app has attracted a community of 1.5 million scientists and nature lovers who post photos of everything from bumblebees to bears.

iNaturalist, which today is a joint initiative of the California Academy of Sciences and the National Geographic Society, once relied solely on its members to identify species.  Now computers are helping out.

"iNaturalist's goal is really just to connect people with nature," said Grant Van Horn, a research engineer at the Cornell Lab of Ornithology. Being able to name that flower or insect you see "really ups the engagement level and makes for a completely different experience,” he adds.

A unique computer vision challenge

Grant Van Horn, research engineer at the Cornell Lab of Ornithology
Grant Van Horn
Oisin Mac Aodha, assistant professor of machine learning at the University of Edinburgh
Oisin Mac Aodha

Van Horn and Oisin Mac Aodha, now an assistant professor of machine learning at the University of Edinburgh, began working with iNaturalist five years ago to solve challenges related to the app's data. Both were at the California Institute of Technology; Van Horn was working on his PhD, and Mac Aodha was a postdoctoral researcher. They were interested in how computer vision could help accelerate and validate the identifications that humans were making on the app.

The appeal of iNaturalist to the researchers is that it represented a unique challenge to the computer vision community, Van Horn says.

If you were to build a computer model to identify finches, for example, you might scrape some images from the internet and use those to train it.

But that dataset, likely full of high-quality photos with serenely perched birds, would look quite different from the vast diversity of mostly amateur photos on iNaturalist. There, a hiker may have just barely managed to capture a photo as a bird is flying away, or the bird might be hard to identify against the background.

That all assumes the bird is even standing still. Swallows and swifts, Van Horn noted, are rarely perching — a good birder will recognize them in flight, but how do you train a computer to do the same thing?

This is just one in a seemingly endless list of computer vision challenges related to nature.

Many species look strikingly similar. They have more than one name: The scientific one (Danaus plexippus, for example) and the common one (monarch butterfly). They can have more than one form: females of one species might look different from their male counterparts; eggs turn into larva, which turn into mature insects.

inat_fg.png
An image provided by the researchers illustrates the difficulty involved in identifying species from images taken in the wild.
Courtesy of Grant Van Horn and Oisin Mac Aodha

These challenges exist across millions of plant and animal species in the world. Taken from that perspective, the more than 300,000 species catalogued on the AWS-hosted iNaturalist are a fraction of what might be possible as users continue to add data.

"You could imagine a future system that can reason about all these things at, effectively, an unprecedented level of ability," Mac Aodha said, "because there's no person that's going to be able to tell you which of X million different things this one picture could be."

New machine learning competitions

In 2017, Van Horn and Mac Aodha began hosting competitions with iNaturalist data at the annual Conference on Computer Vision and Pattern Recognition (CVPR). Part of the conference's Workshop on Fine-Grained Visual Categorization, the competitions present a dataset and then rank entries on their accuracy in classifying it. The winning team is the one that generates the lowest error rate.

In the beginning, just the basic taxonomy of iNaturalist's data posed a learning curve for Van Horn and Mac Aodha. "This was not obvious to us: there's no one taxonomic authority in the world," Van Horn said.

They spent considerable time early on learning to work with the taxonomy, clean up the data, and assemble a dataset comprising 859,000 images for the first competition. In the second year, they featured a dataset with more of a long-tailed distribution, meaning there were many species that had relatively few associated images. In 2019, the dataset was reduced to 268,243 images of highly similar categories captured in a wide variety of situations.

inaturalist dataset image.jpg
After a break last year, the main iNat competition is back and bigger, with a training dataset of 2.7 million images representing 10,000 species. The image above is from an earlier iNat competition dataset.
Courtesy of Grant Van Horn and Mac Aodha Oisin

After a break last year, the main iNat competition is back and bigger, with a training dataset of 2.7 million images representing 10,000 species. The iNat Challenge 2021, which began March 8, ends on May 28.

"It's not like we're trying to throw in categories just to make this thing sound big," Van Horn said. "It is big. And it will just continue to get bigger as the years progress."

This year's larger dataset could encourage teams to explore a recent trend in the machine learning field toward unsupervised learning, where a computer model can learn from the data without labels, or predefined "answers," by seeking patterns within the information.

"We have quite a lot of images for each of these 10,000 categories," Mac Aodha said. "We're hoping that this will open up some interesting avenues for people who are exploring the self-supervised question in the context of this naturalistic, real-world task."

Each competition entry must provide one predicted classification for every image in the dataset. An error rate of 5% on this year’s dataset would be “amazing,” Van Horn said, adding that one team had already achieved an 8.67% error rate by late March.

A move to Open Data

The ability to classify large groups of images opens up the potential to answer a wide range of scientific questions about habitat, behavior, and variations within a species. For example, iNaturalist users have documented alligator lizards' jaw-clinching mating rituals in Los Angeles, where the amount of private property makes traditional wildlife studies impossible.

With this type of insight in mind, Mac Aodha and Van Horn have created a new dataset of natural world tasks (NeWT) that moves beyond the question of species classification and explores concepts related to behavior and attributes that are also exhibited in these photographs.

This work is appearing in the CVPR conference this year, and a competition is being planned to challenge competitors to produce models that generalize to these alternative questions.

So far, winning entries in the CVPR competitions haven’t been deployed by iNaturalist itself, because there are performance tradeoffs between code that generates the least errors, and code that is efficient enough to run on smartphones. But the competition datasets, Mac Aodha said, have found widespread use in the computer vision and machine learning literature, generating some 300 citations over the last few years.

FGVC7: Intro to the 7th Workshop on Fine-Grained Visual Categorization at CVPR 2020

The competitions are hosted on Kaggle, a machine learning and data science platform that draws a wide variety of entrants beyond the iNaturalist community. The 2019 competition drew 213 teams from around the world, and the winners were based in China.

In order for the competition to be fair, an entrant must be able to access and work with the thousands or millions of images in a dataset, no matter where they are in the world. The competitions, and now the iNaturalist app itself, are part of Open Data on AWS, which "makes accessing the data insanely easy and very convenient," Van Horn said.

In 2020, iNaturalist received an Amazon Machine Learning Research Award, which provides unrestricted cash funds and AWS promotional credits to academics to advance the frontiers of machine learning. That helped cover costs for iNaturalist to continue storing data on AWS as it implemented machine learning classification. In March, the app moved to the Registry of Open Data on AWS, which ensures iNaturalist's vast collection of observations — some 60 million — will remain freely accessible to anyone who wants access.

"iNaturalist is doing really important work to bring scientists and everyday citizens together to create a community and drive awareness on biodiversity and environmental sciences," said An Luo, senior technical program manager leading the Amazon Research Awards program. “We are very excited that AWS is empowering them to serve more people as well as conduct advanced machine learning research using the AWS Open Data platform and AWS machine learning services such as Amazon SageMaker.”

Today, iNaturalist has gone from being entirely people-powered to regularly providing machine-generated identifications that are only just beginning to reveal new potential research paths.

"It's important for us that this data lasts and is accessible for a long time, not just for the duration of the competitions," Mac Aodha said. "Having a stable home for these datasets is a really valuable thing."

Related content

US, NJ, Newark
Employer: Audible, Inc. Title: Data Scientist II Location: 1 Washington Street, Newark, NJ 07102 Duties: Independently own, design, and implement scalable and reliable solutions to support or automate decision making throughout the business. Apply a range of data science techniques and tools combined with subject matter expertise to solve difficult business problems and cases in which the approach is unclear. Acquire data by building the necessary SQL/ETL queries. Import processes through various company specific interfaces for accessing RedShift, and S3/edX storage systems. Deliver artifacts on medium size projects that affect important business decisions. Build relationships with stakeholders and counterparts, and communicate model outputs, observations, and key performance indicators (KPIs) to the management to develop sustainable and consumable products and product features. Explore and analyze data by inspecting univariate distributions and multivariate interactions, constructing appropriate transformations, and tracking down the source and meaning of anomalies. Build production-ready models using statistical modeling, mathematical modeling, econometric modeling, machine learning algorithms, network modeling, social network modeling, natural language processing, large language models and/or genetic algorithms. Validate models against alternative approaches, expected and observed outcome, and other business defined key performance indicators. Implement models that comply with evaluations of the computational demands, accuracy, and reliability of the relevant ETL processes at various stages of production. Position reports to Newark, NJ office; however, telecommuting from a home office may be allowed. Requirements: Requires a Master’s degree in Statistics, Computer Science, Computer Engineering, Data Science, Machine Learning, Applied Math, Operations Research, or a related field plus two (2) years of experience as a Data Scientist or other occupation involving data processing and predictive Machine Learning modeling at scale. Experience may be gained concurrently and must include: Two (2) years in each of the following: - Utilizing specialized modelling software including Python or R - Building statistical models and machine learning models using large datasets from multiple resources - Building non-linear models including Neural Nets, Deep Learning, or Gradient Boosting. One (1) year in each of the following: - Building production-ready solutions or applications relying on Large Language Models (LLM), accessed programmatically and beyond just prompting - Evaluating LLM results at scale or fine-tuning LLMs - Building production-ready recommendation systems - Using database technologies including SQL or ETL. Alternatively, will accept a Bachelor’s degree and five (5) years of experience. Salary: $169,550 - 207,500 /year. Multiple positions. Apply online: www.amazon.jobs Job Code: ADBL175.
US, WA, Seattle
Innovators wanted! Are you an entrepreneur? A builder? A dreamer? This role is part of an Amazon Special Projects team that takes the company’s Think Big leadership principle to the limits. If you’re interested in innovating at scale to address big challenges in the world, this is the team for you. As a Senior Applied Scientist on our team, you will focus on building state-of-the-art ML models for healthcare. Our team rewards curiosity while maintaining a laser-focus in bringing products to market. Competitive candidates are responsive, flexible, and able to succeed within an open, collaborative, entrepreneurial, startup-like environment. At the forefront of both academic and applied research in this product area, you have the opportunity to work together with a diverse and talented team of scientists, engineers, and product managers and collaborate with other teams. This role offers a unique opportunity to work on projects that could fundamentally transform healthcare outcomes. Key job responsibilities In this role, you will: • Design and implement novel AI/ML solutions for complex healthcare challenges • Drive advancements in machine learning and data science • Balance theoretical knowledge with practical implementation • Work closely with customers and partners to understand their requirements • Navigate ambiguity and create clarity in early-stage product development • Collaborate with cross-functional teams while fostering innovation in a collaborative work environment to deliver impactful solutions • Establish best practices for ML experimentation, evaluation, development and deployment • Partner with leadership to define roadmap and strategic initiatives You’ll need a strong background in AI/ML, proven leadership skills, and the ability to translate complex concepts into actionable plans. You’ll also need to effectively translate research findings into practical solutions. A day in the life You will solve real-world problems by getting and analyzing large amounts of data, generate insights and opportunities, design simulations and experiments, and develop statistical and ML models. The team is driven by business needs, which requires collaboration with other Scientists, Engineers, and Product Managers across the Special Projects organization. You will prepare written and verbal presentations to share insights to audiences of varying levels of technical sophistication. About the team We represent Amazon's ambitious vision to solve the world's most pressing challenges. We are exploring new approaches to enhance research practices in the healthcare space, leveraging Amazon's scale and technological expertise. We operate with the agility of a startup while backed by Amazon's resources and operational excellence. We're looking for builders who are excited about working on ambitious, undefined problems and are comfortable with ambiguity.
IN, TS, Hyderabad
Are you fascinated by the power of Natural Language Processing (NLP) and Large Language Models (LLM) to transform the way we interact with technology? Are you passionate about applying advanced machine learning techniques to solve complex challenges in the e-commerce space? If so, Amazon's International Seller Services team has an exciting opportunity for you as an Applied Scientist. At Amazon, we strive to be Earth's most customer-centric company, where customers can find and discover anything they want to buy online. Our International Seller Services team plays a pivotal role in expanding the reach of our marketplace to sellers worldwide, ensuring customers have access to a vast selection of products. As an Applied Scientist, you will join a talented and collaborative team that is dedicated to driving innovation and delivering exceptional experiences for our customers and sellers. You will be part of a global team that is focused on acquiring new merchants from around the world to sell on Amazon’s global marketplaces around the world. The position is based in Seattle but will interact with global leaders and teams in Europe, Japan, China, Australia, and other regions. Join us at the Central Science Team of Amazon's International Seller Services and become part of a global team that is redefining the future of e-commerce. With access to vast amounts of data, cutting-edge technology, and a diverse community of talented individuals, you will have the opportunity to make a meaningful impact on the way sellers engage with our platform and customers worldwide. Together, we will drive innovation, solve complex problems, and shape the future of e-commerce. Please visit https://www.amazon.science for more information Key job responsibilities - Apply your expertise in LLM models to design, develop, and implement scalable machine learning solutions that address complex language-related challenges in the international seller services domain. - Collaborate with cross-functional teams, including software engineers, data scientists, and product managers, to define project requirements, establish success metrics, and deliver high-quality solutions. - Conduct thorough data analysis to gain insights, identify patterns, and drive actionable recommendations that enhance seller performance and customer experiences across various international marketplaces. - Continuously explore and evaluate state-of-the-art NLP techniques and methodologies to improve the accuracy and efficiency of language-related systems. - Communicate complex technical concepts effectively to both technical and non-technical stakeholders, providing clear explanations and guidance on proposed solutions and their potential impact.
US, CA, San Francisco
Amazon AGI Autonomy develops foundational capabilities for useful AI agents. We are the research lab behind Amazon Nova Act, a state-of-the-art computer-use agent. Our work combines Large Language Models (LLMs) with Reinforcement Learning (RL) to solve reasoning, planning, and world modeling in the virtual world. We are a small, talent-dense lab with the autonomy to move fast and the long-term commitment to pursue high-risk, high-payoff research. Come be a part of our journey! -- About the team: We are a research engineering team responsible for data ingestion and research tooling that support model development across the lab. The lab’s ability to train state-of-the-art models depends on generating high-quality training data and having useful tools for understanding experimental outcomes. We accelerate research work across the lab while maintaining the operational reliability expected of critical infrastructure. -- About the role: As a frontend engineer on the team, you will build the platform and tooling that power data creation, evaluation, and experimentation across the lab. Your work will be used daily by annotators, engineers, and researchers. This is a hands-on technical leadership role. You will ship a lot of code while defining frontend architecture, shared abstractions, and UI systems across the platform. We are looking for someone with strong engineering fundamentals, sound product judgment, and the ability to build polished UIs in a fast-moving research environment. Key job responsibilities - Be highly productive in the codebase and drive the team’s engineering velocity. - Define and evolve architecture for a research tooling platform with multiple independently evolving tools. - Design and implement reusable UI components, frontend infrastructure, and APIs. - Collaborate directly with Research, Human -Feedback, Product Engineering, and other teams to understand workflows and define requirements. - Write technical RFCs to communicate design decisions and tradeoffs across teams. - Own projects end to end, from technical design through implementation, rollout, and long-term maintenance. - Raise the team’s technical bar through thoughtful code reviews, architectural guidance, and mentorship.
US, CA, San Francisco
Amazon AGI Autonomy develops foundational capabilities for useful AI agents. We are the research lab behind Amazon Nova Act, a state-of-the-art computer-use agent. Our work combines Large Language Models (LLMs) with Reinforcement Learning (RL) to solve reasoning, planning, and world modeling in the virtual world. We are a small, talent-dense lab with the autonomy to move fast and the long-term commitment to pursue high-risk, high-payoff research. Come be a part of our journey! -- About the team: We are a research engineering team responsible for data ingestion and research tooling that support model development across the lab. The lab’s ability to train state-of-the-art models depends on generating high-quality training data and having useful tools for understanding experimental outcomes. We accelerate research work across the lab while maintaining the operational reliability expected of critical infrastructure. -- About the role: As a backend engineer on the team, you will build and operate core services that ingest, process, and distribute large-scale, multi-modal datasets to internal tools and data pipelines across the lab. This is a hands-on technical leadership role. You will ship a lot of code while defining backend architecture and operational standards across the platform. The platform is built primarily in TypeScript today, with plans to introduce Python services in the future. We are looking for someone who can balance rapid experimentation with operational rigor to build reliable services in a fast-moving research environment. Key job responsibilities - Be highly productive in the codebase and drive the team’s engineering velocity. - Design and evolve backend architecture and interfaces for core services. - Define and own standards for production health, performance, and observability. - Collaborate directly with Research, Human Feedback, Product Engineering, and other teams to understand workflows and define requirements. - Write technical RFCs to communicate design decisions and tradeoffs across teams. - Own projects end to end, from technical design through long-term maintenance. - Raise the team’s technical bar through thoughtful code reviews, architectural guidance, and mentorship.
US, CA, Pasadena
The Amazon Center for Quantum Computing (CQC) team is looking for a passionate, talented, and inventive Research Engineer specializing in hardware design for cryogenic environments. The ideal candidate should have expertise in 3D CAD (SolidWorks), thermal and structural FEA (Ansys/COMSOL), hardware design for cryogenic applications, design for manufacturing, and mechanical engineering principles. The candidate must have demonstrated experience driving designs through full product development cycles (requirements, conceptual design, detailed design, manufacturing, integration, and testing). Candidates must also have a strong background in both cryogenic mechanical engineering theory and implementation. Working effectively within a cross-functional team environment is critical. Key job responsibilities The CQC collaborates across teams and projects to offer state-of-the-art, cost-effective solutions for scaling the signal delivery to quantum processor systems at cryogenic temperatures. Equally important is the ability to scale the thermal performance and improve EMI mitigation of the cryogenic environment. You will work on the following: - High density novel packaging solutions for quantum processor units - Cryogenic mechanical design for novel cryogenic signal conditioning sub-assemblies - Cryogenic mechanical design for signal delivery systems - Simulation-driven designs (shielding, filtering, etc.) to reduce sources of EMI within the qubit environment. - Own end-to-end product development through requirements, design reports, design reviews, assembly/testing documentation, and final delivery A day in the life As you design and implement cryogenic hardware solutions, from requirements definition to deployment, you will also: - Participate in requirements, design, and test reviews and communicate with internal stakeholders - Work cross-functionally to help drive decisions using your unique technical background and skill set - Refine and define standards and processes for operational excellence - Work in a high-paced, startup-like environment where you are provided the resources to innovate quickly About the team The Amazon Center for Quantum Computing (CQC) is a multi-disciplinary team of scientists, engineers, and technicians, on a mission to develop a fault-tolerant quantum computer. Inclusive Team Culture Here at Amazon, it’s in our nature to learn and be curious. Our employee-led affinity groups foster a culture of inclusion that empower us to be proud of our differences. Ongoing events and learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon conferences, inspire us to never stop embracing our uniqueness. Diverse Experiences Amazon values diverse experiences. Even if you do not meet all of the preferred qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying. Mentorship & Career Growth We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional. Work/Life Balance We value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why we strive for flexibility as part of our working culture. When we feel supported in the workplace and at home, there’s nothing we can’t achieve in the cloud. Export Control Requirement Due to applicable export control laws and regulations, candidates must be either a U.S. citizen or national, U.S. permanent resident (i.e., current Green Card holder), or lawfully admitted into the U.S. as a refugee or granted asylum, or be able to obtain a US export license. If you are unsure if you meet these requirements, please apply and Amazon will review your application for eligibility.
US, CA, Pasadena
The Amazon Center for Quantum Computing (CQC) team is looking for a passionate, talented, and inventive Research Engineer specializing in hardware design for cryogenic environments. The ideal candidate should have expertise in 3D CAD (SolidWorks), thermal and structural FEA (Ansys/COMSOL), hardware design for cryogenic applications, design for manufacturing, and mechanical engineering principles. The candidate must have demonstrated experience driving designs through full product development cycles (requirements, conceptual design, detailed design, manufacturing, integration, and testing). Candidates must also have a strong background in both cryogenic mechanical engineering theory and implementation. Working effectively within a cross-functional team environment is critical. Key job responsibilities The CQC collaborates across teams and projects to offer state-of-the-art, cost-effective solutions for scaling the signal delivery to quantum processor systems at cryogenic temperatures. Equally important is the ability to scale the thermal performance and improve EMI mitigation of the cryogenic environment. You will work on the following: - High density novel packaging solutions for quantum processor units - Cryogenic mechanical design for novel cryogenic signal conditioning sub-assemblies - Cryogenic mechanical design for signal delivery systems - Simulation-driven designs (shielding, filtering, etc.) to reduce sources of EMI within the qubit environment. - Own end-to-end product development through requirements, design reports, design reviews, assembly/testing documentation, and final delivery A day in the life As you design and implement cryogenic hardware solutions, from requirements definition to deployment, you will also: - Participate in requirements, design, and test reviews and communicate with internal stakeholders - Work cross-functionally to help drive decisions using your unique technical background and skill set - Refine and define standards and processes for operational excellence - Work in a high-paced, startup-like environment where you are provided the resources to innovate quickly About the team The Amazon Center for Quantum Computing (CQC) is a multi-disciplinary team of scientists, engineers, and technicians, on a mission to develop a fault-tolerant quantum computer. Inclusive Team Culture Here at Amazon, it’s in our nature to learn and be curious. Our employee-led affinity groups foster a culture of inclusion that empower us to be proud of our differences. Ongoing events and learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon conferences, inspire us to never stop embracing our uniqueness. Diverse Experiences Amazon values diverse experiences. Even if you do not meet all of the preferred qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying. Mentorship & Career Growth We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional. Work/Life Balance We value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why we strive for flexibility as part of our working culture. When we feel supported in the workplace and at home, there’s nothing we can’t achieve in the cloud. Export Control Requirement Due to applicable export control laws and regulations, candidates must be either a U.S. citizen or national, U.S. permanent resident (i.e., current Green Card holder), or lawfully admitted into the U.S. as a refugee or granted asylum, or be able to obtain a US export license. If you are unsure if you meet these requirements, please apply and Amazon will review your application for eligibility.
FR, Courbevoie
Are you a MS or PhD student interested in a 2026 internship in the field of machine learning, deep learning, generative AI, large language models, speech technology, robotics, computer vision, optimization, operations research, quantum computing, automated reasoning, or formal methods? If so, we want to hear from you! We are looking for students interested in using a variety of domain expertise to invent, design and implement state-of-the-art solutions for never-before-solved problems. You can find more information about the Amazon Science community as well as our interview process via the links below; https://www.amazon.science/ https://amazon.jobs/content/en/career-programs/university/science https://amazon.jobs/content/en/how-we-hire/university-roles/applied-science Key job responsibilities As an Applied Science Intern, you will own the design and development of end-to-end systems. You’ll have the opportunity to write technical white papers, create roadmaps and drive production level projects that will support Amazon Science. You will work closely with Amazon scientists and other science interns to develop solutions and deploy them into production. You will have the opportunity to design new algorithms, models, or other technical solutions whilst experiencing Amazon’s customer focused culture. The ideal intern must have the ability to work with diverse groups of people and cross-functional teams to solve complex business problems. A day in the life At Amazon, you will grow into the high impact person you know you’re ready to be. Every day will be filled with developing new skills and achieving personal growth. How often can you say that your work changes the world? At Amazon, you’ll say it often. Join us and define tomorrow. Some more benefits of an Amazon Science internship include; • All of our internships offer a competitive stipend/salary • Interns are paired with an experienced manager and mentor(s) • Interns receive invitations to different events such as intern program initiatives or site events • Interns can build their professional and personal network with other Amazon Scientists • Interns can potentially publish work at top tier conferences each year About the team Applicants will be reviewed on a rolling basis and are assigned to teams aligned with their research interests and experience prior to interviews. Start dates are available throughout the year and durations can vary in length from 3-6 months for full time internships. This role may available across multiple locations in the EMEA region (Austria, Estonia, France, Germany, Ireland, Israel, Italy, Jordan, Luxembourg, Netherlands, Poland, Romania, South Africa, Spain, Sweden, UAE, and UK). Please note these are not remote internships.
US, WA, Seattle
Amazon's Pricing & Promotions Science is seeking a driven Applied Scientist to harness planet scale multi-modal datasets, and navigate a continuously evolving competitor landscape, in order to regularly generate fresh customer-relevant prices on billions of Amazon and Third Party Seller products worldwide. We are looking for a talented, organized, and customer-focused applied researchers to join our Pricing and Promotions Optimization science group, with a charter to measure, refine, and launch customer-obsessed improvements to our algorithmic pricing and promotion models across all products listed on Amazon. This role requires an individual with exceptional machine learning and reinforcement learning modeling expertise, excellent cross-functional collaboration skills, business acumen, and an entrepreneurial spirit. We are looking for an experienced innovator, who is a self-starter, comfortable with ambiguity, demonstrates strong attention to detail, and has the ability to work in a fast-paced and ever-changing environment. Key job responsibilities - See the big picture. Understand and influence the long term vision for Amazon's science-based competitive, perception-preserving pricing techniques - Build strong collaborations. Partner with product, engineering, and science teams within Pricing & Promotions to deploy machine learning price estimation and error correction solutions at Amazon scale - Stay informed. Establish mechanisms to stay up to date on latest scientific advancements in machine learning, neural networks, natural language processing, probabilistic forecasting, and multi-objective optimization techniques. Identify opportunities to apply them to relevant Pricing & Promotions business problems - Keep innovating for our customers. Foster an environment that promotes rapid experimentation, continuous learning, and incremental value delivery. - Successfully execute & deliver. Apply your exceptional technical machine learning expertise to incrementally move the needle on some of our hardest pricing problems. A day in the life We are hiring an applied scientist to drive our pricing optimization initiatives. The Price Optimization science team drives cross-domain and cross-system improvements through: - invent and deliver price optimization, simulation, and competitiveness tools for Sellers. - shape and extend our RL optimization platform - a pricing centric tool that automates the optimization of various system parameters and price inputs. - Promotion optimization initiatives exploring CX, discount amount, and cross-product optimization opportunities. - Identifying opportunities to optimally price across systems and contexts (marketplaces, request types, event periods) Price is a highly relevant input into many partner-team architectures, and is highly relevant to the customer, therefore this role creates the opportunity to drive extremely large impact (measured in Bs not Ms), but demands careful thought and clear communication. About the team About the team: the Pricing Discovery and Optimization team within P2 Science owns price quality, discovery and discount optimization initiatives, including criteria for internal price matching, price discovery into search, p13N and SP, pricing bandits, and Promotion type optimization. We leverage planet scale data on billions of Amazon and external competitor products to build advanced optimization models for pricing, elasticity estimation, product substitutability, and optimization. We preserve long term customer trust by ensuring Amazon's prices are always competitive and error free.
US, CA, Sunnyvale
Amazon's Industrial Robotics Group is seeking exceptional talent to help develop the next generation of advanced robotics systems that will transform automation at Amazon's scale. We're building revolutionary robotic systems that combine innovative AI, sophisticated control systems, and advanced mechanical design to create adaptable automation solutions capable of working safely alongside humans in dynamic environments. This is a unique opportunity to shape the future of robotics and automation at unprecedented scale, working with world-class teams pushing the boundaries of what's possible in robotic manipulation, locomotion, and human-robot interaction. This role presents an opportunity to shape the future of robotics through innovative applications of deep learning and large language models. At Industrial Robotics Group we leverage advanced robotics, machine learning, and artificial intelligence to solve complex operational challenges at unprecedented scale. Our fleet of robots operates across hundreds of facilities worldwide, working in sophisticated coordination to fulfill our mission of customer excellence. We are pioneering the development of robotics foundation models that: • Enable unprecedented generalization across diverse tasks • Enable unprecedented robustness and reliability, industry-ready • Integrate multi-modal learning capabilities (visual, tactile, linguistic) • Accelerate skill acquisition through demonstration learning • Enhance robotic perception and environmental understanding • Streamline development processes through reusable capabilities The ideal candidate will contribute to research that bridges the gap between theoretical advancement and practical implementation in robotics. You will be part of a team that's revolutionizing how robots learn, adapt, and interact with their environment. Join us in building the next generation of intelligent robotics systems that will transform the future of automation and human-robot collaboration. As an Applied Science Manager in the Foundation Model team, you will build and lead a team that develops and improves machine learning systems that help robots perceive, reason, and act in real-world environments. You will set the technical direction for leveraging state-of-the-art models (open source and internal research), evaluating them on representative tasks, and adapting/optimizing them to meet robustness, safety, and performance needs. You will drive the capability roadmap and the evaluation strategy that defines “what the robot brain can do,” and you will sponsor targeted innovation when gaps remain. You’ll collaborate closely with research, controls, hardware, and product teams, and ensure the team’s outputs can be further customized and deployed by downstream teams on specific robot embodiments. Key job responsibilities • Build and lead a team responsible for the best foundation models (visuomotor / VLA / worldmodel-action policies), and grow capability through hiring, coaching, and bar-raising. • Own the technical roadmap and portfolio strategy: proactively track SOTA (open-source + internal research), decide what to adopt, and drive targeted innovation where gaps persist; • Establish the capability control plane: define evaluation strategy, benchmarks, scorecards, and regression practices that profile what the robot FMs can do across sim + real and guide investment decisions. • Drive embodiment readiness for FMs: ensure models can be adapted/optimized for target embodiments (interfaces, latency/throughput, robustness, safety constraints) and that outputs are consumable by downstream teams for robot-specific finetuning and deployment. • Lead the data & training strategy: set standards for data governance/provenance/quality, define data needs for closing key gaps, and ensure efficient training/fine-tuning pipelines and experimentation velocity. • Partner across the org: collaborate with research teams (to transition new methods), and with controls/WBC, hardware, and product teams (to align interfaces, constraints, milestones, and integration plans). • Communicate and deliver: produce clear technical narratives (roadmaps, design docs, evaluation readouts), manage execution toward milestones, and ensure high-quality handoffs.