How some of AWS's most innovative customers are using computer vision technologies

From counting fish to identifying touchdowns, AWS customers are utilizing computer vision and pattern recognition technologies to improve business processes and customer experiences.

Computer vision, the automatic recognition and description of images and video, has far-reaching applications, from defect detection on high-speed assembly lines and use in autonomous robots to the analysis of medical images and the identification of products and people in social media. This week, in line with the IEEE Computer Vision and Pattern Recognition (CVPR) conference, we’ve rounded up examples of how some of AWS's most innovative customers are using computer vision and pattern recognition technologies to improve business processes and customer experiences. The approaches range from data scientists building custom vision models with Amazon SageMaker to application developers using Amazon Rekognition and Amazon Textract to embed computer vision into their applications.

Advertising

REA Group has developed an image compliance system that automatically detects any noncompliance and notifies home sellers.
fstop123/Getty Images

In advertising and other online media, computer vision can automate content moderation. REA Group, a multinational digital advertising company specializing in property and real estate, provides search-based portals that enable property sellers to upload images of properties on the market to deliver a wide, searchable selection to their consumers. REA Group discovered that images uploaded to their portal often weren’t compliant with their usage terms. Some images included trademarks or contact details of the sellers, which created lead attribution challenges. They set up a dedicated team of individuals to manually review the images for unapproved content, but the large volume of daily uploads and the additional review process delayed the property listing time by several days. The REA team developed an image compliance system that automatically detects any noncompliance and notifies sellers. To augment their existing machine learning models, they're using Amazon Rekognition Text in Image, which detects and extracts text in images, enabling them to increase the accuracy of detecting noncompliance and reduce false positives by more than 56 percent. They added business rules that factored in a variety of predictions from their own models, and from Amazon Rekognition, to enable automated decision-making.
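The idea of layering business rules over several predictions can be sketched as follows. This is a minimal illustration, not REA Group's actual system: it assumes the input is a dictionary shaped like the response of the Amazon Rekognition `DetectText` operation (a `TextDetections` list whose items carry `DetectedText`, `Type`, and `Confidence`), and the regular expressions, the `logo_score` input from an in-house model, and both thresholds are hypothetical.

```python
import re

# Hypothetical patterns for seller contact details; a real system would tune these.
PHONE_RE = re.compile(r"(?:\+?\d[\s\-.]?){8,}")
EMAIL_RE = re.compile(r"[\w.+-]+@[\w-]+\.[\w.-]+")
URL_RE = re.compile(r"(?:https?://|www\.)\S+", re.IGNORECASE)


def find_contact_text(detect_text_response, min_confidence=80.0):
    """Return detected lines of text that look like contact details.

    Only LINE-level detections at or above min_confidence are considered,
    so that the individual WORD detections are not counted twice.
    """
    hits = []
    for detection in detect_text_response.get("TextDetections", []):
        if detection.get("Type") != "LINE":
            continue
        if detection.get("Confidence", 0.0) < min_confidence:
            continue
        text = detection.get("DetectedText", "")
        if PHONE_RE.search(text) or EMAIL_RE.search(text) or URL_RE.search(text):
            hits.append(text)
    return hits


def is_noncompliant(detect_text_response, logo_score, logo_threshold=0.9):
    """Hypothetical business rule combining two predictions.

    An image is flagged if contact text is found in it, or if an in-house
    trademark model (assumed to return a score in [0, 1]) is confident.
    """
    has_contact_text = bool(find_contact_text(detect_text_response))
    return has_contact_text or logo_score >= logo_threshold
```

In practice the dictionary would come from a call such as `boto3.client("rekognition").detect_text(Image=...)`; keeping the rule as a pure function over the response makes it easy to test without AWS credentials.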

Agriculture

Aquabyte's machine learning algorithms can estimate how much a fish weighs while still in the water.

Agriculture has also benefited from computer vision. Fish farming is one of the most efficient sources of protein, since a pound of feed equates to nearly a pound of protein. But the cold, dark waters of fish habitats make it nearly impossible to effectively manage these farms from the surface. Historically, fish farmers have had to randomly scoop fish out of the water to measure their weight and check for disease. Aquabyte’s machine learning solution reimagines this process by using underwater cameras that keep tabs on the fish and compare photos of them over time. The machine learning algorithms, running on Amazon SageMaker, can estimate how much each fish weighs while it’s still in the water. The system can also monitor the fish for sea lice, a parasite that is a major problem in salmon farms, and the subject of significant regulation in Norway, where the bulk of Aquabyte’s client base currently operates. Without a solution like Aquabyte, managing sea lice amounts to nearly a quarter of the cost of operating a salmon farm. Aquabyte’s cameras have counted 2 million sea lice to date, the result of billions of images being captured. The Aquabyte team has been working on methods that would allow farmers to track individual fish for growth-tracking and breeding purposes. In the future, machine learning might even help automate elements of the farms by intelligently distributing fish feed, for example.

Autonomous driving

DeepMap is focused on solving the mapping and localization challenge for autonomous vehicles.

Industries like autonomous driving wouldn’t even be possible without computer vision. Perhaps you think the world is already sufficiently mapped. With the advent of satellite images and Google Street View, it seems like every square inch of the globe is represented in data. But for autonomous vehicles, much of the world is uncharted territory. That’s because the maps designed for humans “can’t be consumed by robots,” says Tom Wang, the director of engineering at DeepMap, a Palo Alto startup focused on solving the mapping and localization challenge for autonomous vehicles. According to Wang, these new kinds of vehicles need higher-precision maps with richer semantics, such as traffic signals, many different traffic signs, driving boundaries, and connecting lanes. For DeepMap, computer vision is critical. The company needs to run a vast volume of image detections to automatically generate a comprehensive list of map features and detect dynamic road changes. Using Amazon SageMaker, DeepMap updates training models within a day and runs image detection on tens of millions of images daily to keep up with ever-changing conditions.

Education

Certipass was able to build their solution in under 30 days, enabling all their testing centers to test candidates online during the COVID-19 pandemic.
fizkes/Getty Images/iStockphoto

In the wake of the COVID-19 pandemic, many educational institutions needed to quickly pivot to the online proctoring of exams, leading to a need for new ways to verify identification. Certipass, a UNI ISO standards accredited body for the certification of digital skills, is the primary provider of the international digital competency certification, the European Informatics Passport (EIPASS).

Since the EIPASS Certification is an international standard, Certipass has made it their mission to ensure maximum security, objectivity, transparency, and fairness during the entire online evaluation process. Certipass used Amazon Rekognition for automated candidate identity verification during tests that are in line with the e-Competence Framework for Information and Communication Technology (CEN) and the Digital Competence Framework for Citizens (Joint Research Centre). They were able to build the solution in under 30 days to enable all their testing centers to test candidates online during COVID-19.

Financial services

Aella Credit provides easy access to credit in emerging markets using biometric, employer, and mobile phone data.
Victor Karanja/Getty Images

In financial services, Aella Credit provides easy access to credit in emerging markets using biometric, employer, and mobile phone data. For those in emerging markets, identity verification and validation are among the major challenges to accessing retail banking services. How can you know that people are who they say they are in communities that don't have proper identification systems? Aella Credit uses Amazon Rekognition to analyze images to verify a customer’s identity and give them access to financial and healthcare services with minimal friction. Amazon Rekognition helps to automate video and image analysis, with no machine learning expertise required. Verifying someone’s identity manually would have taken days; it now happens in seconds. Customers can receive their loan in their account in less than five minutes, broadening access to credit.
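One way such an image-based identity check could work is by comparing a reference photo with a freshly captured selfie. The sketch below is an assumption for illustration, since the article does not say which Rekognition operation Aella Credit uses. It takes a dictionary shaped like the response of the Amazon Rekognition `CompareFaces` operation (`FaceMatches` items carrying a `Similarity` percentage, plus `UnmatchedFaces`); the acceptance rule and the 95 percent threshold are hypothetical.

```python
def is_identity_verified(compare_faces_response, min_similarity=95.0):
    """Decide whether a selfie matches a reference photo.

    Hypothetical rule: accept only when the selfie contains exactly one
    face, that face matches the reference, and the similarity reported
    for it is at or above min_similarity (a percentage from 0 to 100).
    """
    matches = compare_faces_response.get("FaceMatches", [])
    unmatched = compare_faces_response.get("UnmatchedFaces", [])

    # Reject images with extra faces or with no matching face at all.
    if len(matches) != 1 or unmatched:
        return False

    return matches[0].get("Similarity", 0.0) >= min_similarity
```

In a real pipeline the dictionary would come from something like `boto3.client("rekognition").compare_faces(SourceImage=..., TargetImage=...)`, and a production system would likely add further checks that are out of scope here.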

Financial technology

To make sure users are getting the largest possible tax refund, Intuit incorporates machine learning throughout the TurboTax experience to help users file their taxes more efficiently. TurboTax uses machine learning to shorten the filing process, which takes an average of 13 hours.

TurboTax utilizes machine learning to shorten the filing process.
simpson33/Getty Images/iStockphoto

With Intuit’s computer vision capabilities supported by Amazon Textract, entering information from tax forms like W-2s or 1099s takes seconds. Rather than a user having to enter form fields manually, the service scans pictures of the forms and digitizes them. Then, using contextual data from TurboTax’s existing database of tax codes and compliance forms, Amazon Textract verifies accuracy and identifies any anomalies or missing data for the user.
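The digitize-then-check flow can be sketched in a few lines. This is an illustration under stated assumptions, not Intuit's pipeline: the input is a dictionary shaped like an Amazon Textract `AnalyzeDocument` response requested with the `FORMS` feature, in which `KEY_VALUE_SET` blocks link keys to values and to their `WORD` children through `Relationships`. The function names and the list of required fields are hypothetical, and selection elements such as checkboxes are ignored for brevity.

```python
def _text_of(block, blocks_by_id):
    """Join the text of a block's WORD children, in order."""
    words = []
    for rel in block.get("Relationships", []):
        if rel["Type"] != "CHILD":
            continue
        for child_id in rel["Ids"]:
            child = blocks_by_id[child_id]
            if child["BlockType"] == "WORD":
                words.append(child["Text"])
    return " ".join(words)


def extract_form_fields(response):
    """Map each form key found in the response to its value text."""
    blocks_by_id = {b["Id"]: b for b in response.get("Blocks", [])}
    fields = {}
    for block in blocks_by_id.values():
        is_key = (block["BlockType"] == "KEY_VALUE_SET"
                  and "KEY" in block.get("EntityTypes", []))
        if not is_key:
            continue
        value = ""
        for rel in block.get("Relationships", []):
            if rel["Type"] == "VALUE":
                value = " ".join(
                    _text_of(blocks_by_id[i], blocks_by_id) for i in rel["Ids"]
                )
        fields[_text_of(block, blocks_by_id)] = value
    return fields


def missing_fields(fields, required):
    """Return required field names that are absent or have empty values."""
    return [name for name in required if not fields.get(name, "").strip()]
```

A caller would pass the extracted fields and a list of expected labels, for example `missing_fields(fields, ["Employer name", "Wages"])`, and prompt the user for anything returned.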

Healthcare

By combining the power of machine learning and computer vision, an interdisciplinary team of researchers at Duke University has created a faster, less expensive, more reliable, and more accessible system to screen children for autism spectrum disorder.

Machine learning plays a key role in many health-related realms, from providers and payers looking to expedite the care continuum to pharma and biotech researchers looking to reduce costs and speed up the drug discovery and disease detection process. Researchers at the Duke Center for Autism and Brain Development are using machine learning to screen for autism spectrum disorder (ASD) in children. It’s critically important to diagnose ASD as early in a child’s development as possible: starting treatment for ASD at an age of 18 to 24 months can increase a child’s IQ by up to 17 points, in some cases moving them into the “average” child IQ range of 90-110 (or above it), and, in turn, significantly improve their quality of life. Currently, children may wait until well after their third birthday to receive a diagnosis. By combining the power of machine learning and computer vision, powered by AWS, an interdisciplinary team of researchers at Duke University has created a faster, less expensive, more reliable, and more accessible system to screen children for ASD.

Media and entertainment

Computer vision technology is helping sports organizations like the National Football League (NFL) improve the game for fans. The NFL works with AWS to develop real-time, state-of-the-art cloud technology leveraging machine learning and artificial intelligence to increase the efficiency and pace of the game.

For example, deep learning and computer vision technologies are being explored to aid game officiating, including real-time football tracking. Within days, AWS and NFL scientists were able to create custom training data sets of thousands of images extracted from NFL broadcast game footage using Amazon SageMaker Ground Truth.

Deep learning and computer vision technologies are being explored by the NFL to aid game officiating, including real-time football tracking.
CREDIT: National Football League

Working with the Amazon ML Solutions Lab, the team used Amazon SageMaker and GluonCV with MXNet to train and optimize several state-of-the-art deep learning-based object detection models, such as Faster R-CNN and YOLOv3, to accurately detect the football across video frames. This led to a first-of-its-kind football tracking model that performs well in a number of complex scenarios, such as when the ball is highly occluded or is only partially visible in different camera angles.

The NFL also uses computer vision to more easily and quickly search through thousands of media assets. The NFL photo team, the official photographers of the NFL, has millions of photos in its archive and generates 500,000 photos each season. Working manually, they were able to tag 50,000 images over 18 months. By using Amazon Rekognition features, including custom face collections, text in image, object detection, and Custom Labels (an automated machine learning object detection service), they were able to apply detailed tags for players, teams, objects, action, jerseys, location, and more to their entire photo collection in a fraction of the time it took previously. This allowed them to make these photos searchable and usable by everyone in the company in ways that weren't possible before.

For Sportradar, the global provider of sports and intelligence for the betting and media industries providing data coverage from more than 200,000 events annually, advances in computer vision are an opportunity to expand the depth of sports data offered to customers and reduce the costs of data collection through automation.

For Sportradar, advances in computer vision are an opportunity to expand the depth of sports data offered to customers and reduce the costs of data collection through automation.
scyther5/Getty Images/iStockphoto

Sportradar is investing in computer vision research both through internal development and external partnerships to build computer vision data collection capabilities with an initial focus on tennis, soccer and snooker. Working with the Amazon ML Solutions Lab, Sportradar is exploring the application of state-of-the-art deep learning models for automated match event detection in soccer, moving beyond player and ball localization to understanding the intent of the play in terms of what is happening in the game.

To bring this technology into production as it matures, Sportradar is leveraging AWS services including Amazon SageMaker, Amazon EKS, Amazon MSK, and Amazon FSx, along with Amazon’s broad range of GPU and CPU compute instances, for its computer vision processing pipeline. This infrastructure allows Sportradar's researchers to test and validate computer vision models at scale and bring models from the lab to production with minimal effort while delivering the low latency, reliability, and scalability needed for live sports betting use cases.

You can find more ways that AWS customers are innovating with computer vision here. More information about Amazon's participation at CVPR is available here.
