Image shows Amazon science intern Michael Saxon standing in front of two office buildings
Michael Saxon, an Amazon science intern, is completing his PhD in computer science at the University of California, Santa Barbara, with a core focus on natural language processing.

“Alexa, how do you know everything?”

How Amazon intern Michael Saxon uses his experience with automatic speech recognition models to help Alexa answer complex queries.

“Alexa, play ‘Rhapsody in Blue’.”

“Playing ‘Rhapsody in Blue’.”

Customers often describe this kind of interaction with Alexa as magical; less than a decade ago it would have seemed fanciful.

A black and white profile shot of the Nobel Prize-winning biologist Peter Medawar
By Digitised for CODEBREAKERS, MAKERS OF MODERN GENETICS
The Nobel Prize-winning biologist Peter Medawar published "Advice to a Young Scientist" in 1979. Here are some of Medawar’s key insights from the book.

One component of the science behind Alexa is automatic speech recognition — the process that Alexa utilizes to interpret semantic meaning from a speech signal. And scientists like Michael Saxon, PhD student and three-time Amazon applied science intern, encounter interesting challenges when a customer’s request is more complex than asking for a song to play.

Saxon is one of more than 10,000 interns Amazon hosted virtually this summer. More than 10 percent of those internships were for applied science and data science roles with teams across the company. The majority of science-related internships run between 12 and 16 weeks.

A growing interest in NLP

Saxon completed his undergraduate degree in electrical engineering and received a master’s in computer engineering at Arizona State University. He’s now completing his PhD in computer science at the University of California, Santa Barbara, with a core focus on natural language processing (NLP).

He became interested in speech and NLP as an undergrad; in his final year, a professor recruited him for a project. Saxon studied the progression of neurological disorders by using automatic speech recognition models to detect and track hypernasality in dysarthric speech

Saxon later met some Amazon recruiters who were looking for applied science interns at the AAAI Conference on Artificial Intelligence. “Based on my interests in speech and NLP, they offered for me to join the Alexa Hybrid Science team in Pittsburgh,” Saxon says. “And my experience with automatic speech recognition models was a plus.”

Solving end-to-end SLU

A core research direction of the Alexa Hybrid Science team has been the development of neural end-to-end spoken language understanding (SLU) models. For his 2019 internship project, Saxon was given a task that seemed relatively easy to him at the outset: develop an end-to-end intent SLU system that can make a decision after hearing as few words as possible.

However, he found the project proved to be deceptively difficult. Using training data, Saxon and the team were unable to replicate high-performance results from prior SLU publications.

Toward the end of the summer 2019 internship, the team identified the reason why. There was a mismatch between levels of semantic complexity in the training data and the publicly available datasets from the existing literature.

Semantic complexity refers to the number of possible expressions and their various meanings that a collection of language data contains. The more semantically complex the collection, the more ways a program can interpret a single utterance from it.

Due to their relatively low semantic complexity, the publicly available datasets required less training data and ultimately restricted the research systems to choose from a fixed list of predetermined exact command permutations.

Saxon’s team applied the model architecture from the existing literature to Amazon’s training data, which has much higher semantic complexity.

“We found for similarly sized datasets, and similar architectures, that we couldn’t reproduce these strong results from prior work, and we suspected that it was due to this semantic-complexity mismatch,” says Saxon. “The models were fundamentally designed for domains with lower semantic complexity.”

However, this setback in his first internship project inspired the direction for the next one.

Getting results

When Saxon returned to the Alexa Hybrid Science team for his second internship in January 2020, the team hit the ground running. While he was finishing his master’s coursework at ASU, the team began a research effort toward demonstrating usable measures of semantic complexity to facilitate objective comparisons of SLU tasks.

To produce useful measures, the team needed to compare the relationship between an SLU task’s complexity measures and the accuracy they could achieve with a model if they applied it to different datasets, each less semantically complex than the last.

The team artificially generated datasets of different levels of semantic complexity by repeatedly removing batches of rare words. This led to a continuum of virtual SLU problems ranging from Alexa-level tasks in large artificial datasets to effectively spotting keywords from a short list.

Michael Saxon and team published their findings on the importance of contextualizing results to demonstrate an SLU system’s scope of applicability in “Semantic Complexity in End-to-End Spoken Language Understanding”.

“There is a strong, nearly linear relationship between these semantic complexity measures and the maximum accuracy we were able to get across several different models,” Saxon says. “So that suggests that there is a fundamental relationship between a given model’s performance ceiling and the semantic complexity of the task it solves.”

Saxon and team published their findings on the importance of contextualizing results to demonstrate an SLU system’s scope of applicability in “Semantic Complexity in End-to-End Spoken Language Understanding” and presented them at Interspeech 2020.  

Considering the challenges of semantic complexity, the team then set out to develop an end-to-end model for generalized SLU that could enable voice assistants like Alexa to process any utterance with improved accuracy over other models.

"End-to-end spoken language understanding for generalized voice assistants" presents an approach to developing an E2E model for generalized SLU in commercial voice assistants.

The result: a second publication, “End-to-End Spoken Language Understanding for Generalized Voice Assistants.” The team produced an end-to-end SLU system that could both be pretrained on speech and accept the drop-in insertion of a large language model. This allowed the team to separately adjust the system’s transcription and interpretation capabilities.

Consequently, the system could process many more combinations of intent and argument interpretations. Of note, the SLU system’s speech-to-interpretation accuracy achieved a 43 percent improvement over similarly capable end-to-end baselines.

Answering any question using the web

This summer, Saxon is completing his third applied science internship at Amazon, working remotely for the Alexa AI team in Manhattan Beach, Calif. The team’s work focuses on getting Alexa to provide highly accurate responses to customers’ questions. 

“I’ve been on this journey where I've started on the speech side of things and transitioned further down the technology stack to where I am now in the web information domain, where there are still echoes of this previous work,” explains Saxon.

Michael’s internship helped us build substantial expertise and reach the level of maturity that we have in the team today in end-to-end SLU.
Athanasios Mouchtaris

The challenge this time involves an even more semantically complex use case: the Alexa AI team needs to train web information–based models that can correctly answer any possible question — even the most confounding ones — so that Alexa can provide useful responses to customers’ questions.

Often, the most important words in a question sentence that an ASR system needs to transcribe correctly are very rare. They increase the sentence’s semantic complexity and are also the hardest words for the system to transcribe.

Without correctly hearing one of those words, the system won’t be able to answer the question. Saxon’s current work brings his previous experiences in end-to-end SLU to bear on this task.

“Michael’s internship helped us build substantial expertise and reach the level of maturity that we have in the team today in end-to-end SLU,” says his former manager, Athanasios Mouchtaris. “Everything we learned from Michael’s work during his internship was crucial to our success.”

Looking ahead

Having only completed the first year of his PhD, Saxon is still in an exploratory phase of finding a research direction. He has four years left of his PhD and intends to complete additional internships — and he said he can see himself returning to Amazon again.

“I’ve really bought into the leadership principles and culture here. And I particularly like the emphasis on taking ownership and ‘disagree and commit,’ which have served me well during these research projects,” he says. “I would definitely consider coming back for full-time work after I graduate.”

Amazon hosted more than 10,000 interns virtually this summer. If you’re a student with interest in an Amazon internship, you can learn more about internship opportunities at Amazon Student Programs.

Research areas

Related content

US, WA, Seattle
The Artificial General Intelligent team (AGI) seeks a passionate, talented, and resourceful Applied Scientist in the field of LLM, Artificial Intelligence (AI), Natural Language Processing (NLP) and/or Information Retrieval, to invent and build scalable solutions for a state-of-the-art context-aware conversational AI. As part of this team, you will collaborate with talented peers to create scalable solutions for an innovative conversational assistant, aiming to revolutionize user experiences for millions of Alexa customers. The ideal candidate possesses a solid understanding of machine learning fundamentals and a passion for pushing boundaries in the field. They thrive in fast-paced environments, possess the drive to tackle complex challenges, and excel at swiftly delivering impactful solutions while iterating based on user feedback. Join us in our mission to redefine industry standards and provide unparalleled experiences for our customers. Key job responsibilities . You will analyze, understand and improve user experiences by leveraging Amazon’s heterogeneous data sources and large-scale computing resources to accelerate advances in artificial intelligence. . You will work on core LLM technologies, including developing best-in-class modeling, prompt optimization algorithms to enable Conversation AI use cases · Build and measure novel online & offline metrics for personal digital assistants and customer scenarios, on diverse devices and endpoints · Create, innovate and deliver deep learning, policy-based learning, and/or machine learning based algorithms to deliver customer-impacting results · Perform model/data analysis and monitor metrics through online A/B testing · Research and implement novel machine learning and deep learning algorithms and models. We are open to hiring candidates to work out of one of the following locations: Bellevue, WA, USA | Boston, MA, USA | Seattle, WA, USA
GB, London
Amazon Advertising is looking for a Senior Applied Scientist to join its brand new initiative that powers Amazon’s contextual advertising product. Advertising at Amazon is a fast-growing multi-billion dollar business that spans across desktop, mobile and connected devices; encompasses ads on Amazon and a vast network of hundreds of thousands of third party publishers; and extends across US, EU and an increasing number of international geographies. We are looking for a dynamic, innovative and accomplished Senior Applied Scientist to work on machine learning and data science initiatives for contextual data processing and classification that power our contextual advertising solutions. Are you excited by the prospect of analyzing terabytes of data and leveraging state-of-the-art data science and machine learning techniques to solve real world problems? Do you like to own business problems/metrics of high ambiguity where yo get to define the path forward for success of a new initiative? As an applied scientist, you will invent ML and Artificial General Intelligence based solutions to power our contextual classification technology. As this is a new initiative, you will get an opportunity to act as a thought leader, work backwards from the customer needs, dive deep into data to understand the issues, conceptualize and build algorithms and collaborate with multiple cross-functional teams. Key job responsibilities * Design, prototype and test many possible hypotheses in a high-ambiguity environment, making use of both analysis and business judgment. * Collaborate with software engineering teams to integrate successful experiments into large-scale, highly complex Amazon production systems. * Promote the culture of experimentation and applied science at Amazon. * Demonstrated ability to meet deadlines while managing multiple projects. * Excellent communication and presentation skills working with multiple peer groups and different levels of management * Influence and continuously improve a sustainable team culture that exemplifies Amazon’s leadership principles. About the team The Supply Quality organization has the charter to solve optimization problems for ad-programs in Amazon and ensure high-quality ad-impressions. We develop advanced algorithms and infrastructure systems to optimize performance for our advertisers and publishers. We are focused on solving a wide variety of problems in computational advertising like Contextual data processing and classification, traffic quality prediction (robot and fraud detection), Security forensics and research, Viewability prediction, Brand Safety and experimentation. Our team includes experts in the areas of distributed computing, machine learning, statistics, optimization, text mining, information theory and big data systems. We are open to hiring candidates to work out of one of the following locations: London, GBR
ES, M, Madrid
At Amazon, we are committed to being the Earth’s most customer-centric company. The International Technology group (InTech) owns the enhancement and delivery of Amazon’s cutting-edge engineering to all the varied customers and cultures of the world. We do this through a combination of partnerships with other Amazon technical teams and our own innovative new projects. You will be joining the Tools and Machine learning (Tamale) team. As part of InTech, Tamale strives to solve complex catalog quality problems using challenging machine learning and data analysis solutions. You will be exposed to cutting edge big data and machine learning technologies, along to all Amazon catalog technology stack, and you'll be part of a key effort to improve our customers experience by tackling and preventing defects in items in Amazon's catalog. We are looking for a passionate, talented, and inventive Scientist with a strong machine learning background to help build industry-leading machine learning solutions. We strongly value your hard work and obsession to solve complex problems on behalf of Amazon customers. Key job responsibilities We look for applied scientists who possess a wide variety of skills. As the successful applicant for this role, you will with work closely with your business partners to identify opportunities for innovation. You will apply machine learning solutions to automate manual processes, to scale existing systems and to improve catalog data quality, to name just a few. You will work with business leaders, scientists, and product managers to translate business and functional requirements into concrete deliverables, including the design, development, testing, and deployment of highly scalable distributed services. You will be part of team of 5 scientists and 13 engineers working on solving data quality issues at scale. You will be able to influence the scientific roadmap of the team, setting the standards for scientific excellence. You will be working with state-of-the-art models, including image to text, LLMs and GenAI. Your work will improve the experience of millions of daily customers using Amazon in Europe and in other regions. You will have the chance to have great customer impact and continue growing in one of the most innovative companies in the world. You will learn a huge amount - and have a lot of fun - in the process! This position will be based in Madrid, Spain We are open to hiring candidates to work out of one of the following locations: Madrid, M, ESP
US, WA, Seattle
Join us in the evolution of Amazon’s Seller business! The Selling Partner Recruitment and Success organization is the growth and development engine for our Store. Partnering with business, product, and engineering, we catalyze SP growth with comprehensive and accurate data, unique insights, and actionable recommendations and collaborate with WW SP facing teams to drive adoption and create feedback loops. We strongly believe that any motivated SP should be able to grow their businesses and reach their full potential by using our scaled, automated, and self-service tools. We aim to accelerate the growth of Sellers by providing tools and insights that enable them to make better and faster decisions at each step of selection management. To accomplish this, we offer intelligent insights that are both detailed and actionable, allowing Sellers to introduce new products and engage with customers effectively. We leverage extensive structured and unstructured data to generate science-based insights about their business. Furthermore, we provide personalized recommendations tailored to individual Sellers' business objectives in a user-friendly format. These insights and recommendations are integrated into our products, including Amazon Brand Analytics (ABA), Product Opportunity Explorer (OX), and Manage Your Growth (MYG). We are looking for a talented and passionate Sr. Research Scientist to lead our research endeavors and develop world-class statistical and machine learning models. The successful candidate will work closely with Product Managers (PM), User Experience (UX) designers, engineering teams, and Seller Growth Consulting teams to provide actionable insights that drive improvements in Seller businesses. Key job responsibilities You set the standard for scientific excellence and make decisions that affect the way we build and integrate algorithms. Your solutions are exemplary in terms of algorithm design, clarity, model structure, efficiency, and extensibility. You tackle intrinsically hard problems, acquiring expertise as needed. You decompose complex problems into straightforward solutions. About the team The Seller Growth science team aims to provide data and science solutions to drive Seller growth and create better Seller experiences. We structure our science domain with three key themes and two horizontal components. We discover the opportunity space by identifying opportunities with unrealized potential, then generate actionable analytics to identify high value actions (HVAs) that unlock the opportunity space, and finally, empower Sellers with personalized Growth Plans and differentiated treatment that help them realize their potential. We are open to hiring candidates to work out of one of the following locations: Seattle, WA, USA
US, WA, Redmond
Project Kuiper is an initiative to increase global broadband access through a constellation of 3,236 satellites in low Earth orbit (LEO). Its mission is to bring fast, affordable broadband to unserved and underserved communities around the world. Project Kuiper will help close the digital divide by delivering fast, affordable broadband to a wide range of customers, including consumers, businesses, government agencies, and other organizations operating in places without reliable connectivity. As an Applied Scientist on the team you will responsible for building out and maintaining the algorithms and software services behind one of the world’s largest satellite constellations. You will be responsible for developing algorithms and applications that provide mission critical information derived from past and predicted satellite orbits to other systems and organizations rapidly, reliably, and at scale. You will be focused on contributing to the design and analysis of software systems responsible across a broad range of areas required for automated management of the Kuiper constellation. You will apply knowledge of mathematical modeling, optimization algorithms, astrodynamics, state estimation, space systems, and software engineering across a wide variety of problems to enable space operations at an unprecedented scale. You will develop features for systems to interface with internal and external teams, predict and plan communication opportunities, manage satellite orbits determination and prediction systems, develop analysis and infrastructure to monitor and support systems performance. Your work will interface with various subsystems within Project Kuiper and Amazon, as well as with external organizations, to enable engineers to safely and efficiently manage the satellite constellation. The ideal candidate will be detail oriented, strong organizational skills, able to work independently, juggle multiple tasks at once, and maintain professionalism under pressure. You should have proven knowledge of mathematical modeling and optimization along with strong software engineering skills. You should be able to independently understand customer requirements, and use data-driven approaches to identify possible solutions, select the best approach, and deliver high-quality applications. Export Control Requirement: Due to applicable export control laws and regulations, candidates must be a U.S. citizen or national, U.S. permanent resident (i.e., current Green Card holder), or lawfully admitted into the U.S. as a refugee or granted asylum. About the team The Constellation Management & Space Safety team maintains and builds the software services responsible for maintaining situational awareness of Kuiper satellites through their entire lifecycle in space. We coordinate with internal and external organizations to maintain the nominal operational state of the constellation. We build automated systems that use satellite telemetry and other relevant data to predict future orbits, plan maneuvers to avoid high risk close approaches with other objects in space, keep satellites in the desired locations, and exchange data with external organizations. We provide visibility information that is used to predict and establish communication channels for Kuiper satellites. We are open to hiring candidates to work out of one of the following locations: Redmond, WA, USA
IN, KA, Bangalore
Appstore Quality tech team builds tools, using AI and engineering techniques to provide the best quality apps to Amazon Appstore users. We are a team of highly-motivated, engaged, and responsive professionals who enable the core testing and quality infrastructure of Amazon Appstore. Come join our team and be a part of history as we deliver results for our customers. Appstore Quality team's mission is to automate all types of functional, non functional, and compliance checks on apps submitted by appstore app developers to enable north star vision of publishing apps in under 5 hours. Our team uses various ML/AI/Generative AI techniques to automatically detect violations in images and text metadata submitted by developers. We are working on ambitious project AI projects such as building LLM, auto navigate a mobile app to detect inside app issues and violations. We are seeking an innovative and technically strong data scientist with a background in optimization, machine learning, and statistical modeling/analysis. This role requires a team member to have strong quantitative modeling skills and the ability to apply optimization/statistical/machine learning methods to complex decision-making problems, with data coming from various data sources. The candidate should have strong communication skills, be able to work closely with stakeholders and translate data-driven findings into actionable insights. The successful candidate will be a self-starter, comfortable with ambiguity, with strong attention to detail and ability to work in a fast-paced and ever-changing environment. This role involves working closely with Sr Data Scientist, Principal engineer, and engineering team to build ML and AL based solutions in meeting our north start vision. Key job responsibilities • Implement statistical methods to solve specific business problems utilizing code (Python, Scala, etc.). • Improve upon existing methodologies by developing new data sources, testing model enhancements, and fine-tuning model parameters. • Collaborate with program management, product management, software developers, data engineering, and business leaders to provide science support, and communicate feedback; develop, test and deploy a wide range of statistical, econometric, and machine learning models. • Build customer-facing reporting tools to provide insights and metrics which track model performance and explain variance. • Communicate verbally and in writing to business customers with various levels of technical knowledge, educating them about our solutions, as well as sharing insights and recommendations. • Earn the trust of your customers by continuing to constantly obsess over their needs and helping them solve their problems by leveraging technology • Excellent prompt engineering skillset with a deep knowledge of LLMs, embeddings, transformer models. • Work with distributed machine learning and statistical algorithms to harness enormous volumes of data at scale to serve our customers About the team In Appstore, “We entertain, and delight, hundreds of millions of people across devices with a vast selection of relevant apps, games, and services by making it trivially easy for developers to deliver”. Appstore team enables the customer and developer flywheel on devices by enabling developers to seamlessly launch and manage their apps/ in-app content on Amazon. It helps customers discover, buy and engage with these apps on Fire TV, Fire Tablets and mobile devices. The technologies we build on vary from device software, to high scale services, to efficient tools for developers. We are open to hiring candidates to work out of one of the following locations: Bangalore, KA, IND
US, NJ, Newark
Employer: Audible, Inc. Title: Data Scientist II Location: One Washington Park, Newark, NJ, 07102 Duties: Design and implement scalable and reliable approaches to support or automate decision making throughout the business. Apply a range of data science techniques and tools combined with subject matter expertise to solve difficult business problems and cases in which the solution approach is unclear. Acquire data by building the necessary SQL / ETL queries. Import processes through various company specific interfaces for accessing RedShift, and S3 / edX storage systems. Build relationships with stakeholders and counterparts, and communicate model outputs, observations, and key performance indicators (KPIs) to the management to develop sustainable and consumable products. Explore and analyze data by inspecting univariate distributions and multivariate interactions, constructing appropriate transformations, and tracking down the source and meaning of anomalies. Build production-ready models using statistical modeling, mathematical modeling, econometric modeling, machine learning algorithms, network modeling, social network modeling, natural language processing, or genetic algorithms. Validate models against alternative approaches, expected and observed outcome, and other business defined key performance indicators. Implement models that comply with evaluations of the computational demands, accuracy, and reliability of the relevant ETL processes at various stages of production. Position reports into Newark, NJ office; however, telecommuting from a home office may be allowed. Requirements: Requires a Master’s in Statistics, Computer Science, Data Science, Machine Learning, Applied Math, Operations Research, Economics, or a related field plus two (2) years of experience as a Data Scientist, Data Engineer, or other occupation/position/job title involving research and data analysis. Experience may be gained concurrently and must include one (1) year in each of the following: - Building statistical models and machine learning models using large datasets from multiple resources - Working with Customer, Content, or Product data modeling and extraction - Using database technologies such as SQL or ETL - Applying specialized modelling software including Python, R, SAS, MATLAB, or Stata. Alternatively, will accept a Bachelor's and four (4) years of experience. Multiple positions. Apply online: www.amazon.jobs Job Code: ADBL157. We are open to hiring candidates to work out of one of the following locations: Newark, NJ, USA
US, CA, Sunnyvale
The Artificial General Intelligence (AGI) team is looking for a highly-skilled Senior Applied Scientist, to lead the development and implementation of cutting-edge algorithms and push the boundaries of efficient inference for Generative Artificial Intelligence (GenAI) models. As a Senior Applied Scientist, you will play a critical role in driving the development of GenAI technologies that can handle Amazon-scale use cases and have a significant impact on our customers' experiences. Key job responsibilities - Design and execute experiments to evaluate the performance of different decoding algorithms and models, and iterate quickly to improve results - Develop deep learning models for compression, system optimization, and inference - Collaborate with cross-functional teams of engineers and scientists to identify and solve complex problems in GenAI - Mentor and guide junior scientists and engineers, and contribute to the overall growth and development of the team We are open to hiring candidates to work out of one of the following locations: Bellevue, WA, USA | Boston, MA, USA | New York, NY, USA | Sunnyvale, CA, USA
US, CA, Sunnyvale
The Artificial General Intelligence (AGI) team is looking for a highly-skilled Senior Applied Scientist, to lead the development and implementation of cutting-edge algorithms and push the boundaries of efficient inference for Generative Artificial Intelligence (GenAI) models. As a Senior Applied Scientist, you will play a critical role in driving the development of GenAI technologies that can handle Amazon-scale use cases and have a significant impact on our customers' experiences. Key job responsibilities - Design and execute experiments to evaluate the performance of different decoding algorithms and models, and iterate quickly to improve results - Develop deep learning models for compression, system optimization, and inference - Collaborate with cross-functional teams of engineers and scientists to identify and solve complex problems in GenAI - Mentor and guide junior scientists and engineers, and contribute to the overall growth and development of the team We are open to hiring candidates to work out of one of the following locations: Bellevue, WA, USA | Boston, MA, USA | New York, NY, USA | Sunnyvale, CA, USA
US, WA, Bellevue
Want to be part of the team whose mission is to expand Alexa to new countries, languages, devices and cultures? The Alexa International team makes it happen. Our customers are very diverse in where they live, the languages they speak to Alexa, the devices they use and the content that matters most. In turn, our problems are diverse and need innovative solutions. We are seeking a Senior Applied Science Manager who will play a key role in the next generation of AI powered Conversational Assistants. Key job responsibilities Lead and manage a team of applied and research scientists responsible for building multilingual experiences Collaborate with cross-functional teams to ensure that Amazon’s AI models are aligned with human preferences. Identify and prioritize research opportunities that have the potential to significantly impact our AI systems. Mentor and guide team members to achieve their career goals and objectives. Communicate research findings and progress to senior leadership and stakeholders. We are open to hiring candidates to work out of one of the following locations: Bellevue, WA, USA