The science behind the improved Fire TV voice search

How phonetically blended results (PBR) help ensure customers find the content they were actually asking for.

Put your hand up if you enjoy using your TV remote to type in the name of the show you want to watch next. Who doesn’t love shuffling the highlighted box across the screen, painstakingly selecting each letter in turn? And let’s not forget the joy of accidentally selecting a wrong letter.

Such text-based search works, but it can feel like a chore. It’s much easier and faster to just ask for what you want. With Amazon’s Fire TV, you can ask the Alexa voice assistant to find your favorite shows, movies, movie genres, actors … you name it.

But voice-based search can come with its own frustrations. What if Alexa misheard a request for the TV show Hunted as “haunted” and as a result presented a spooky screenful of incorrect suggestions?

Related content
The phrase launches a feature built to help customers navigate an increasingly complex and diverse world of content.

This is a story of how two groups at Amazon — the Fire TV Search team and the Alexa Entertainment Spoken Language Understanding team — collaborated to launch an improved Fire TV voice search experience in the U.S. in November 2022.

The new search system gives customers a greater chance of finding what they are looking for, on their first attempt, by casting the search net a little wider — and a little smarter. It works by harnessing a suite of Alexa machine learning (ML) models to generate additional, similar-sounding words to inject into Fire TV’s search function to broaden the scope of the results presented to the customer. Hence its name: phonetically blended results (PBR). Today, about 80% of the 20 million or so unique search terms that Fire TV deals with are augmented by PBR.

To better understand PBR and why it was needed, let’s look at one reason the previous version of Fire TV voice search could get things wrong. A customer, in a noisy room full of excited children, holds down the microphone button on the Alexa Voice Remote and simply says “Find Encanto”.

Image is a screengrab of a Fire TV screen which includes Encanto and several similar sounding results
Phonetically blended results give customers a greater chance of finding what they are looking for, on the first attempt, by harnessing a suite of machine learning models to generate additional, similar-sounding words to inject into Fire TV’s search function.

This piece of audio first goes to Alexa’s automatic-speech-recognition (ASR) system to be converted to text. But in this case, the system mishears the customer utterance and converts it to “Find Encounter”.

Fire TV’s search algorithm, known as ReRanker, faithfully performs the erroneous search and presents the customer with a selection of content with the word “encounter” in the title or description, prominently featuring, for example the Amazon original movie Encounter or popular TV shows that include that word. Encanto is nowhere to be seen. The customer sighs, asks the kids to pipe down, presses the microphone button and tries again. Or they resort to the very method they were trying to avoid in the first place: typing with the remote.

One challenge here is that because Alexa supports myriad applications, its ASR system is necessarily generalized.

“Previously, Alexa was not tuned into individual Fire TV customers’ preferences,” says Kanna Shimizu, senior manager of research science in Alexa AI’s Natural Understanding (NU) group, who led the PBR project. “That's the layer my team is adding. We are connecting Alexa machine learning with Fire TV search algorithms to build toward an end-to-end algorithm to help customers find what they’re looking for.”

Related content
A behind-the-scenes look at the unique challenges the engineering teams faced, and how they used scientific research to drive fundamental innovation to overcome those challenges.

The reason the voice search for Encanto failed is that the search process decided early on that “encounter” was the customer’s intended search query, so “Encanto” wasn’t even searched for.

“The big change that PBR introduced was to say, ‘Actually, the customer might have said or meant this other thing, but we’re not sure, so let’s search for both,’” says Shimizu. “Let’s keep the door open to different interpretations of what the customer may have said, so they can decide for themselves on the search results screen.”

How would our customer example look now? The search results page will now show Encanto as an option in addition to Encounter.

Building this keep-your-options-open approach into Fire TV voice search was complex for several reasons. One challenge is generating appropriate additional search candidates that are phonetically similar to the customer’s utterance. The next was changing Fire TV’s ReRanker algorithm, already a high-performing recommender system, to utilize the PBR system’s suggested search candidates when delivering results to the customer.

It's really a two-way communication. We use Alexa models to improve the performance of Fire TV and we use Fire TV customer signals to improve the performance of Alexa models. It’s a very cool learning loop.
Mingxian Wang

The PBR system addresses the first challenge in multiple ways. Most of the additional search candidates come from corrective actions taken by customers themselves. That’s because when a customer’s voice search fails to deliver what they are looking for, about 40% of the time they will try voice search again or type what they are looking for, leading to a successful viewing. Knowing the initial mistaken search term and the final successful one allows the PBR system to, for example, map the search candidate “Encounter” onto the additional search candidate “Encanto”.

That self-correction process is how PBR learned that the search term “hunted” sometimes represents a search for the 2018 Netflix reality series Haunted.

The PBR system can make these useful connections in part because it contains knowledge of the wider world via the Alexa Teacher Model, a large language model trained on enormous amounts of Internet data and subsequently fine-tuned with data including Fire TV voice traffic and customer self-corrections.

“It's really a two-way communication,” says Mingxian Wang, senior applied scientist at Alexa AI-NU. “We use Alexa models to improve the performance of Fire TV and we use Fire TV customer signals to improve the performance of Alexa models. It’s a very cool learning loop.”

Besides the Alexa Teacher Model and the model that learns from customers’ on-screen search behavior, the PBR system also uses an Alexa model that identifies phonetic variations for popular titles, to further enrich its search results.

Related content
New approach speeds graph-based search by 20% to 60%, regardless of graph construction method.

Using a mixture of these three models, by the time it launched in late 2022, the PBR system had already generated millions of search-query mappings, such as “Encounter” to “Encanto” — and that number continues to grow. Here’s another example. To avoid Alexa mishearing “Zatima”, a popular new show and a novel word unknown to ASR, as “Fatima”, which is a movie and also a city in Portugal, PBR’s models suggests that Zatima also be presented along with Fatima.

“In this way, we serve the customer who wanted the new show and also don’t break the customer experience for those searching for the movie,” says Wang.

“It’s a subtle balance”

It's one thing to suggest additional results to ReRanker. It’s another to change the algorithm to take PBR’s suggestions and present these results to customers. And if it does, how should it rank them on the results screen?

The teams solved this problem by inventing the PBR confidence score. With every search-query mapping, the PBR system provides ReRanker with a prediction of how likely the customer is to click on that result.

“We want customers to see our alternatives but don’t want to boost them higher than might be warranted, because we want to avoid overwhelming customers with irrelevant search results,” says Shimizu. “It’s a subtle balance, and that scoring mechanism was the key to making this whole thing succeed.”

Related content
Dataset that requires question-answering models to look up multiple facts and perform comparisons bridges a significant gap in the field.

To illustrate this subtlety, consider the search term “Enchanted” (a fairy-tale movie). The PBR system estimates that search results based on this term will deliver a customer clickthrough rate (i.e., a successful search) of 60%. So this should be the most prominently displayed result.

But the search term “enchanted” also triggers several PBR candidates — “Encanto” (with an anticipated clickthrough rate of 20%) and “Disenchanted” (5%). You can see that by blending these similar-sounding shows into its results, ReRanker is more likely to strike gold for the customer.

“In testing, we saw the ReRanker model picking up on the PBR confidence score and boosting those search results higher. It learned that this feature was worth paying attention to,” says Aleksandr Kulikov, a principal software engineer at Fire TV.

“The Fire TV voice search is already successful for most customer voice searches — it’s easy to deliver popular searches like ‘Jack Ryan’ correctly — but for some customers, PBR is significantly improving their voice search experience,” says Kulikov. Where it makes the biggest difference is, of course, in ambiguous searches, where it can boost customer clickthroughs by 10% or more. “A gain of 10% is like, wow, that’s significant,” Kulikov adds.

And it will only get better with time. The Alexa and Fire TV teams are working toward a feedback learning system that will allow PBR’s models to automatically generate new search candidates, prune ineffective ones, and home in on increasingly accurate confidence scores.

Ultimately, bringing the power of multiple Alexa machine learning models to bear on Fire TV voice search is helping to give Amazon customers what they want the first time, more of the time, through a greater understanding of diverse voices and of the world itself. Hands up if you like the sound of that.

Related content

US, WA, Seattle
Amazon's Global Fixed Marketing Campaign Measurement & Optimization (CMO) team is looking for a senior economic expert in causal inference and applied ML to advance the economic measurement, accuracy validation and optimization methodologies of Amazon's global multi-billion dollar fixed marketing spend. This is a thought leadership position to help set the long-term vision, drive methods innovation, and influence cross-org methods alignment. This role is also an expert in modeling and measuring marketing and customer value with proven capacity to innovate, scale measurement, and mentor talent. This candidate will also work closely with senior Fixed Marketing tech, product, finance and business leadership to devise science roadmaps for innovation and simplification, and adoption of insights to influence important resource allocation, fixed marketing spend and prioritization decisions. Excellent communication skills (verbal and written) are required to ensure success of this collaboration. The candidate must be passionate about advancing science for business and customer impact. Key job responsibilities - Advance measurement, accuracy validation, and optimization methodology within Fixed Marketing. - Motivate and drive data generation to size. - Develop novel, innovative and scalable marketing measurement techniques and methodologies. - Enable product and tech development to scale science solutions and approaches. A day in the life - Propose and refine economic and scientific measurement, accuracy validation, and optimization methodology to improve Fixed Marketing models, outputs and business results - Brief global fixed marketing and retails executives about FM measurement and optimization approaches, providing options to address strategic priorities. - Collaborate with and influence the broader scientific methodology community. About the team CMO's vision is to maximizing long-term free cash flow by providing reliable, accurate and useful global fixed marketing measurement and decision support. The team measures and helps optimize the incremental impact of Amazon (Stores, AWS, Devices) fixed marketing investment across TV, Digital, Social, Radio, and many other channels globally. This is a fully self supported team composed of scientists, economists, engineers, and product/program leaders with S-Team visibility. We are open to hiring candidates to work out of one of the following locations: Irvine, CA, USA | San Francisco, CA, USA | Seattle, WA, USA | Sunnyvale, CA, USA
GB, Cambridge
Our team builds generative AI solutions that will produce some of the future’s most influential voices in media and art. We develop cutting-edge technologies with Amazon Studios, the provider of original content for Prime Video, with Amazon Game Studios and Alexa, the ground-breaking service that powers the audio for Echo. Do you want to be part of the team developing the future technology that impacts the customer experience of ground-breaking products? Then come join us and make history. We are looking for a passionate, talented, and inventive Applied Scientist with a background in Machine Learning to help build industry-leading Speech, Language, Audio and Video technology. As an Applied Scientist at Amazon you will work with talented peers to develop novel algorithms and generative AI models to drive the state of the art in audio (and vocal arts) generation. Position Responsibilities: * Participate in the design, development, evaluation, deployment and updating of data-driven models for digital vocal arts applications. * Participate in research activities including the application and evaluation and digital vocal and video arts techniques for novel applications. * Research and implement novel ML and statistical approaches to add value to the business. * Mentor junior engineers and scientists. We are open to hiring candidates to work out of one of the following locations: Cambridge, GBR
US, TX, Austin
The Workforce Solutions Analytics and Tech team is looking for a senior Applied Scientist who is interested in solving challenging optimization problems in the labor scheduling and operations efficiency space. We are actively looking to hire senior scientists to lead one or more of these problem spaces. Successful candidates will have a deep knowledge of Operations Research and Machine Learning methods, experience in applying these methods to large-scale business problems, the ability to map models into production-worthy code in Python or Java, the communication skills necessary to explain complex technical approaches to a variety of stakeholders and customers, and the excitement to take iterative approaches to tackle big research challenges. As a member of our team, you'll work on cutting-edge projects that directly impact over a million Amazon associates. This is a high-impact role with opportunities to designing and improving complex labor planning and cost optimization models. The successful candidate will be a self-starter comfortable with ambiguity, with strong attention to detail and outstanding ability in balancing technical leadership with strong business judgment to make the right decisions about model and method choices. Successful candidates must thrive in fast-paced environments, which encourage collaborative and creative problem solving, be able to measure and estimate risks, constructively critique peer research, and align research focuses with the Amazon's strategic needs. Key job responsibilities • Candidates will be responsible for developing solutions to better manage and optimize flexible labor capacity. The successful candidate should have solid research experience in one or more technical areas of Operations Research or Machine Learning. As a senior scientist, you will also help coach/mentor junior scientists on the team. • In this role, you will be a technical leader in applied science research with significant scope, impact, and high visibility. You will lead science initiatives for strategic optimization and capacity planning. They require superior logical thinkers who are able to quickly approach large ambiguous problems, turn high-level business requirements into mathematical models, identify the right solution approach, and contribute to the software development for production systems. • Invent and design new solutions for scientifically-complex problem areas and identify opportunities for invention in existing or new business initiatives. • Successfully deliver large or critical solutions to complex problems in the support of medium-to-large business goals. • Apply mathematical optimization techniques and algorithms to design optimal or near optimal solution methodologies to be used for labor planning. • Research, prototype, simulate, and experiment with these models and participate in the production level deployment in Python or Java. We are open to hiring candidates to work out of one of the following locations: Arlington, VA, USA | Austin, TX, USA | Bellevue, WA, USA | Nashville, TN, USA | Seattle, WA, USA | Tempe, AZ, USA
US, NY, New York
Where will Amazon's growth come from in the next year? What about over the next five? Which product lines are poised to quintuple in size? Are we investing enough in our infrastructure, or too much? How do our customers react to changes in prices, product selection, or delivery times? These are among the most important questions at Amazon today. The Topline Forecasting team in the Supply Chain Optimization Technologies (SCOT) group is looking for innovative, passionate and results-oriented Economists to answer these questions. You will have an opportunity to own the long-run outlook for Amazon’s global consumer business and shape strategic decisions at the highest level. The successful candidate will be able to formalize problem definitions from ambiguous requirements, build econometrics models using Amazon’s world-class data systems, and develop cutting-edge solutions for non-standard problems. Key job responsibilities · Develop new econometric models or improve existing approaches using scalable techniques. · Extract data for analysis and model development from large, complex datasets. · Closely work with engineering teams to build scalable, efficient systems that implement prototypes in production. · Apply economic theory to solve business problems in a fast moving environment. · Distill problem definitions from informal business requirements and communicate technical solutions to senior business leaders. · Drive innovation and best practices in applied research across the Amazon research science community. We are open to hiring candidates to work out of one of the following locations: New York, NY, USA
US, WA, Bellevue
We are seeking a passionate, talented, and inventive individual to join the Applied AI team and help build industry-leading technologies that customers will love. This team offers a unique opportunity to make a significant impact on the customer experience and contribute to the design, architecture, and implementation of a cutting-edge product. Key job responsibilities On our team you will push the boundaries of ML and Generative AI techniques to scale the inputs for hundreds of billions of dollars of annual revenue for our eCommerce business. If you have a passion for AI technologies, a drive to innovate and a desire to make a meaningful impact, we invite you to become a valued member of our team. We are seeking an experienced Scientist who combines superb technical, research, analytical and leadership capabilities with a demonstrated ability to get the right things done quickly and effectively. This person must be comfortable working with a team of top-notch developers and collaborating with our research teams. We’re looking for someone who innovates, and loves solving hard problems. You will be expected to have an established background in building highly scalable systems and system design, great communication skills, and a motivation to achieve results in a fast-paced environment. You should be somebody who enjoys working on complex problems, is customer-centric, and feels strongly about building good software as well as making that software achieve its operational goals. A day in the life You will be responsible for developing and maintaining the systems and tools that enable us to accelerate knowledge operations and work in the intersection of Science and Engineering. You will push the boundaries of ML and Generative AI techniques to scale the inputs for hundreds of billions of dollars of annual revenue for our eCommerce business. If you have a passion for AI technologies, a drive to innovate and a desire to make a meaningful impact, we invite you to become a valued member of our team. About the team The mission of the Applied AI team is to enable organizations within Worldwide Amazon.com Stores to accelerate the adoption of AI technologies across various parts of our business. We are looking for an Applied Scientist to join our Applied AI team to work on LLM-based solutions. We are open to hiring candidates to work out of one of the following locations: Bellevue, WA, USA
US, WA, Bellevue
We are seeking a passionate, talented, and inventive individual to join the Applied AI team and help build industry-leading technologies that customers will love. This team offers a unique opportunity to make a significant impact on the customer experience and contribute to the design, architecture, and implementation of a cutting-edge product. The mission of the Applied AI team is to enable organizations within Worldwide Amazon.com Stores to accelerate the adoption of AI technologies across various parts of our business. We are looking for a Senior Applied Scientist to join our Applied AI team to work on LLM-based solutions. We are seeking an experienced Scientist who combines superb technical, research, analytical and leadership capabilities with a demonstrated ability to get the right things done quickly and effectively. This person must be comfortable working with a team of top-notch developers and collaborating with our research teams. We’re looking for someone who innovates, and loves solving hard problems. You will be expected to have an established background in building highly scalable systems and system design, excellent project management skills, great communication skills, and a motivation to achieve results in a fast-paced environment. You should be somebody who enjoys working on complex problems, is customer-centric, and feels strongly about building good software as well as making that software achieve its operational goals. Key job responsibilities You will be responsible for developing and maintaining the systems and tools that enable us to accelerate knowledge operations and work in the intersection of Science and Engineering. A day in the life On our team you will push the boundaries of ML and Generative AI techniques to scale the inputs for hundreds of billions of dollars of annual revenue for our eCommerce business. If you have a passion for AI technologies, a drive to innovate and a desire to make a meaningful impact, we invite you to become a valued member of our team. We are open to hiring candidates to work out of one of the following locations: Bellevue, WA, USA
US, MD, Virtual Location - Maryland
We are looking for detail-oriented, organized, and responsible individuals who are eager to learn how to work with large and complicated data sets. Some knowledge of econometrics, as well as basic familiarity with Python is necessary, and experience with SQL and UNIX would be a plus. This is a part time position, 29 hours per week, with compensation being awarded on an hourly basis. You will learn how to build data sets and perform applied econometric analysis at Internet speed collaborating with economists, scientists, and product managers. These skills will translate well into writing applied chapters in your dissertation and provide you with work experience that may help you with placement. Roughly 85% of previous cohorts have converted to full time economics employment at Amazon. If you are interested, please send your CV to our mailing list at econ-internship@amazon.com. We are open to hiring candidates to work out of one of the following locations: Virtual Location - MD
US, WA, Bellevue
We are seeking a passionate, talented, and inventive individual to join the Applied AI team and help build industry-leading technologies that customers will love. This team offers a unique opportunity to make a significant impact on the customer experience and contribute to the design, architecture, and implementation of a cutting-edge product. The mission of the Applied AI team is to enable organizations within Worldwide Amazon.com Stores to accelerate the adoption of AI technologies across various parts of our business. We are looking for a Senior Applied Scientist to join our Applied AI team to work on LLM-based solutions. We are seeking an experienced Scientist who combines superb technical, research, analytical and leadership capabilities with a demonstrated ability to get the right things done quickly and effectively. This person must be comfortable working with a team of top-notch developers and collaborating with our research teams. We’re looking for someone who innovates, and loves solving hard problems. You will be expected to have an established background in building highly scalable systems and system design, excellent project management skills, great communication skills, and a motivation to achieve results in a fast-paced environment. You should be somebody who enjoys working on complex problems, is customer-centric, and feels strongly about building good software as well as making that software achieve its operational goals. Key job responsibilities You will be responsible for developing and maintaining the systems and tools that enable us to accelerate knowledge operations and work in the intersection of Science and Engineering. You will push the boundaries of ML and Generative AI techniques to scale the inputs for hundreds of billions of dollars of annual revenue for our eCommerce business. If you have a passion for AI technologies, a drive to innovate and a desire to make a meaningful impact, we invite you to become a valued member of our team. We are open to hiring candidates to work out of one of the following locations: Bellevue, WA, USA
IN, KA, Bengaluru
Amazon strives to be Earth's most customer-centric company where people can find and discover virtually anything they want to buy online. By giving customers more of what they want - low prices, vast selection, and convenience - Amazon continues to grow and evolve as a world-class e-commerce platform. The AOP team is an integral part of this and strives to provide Analytical Capabilities to fulfil all customer processes in the IN-ECCF regions. We’re seeking a Data Scientist with expertise in a breadth of ML techniques. Your responsibilities will include developing, prototyping and productionizing innovative models using a range of techniques (Supervised/Unsupervised/Reinforcement). We are also looking for innovators capable of using generative AI to design, evangelize, and implement state-of-the-art solutions for never-before-solved problems. Key job responsibilities - Demonstrate thorough technical knowledge on feature engineering of massive datasets, effective exploratory data analysis, and model building using industry standard AI/ML models and working with Large Language Models - Proficiency in both Supervised(Linear/Logistic Regression) and UnSupervised algorithms(k means clustering) - Understand the business reality behind large sets of data and develop meaningful solutions comprising of analytics as well as marketing management. - Work closely with internal stakeholders like the business teams, engineering teams and partner teams and align them with respect to your focus area - Innovate by adapting new modeling techniques and procedures - Passionate about working with huge data sets ( training/fine tuning) and be someone who loves to bring datasets together to answer business questions. You should have deep expertise in creation and management of datasets - Exposure at implementing and operating stable, scalable data flow solutions from production systems into end-user facing applications/reports. These solutions will be fault tolerant, self-healing and adaptive. - Work with distributed machine learning and statistical algorithms to harness enormous volumes of data at scale to serve our customers We are open to hiring candidates to work out of one of the following locations: Bengaluru, KA, IND | Hyderabad, TS, IND
DE, Aachen
The Artificial General Intelligence (AGI) team is looking for a passionate, talented, and inventive Senior Applied Scientist with a strong deep learning background, to build industry-leading Generative Artificial Intelligence (GenAI) technology with Large Language Models (LLMs) and multimodal systems. Key job responsibilities As a Senior Applied Scientist with the AGI team, you will work with talented peers to lead the development of novel algorithms and modeling techniques, to advance the state of the art with LLMs. Your work will directly impact our customers in the form of products and services that make use of speech and language technology. You will leverage Amazon’s heterogeneous data sources and large-scale computing resources to accelerate advances in spoken language understanding. About the team The AGI team has a mission to push the envelope in GenAI with LLMs and multimodal systems, in order to provide the best-possible experience for our customers. We are open to hiring candidates to work out of one of the following locations: Aachen, DEU