“Building a model that can save as many lives as possible”

How ARA recipient Supreeth Shashikumar is using machine learning to help hospitals detect sepsis — before it’s too late.

Sometimes, good luck wears bad luck's clothing — and that was certainly the case in 2015 for the young electrical and computer engineer Supreeth Shashikumar, when his hunt for a PhD project came up empty. At the Georgia Institute of Technology, no professors were looking for students with his specialty — speech recognition and voice processing.

Supreeth_photo.jpeg
Supreeth Shashikumar, a research scientist at the University of California, San Diego, whose Amazon Research Award supports his research on using machine learning models to predict the onset of sepsis.

So Shashikumar decided to cast his net into a different field and came across a visionary mentor in Shamim Nemati, then an assistant professor in the Department of Biomedical Informatics at the Emory University in Atlanta, where he was also leading the Nemati Lab.

Today, Shashikumar and Nemati are continuing their collaboration at the University of California, San Diego (UCSD), and the medical technology they developed together, supported by an Amazon Research Award, is safeguarding and saving the lives of hospital patients. But let's rewind for a moment.

Back in 2015, when Shashikumar was still stymied in his search, he shifted his focus slightly from speech recognition to the somewhat similar field of biomedical signal processing. The fields overlap in that both are reliant on time series data, such as voice recordings or electrocardiograms.

Shashikumar saw that the Nemati Lab was pioneering the use of time series data held in hospital patients' electronic health records (EHRs) to develop early-warning systems that can aid clinicians by flagging patients who may be on the verge of sudden deterioration. Shashikumar found that to be a compelling idea, so he reached out to Nemati.

“It was a gamble, but it paid off,” Shashikumar says.

Taking on Shashikumar was an easy decision, says Nemati. “Georgia Tech produces some of the best engineers in the country. Add to that somebody who also enjoys bungee jumping and at the same time is extremely detail-oriented, and you’ll get a 10x engineer with a desire to push himself to the limits.” There they started on a multiyear journey toward the development of clinically actionable predictive models in healthcare. Shashikumar would later move with the Nemati Lab to its current home at UCSD.

Related content
Method enforces “coherence” of hierarchical time series, in which the values at each level of the hierarchy are sums of the values at the level below.

The lab’s main focus is the onset of sepsis in hospital emergency departments (EDs), wards, and intensive-care units (ICUs). Sepsis is a sudden and life-threatening condition caused by an infection entering the bloodstream, triggering a catastrophic immune response that can lead to organ failure, septic shock, and death. It is a medical emergency that requires early and aggressive treatment with antibiotics. According to the US-based Sepsis Alliance, for every hour that treatment is delayed, the chance of sepsis moving through severe sepsis to septic shock and death rises by 4%-9%.

ED clinicians are constantly monitoring for signs of sepsis, such as fever and elevated heart rate or respiratory rate. When they suspect sepsis, they order lab tests to look for markers of organ damage. Thus detection, particularly early detection, is crucial.

Shashikumar was drawn to the fact that the Nemati Lab was focused on developing deployable technology. Many researchers take historical time-series patient data from single hospitals and create models to make predictions based on that data, but there is often a chasm between theory and practical deployment, due to the many challenges of working in the healthcare space.

“For us, whenever we pick a project, we are interested in how we can deploy a model into the real world, to do some good by making it clinically actionable,” says Shashikumar. “And, crucially, also make it generalizable.”

This generalizability of medical models is critical if machine learning is to realize its enormous potential benefit to patients.

“Generalizability is about ensuring that your claims about the performance of your model hold in other healthcare system settings,” says Nemati.

For example, say a machine learning model was trained to successfully predict the onset of sepsis in patients in hospital A, using data from that hospital. Could that model then be usefully applied to hospital B with different patient demographics, standards of care, and testing and monitoring procedures? And could it generalize again to hospitals C and D, too? It is an incredibly difficult challenge and one of the reasons for the chasm between research and implementation.

WUPERR

In 2022, in Nature Scientific Reports, Shashikumar and the Nemati team demonstrated that it was indeed possible, with a model called WUPERR (weight uncertainty propagation and episodic representation replay). The model was trained on the EHR data of more than 104,000 patients across four separate healthcare systems. The patient data included over 40 inputs, including ongoing vital signs such as blood pressure and pulse rate, lab test results such as lactate levels in the blood and white blood cell count, patient age, and comorbidities, such as cancer or liver failure.

Related content
ARA recipient Marinka Zitnik is focused on how machine learning can enable accurate diagnoses and the development of new treatments and therapies.

The model overcame several big scientific and practical hurdles inherent to generalization across multiple hospitals: "catastrophic forgetting" and the necessity of keeping patient data confidential.

Catastrophic forgetting is a common problem with transfer learning. When a predictive model is successfully trained on one hospital (hospital A) and then transferred to the next (hospital B), the process will often involve fine-tuning the model on data from hospital B, as no two hospitals are the same. There's a risk, however, that the introduction of new hospital B data will lead the model to "forget" what it learned from hospital A's data.

In theory, one could keep the original model for hospital A and use the fine-tuned model for hospital B, and so on for hospitals C and D. However, not only is this approach impractical, but it also presents a daunting level of regulatory hurdles, according to Shashikumar. Having to deal with a growing number of different models, each of which must meet FDA evaluation and regulation, is simply not scalable.

WUPERR, however, tested a different solution using historical hospital data — a technique called "elastic weight consolidation". This approach echoes a concept found in cognitive neuroscience, according to Shashikumar.

"There are a bunch of neurons in your brain that are trained in the tasks you’ve learned,” he explains. “When you learn a new, similar task, you build on your previous experience — but you don't interfere with those neurons. Instead, you teach additional neurons the nuances of the new task."

With this approach — but with neurons replaced by adjustable model parameters — the team was able to maintain high accuracy in their sepsis predictions across the board with every new hospital added to the pool. By the end, the very accurate sepsis predictions for four hospitals were successfully produced by one model — an important advance.

Related content
Learning the complete quantile function, which maps probabilities to variable values, rather than building separate models for each quantile level, enables better optimization of resource trade-offs.

You may now wonder: How could this model share fiercely protected patient data between separate hospitals? This is the second hurdle that WUPERR overcame, using a technique called “episodic representation replay.” In simple terms, this means that when the model was trained on hospital A's patient data, that data is passed through a neural network that strips away all patient identifiers and creates a representation of the data that is safe to share. The representations of the data are then shown to the model while training at the next hospital.

"I believe this was the first application of sharing neural-network representations from an older hospital with a new hospital in the context of sepsis prediction," says Shashikumar.

The result of all this is a single, manageable model that can generalize across a whole set of hospitals, with all the institutions involved benefiting from each other's patient data while never actually having access to it.

“There is beauty in generalizable knowledge and generalizable models, like a unified theory of everything,” says Nemati.

Things get real

Today the latest iteration of WUPERR is in live action in the ED of a UC San Diego Health hospital, providing clinicians with early warnings about patients predicted to develop sepsis in the next four hours. This version of WUPERR has also been augmented with, among other things, a statistical model that monitors its input data for quality, helping to reduce false alarms.

Related content
With the support of an Amazon Research Award, Papoian’s team is deciphering the dynamics of intrinsically disordered proteins.

That’s important because false alarms are a big problem in sepsis detection. The hospital’s previous, less sophisticated system had a high rate of false alarms. Working with clinicians at the hospital, Shashikumar and his colleagues were able to tune WUPERR to predict 60% of all sepsis events. In the closely monitored environment of the ED, clinicians are expected to catch some portion of the sepsis cases with obvious signs and symptoms, and WUPERR provides a second pair of eyes to provide earlier warning and potentially catch additional cases of sepsis. What is critical to the clinicians is that false alarms, and the burdens they entail, remain low. While about half of WUPERR’s predictions were false alarms, that rate is relatively low, given the seriousness of sepsis.

Missed detections are also of great concern and are often attributable to patient complexity, inadequate monitoring, and low availability of data. Here, the team is applying active sensing to make timely recommendations for collecting sepsis-specific biomarkers in high-risk patients. The latest generation of the system combines false-alarm reduction with active sensing to achieve state-of-the-art performance.

The system has been in place for four months, with data collection ongoing. The clinicians in the ED have reported that, on average, the alarm is going off an hour or two earlier than when the doctors would have started to suspect an infection.

“They’re happy with that performance, particularly the lower false-alarm rate. It’s a very good validation of our work,” says Shashikumar. “But we still have a long way to go. In time, we want to extend this to other hospitals, intensive-care units, and hospital wards across the US and the world.”

The scaling up of this life-saving service is made easier by the fact that WUPERR is entirely cloud-based and hosted on Amazon Web Services.

Related content
New method goes beyond Granger causality to identify only the true causes of a target time series, given some graph constraints.

“Using AWS services has been great for us," says Shashikumar. "Our sepsis software is running in real time in the hospital lab, and that’s mission-critical — it has to be up and running 100% of the time, without fail.” The team makes use of a wide range of AWS services, including autoscaling, load balancing, fault tolerance, and CloudWatch alarms.

Deploying the model in different locations is also greatly simplified. AWS provides HIPAA-compliant infrastructure, which is legally required to protect private health data transmitted to the cloud.

In fact, when the Nemati Lab moved to UC San Diego, they had to decide whether to buy their own in-house servers or move to the cloud. They moved the entirety of their computing services to AWS. “It has been super convenient,” says Shashikumar.

Last year, Nemati's team, including Shashikumar, co-founded Healcisio, a startup, as part of an effort to commercialize their model and ultimately receive FDA clearance, which will be essential for deploying the system to multiple hospitals in the US and abroad.

Meanwhile, they have great ambitions to improve the model. For now, it is limited to the time series data in EHRs. But the team’s current focus is on multimodal data, including wearable sensors, clinical notes, imaging, and more. They want their model to see everything a clinician has access to when they treat patients — all the contextual information — and additionally address “data deserts” via continuous monitoring of patients and active sensing.

Increasing the sensitivity of the model and reducing its false-alarm rate even further is the ultimate goal.

“At the end of the day, our focus is on building a model that can save as many lives as possible,” Shashikumar said. “I didn't get into healthcare out of passion, but it has become my passion.”

Research areas

Related content

KR, Seoul
Are you looking to work at the forefront of Machine Learning and AI? Would you be excited to apply Generative AI algorithms to solve real world problems with significant impact? The Generative AI Innovation Center helps AWS customers implement Generative AI solutions and realize transformational business opportunities. This is a team of strategists, scientists, engineers, and architects working step-by-step with customers to build bespoke solutions that harness the power of generative AI. Starting in 2024, the Innovation Center launched a new Custom Model and Optimization program to help customers develop and scale highly customized generative AI solutions. The team helps customers imagine and scope bespoke use cases that will create the greatest value for their businesses, define paths to navigate technical or business challenges, develop and optimize models to power their solutions, and make plans for launching solutions at scale. The GenAI Innovation Center team provides guidance on best practices for applying generative AI responsibly and cost efficiently. You will work directly with customers and innovate in a fast-paced organization that contributes to game-changing projects and technologies. You will design and run experiments, research new algorithms, and find new ways of optimizing risk, profitability, and customer experience. We’re looking for Applied Scientists capable of using GenAI and other techniques to design, evangelize, and implement state-of-the-art solutions for never-before-solved problems. As an Applied Scientist, you will - Collaborate with AI/ML scientists and architects to research, design, develop, and evaluate generative AI solutions to address real-world challenges - Interact with customers directly to understand their business problems, aid them in implementation of generative AI solutions, brief customers and guide them on adoption patterns and paths to production - Help customers optimize their solutions through approaches such as model selection, training or tuning, right-sizing, distillation, and hardware optimization - Provide customer and market feedback to product and engineering teams to help define product direction About the team Diverse Experiences AWS values diverse experiences. Even if you do not meet all of the qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying. Why AWS? Amazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating — that’s why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses. Inclusive Team Culture Here at AWS, it’s in our nature to learn and be curious. Our employee-led affinity groups foster a culture of inclusion that empower us to be proud of our differences. Ongoing events and learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon (diversity) conferences, inspire us to never stop embracing our uniqueness. Mentorship & Career Growth We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional. Work/Life Balance We value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why we strive for flexibility as part of our working culture. When we feel supported in the workplace and at home, there’s nothing we can’t achieve in the cloud.
CN, 31, Shanghai
Are you looking to work at the forefront of Machine Learning and AI? Would you be excited to apply Generative AI algorithms to solve real world problems with significant impact? The Generative AI Innovation Center helps AWS customers implement Generative AI solutions and realize transformational business opportunities. This is a team of strategists, scientists, engineers, and architects working step-by-step with customers to build bespoke solutions that harness the power of generative AI. Starting in 2024, the Innovation Center launched a new Custom Model and Optimization program to help customers develop and scale highly customized generative AI solutions. The team helps customers imagine and scope bespoke use cases that will create the greatest value for their businesses, define paths to navigate technical or business challenges, develop and optimize models to power their solutions, and make plans for launching solutions at scale. The GenAI Innovation Center team provides guidance on best practices for applying generative AI responsibly and cost efficiently. You will work directly with customers and innovate in a fast-paced organization that contributes to game-changing projects and technologies. You will design and run experiments, research new algorithms, and find new ways of optimizing risk, profitability, and customer experience. We’re looking for Applied Scientists capable of using GenAI and other techniques to design, evangelize, and implement state-of-the-art solutions for never-before-solved problems. As an Applied Scientist, you will - Collaborate with AI/ML scientists and architects to research, design, develop, and evaluate generative AI solutions to address real-world challenges - Interact with customers directly to understand their business problems, aid them in implementation of generative AI solutions, brief customers and guide them on adoption patterns and paths to production - Help customers optimize their solutions through approaches such as model selection, training or tuning, right-sizing, distillation, and hardware optimization - Provide customer and market feedback to product and engineering teams to help define product direction About the team Diverse Experiences AWS values diverse experiences. Even if you do not meet all of the qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying. Why AWS? Amazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating — that’s why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses. Inclusive Team Culture Here at AWS, it’s in our nature to learn and be curious. Our employee-led affinity groups foster a culture of inclusion that empower us to be proud of our differences. Ongoing events and learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon (diversity) conferences, inspire us to never stop embracing our uniqueness. Mentorship & Career Growth We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional. Work/Life Balance We value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why we strive for flexibility as part of our working culture. When we feel supported in the workplace and at home, there’s nothing we can’t achieve in the cloud.
US, CA, Pasadena
We’re on the lookout for the curious, those who think big and want to define the world of tomorrow. At Amazon, you will grow into the high impact, visionary person you know you’re ready to be. Every day will be filled with exciting new challenges, developing new skills, and achieving personal growth. How often can you say that your work changes the world? At Amazon, you’ll say it often. Join us and define tomorrow. The Amazon Web Services (AWS) Center for Quantum Computing (CQC) in Pasadena, CA, is looking for a Quantum Research Scientist Intern in the Device and Architecture Theory group. You will be joining a multi-disciplinary team of scientists, engineers, and technicians, all working at the forefront of quantum computing to innovate for the benefit of our customers. Key job responsibilities As an intern with the Device and Architecture Theory team, you will conduct pathfinding theoretical research to inform the development of next-generation quantum processors. Potential focus areas include device physics of superconducting circuits, novel qubits and gate schemes, and physical implementations of error-correcting codes. You will work closely with both theorists and experimentalists to explore these directions. We are looking for candidates with excellent problem-solving and communication skills who are eager to work collaboratively in a team environment. Amazon Science gives you insight into the company’s approach to customer-obsessed scientific innovation. Amazon fundamentally believes that scientific innovation is essential to being the most customer-centric company in the world. It’s the company’s ability to have an impact at scale that allows us to attract some of the brightest minds in quantum computing and related fields. Our scientists continue to publish, teach, and engage with the academic community, in addition to utilizing our working backwards method to enrich the way we live and work. A day in the life Why AWS? Amazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating — that’s why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses. AWS Utility Computing (UC) provides product innovations — from foundational services such as Amazon’s Simple Storage Service (S3) and Amazon Elastic Compute Cloud (EC2), to consistently released new product innovations that continue to set AWS’s services and features apart in the industry. As a member of the UC organization, you’ll support the development and management of Compute, Database, Storage, Internet of Things (Iot), Platform, and Productivity Apps services in AWS. Within AWS UC, Amazon Dedicated Cloud (ADC) roles engage with AWS customers who require specialized security solutions for their cloud services. Inclusive Team Culture Here at AWS, it’s in our nature to learn and be curious. Our employee-led affinity groups foster a culture of inclusion that empower us to be proud of our differences. Ongoing events and learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon (gender diversity) conferences, inspire us to never stop embracing our uniqueness. Diverse Experiences AWS values diverse experiences. Even if you do not meet all of the qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying. Mentorship & Career Growth We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional. Work/Life Balance We value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why we strive for flexibility as part of our working culture. When we feel supported in the workplace and at home, there’s nothing we can’t achieve in the cloud. Export Control Requirement: Due to applicable export control laws and regulations, candidates must be either a U.S. citizen or national, U.S. permanent resident (i.e., current Green Card holder), or lawfully admitted into the U.S. as a refugee or granted asylum, or be able to obtain a US export license. If you are unsure if you meet these requirements, please apply and Amazon will review your application for eligibility.
US, MA, Boston
**This is a 12 month contract opportunity with the possibility to extend based on business needs** Embark on a transformative journey as our Domain Expert Lead, where intellectual rigor meets cutting-edge technological innovation. In this pivotal role, you will serve as a strategic architect of data integrity, leveraging your domain expertise to advance AI model training and evaluation. Your domain knowledge and experience will be instrumental in elevating our artificial intelligence capabilities, meticulously refining data collection processes and ensuring the highest standards of quality and precision across complex computational landscapes. Key job responsibilities • Critically analyze and evaluate responses generated by our LLMs across various domains and use cases in your area of expertise. • Develop and write demonstrations to illustrate "what good data looks like" in terms of meeting benchmarks for quality and efficiency • Participate in the creation of tooling that helps create such data by providing your feedback on what works and what doesn’t. • Champion effective knowledge-sharing initiatives by translating domain expertise into actionable insights, while cultivating strategic partnerships across multidisciplinary teams. • Provide detailed feedback and explanations for your evaluations, helping to refine and improve the LLM's understanding and output • Collaborate with the AI research team to identify areas for improvement in the LLM’s capabilities • Stay abreast of the latest developments in how LLMs and GenAI can be applied to your area of expertise to ensure our evaluations remain cutting-edge.
US, CA, Pasadena
Do you enjoy solving challenging problems and driving innovations in research? As a Research Science intern with the Quantum Algorithms Team at CQC, you will work alongside global experts to develop novel quantum algorithms, evaluate prospective applications of fault-tolerant quantum computers, and strengthen the long-term value proposition of quantum computing. A strong candidate will have experience applying methods of mathematical and numerical analysis to assess the performance of quantum algorithms and establish their advantage over classical algorithms. Key job responsibilities We are particularly interested in candidates with expertise in any of the following subareas related to quantum algorithms: quantum chemistry, many-body physics, quantum machine learning, cryptography, optimization theory, quantum complexity theory, quantum error correction & fault tolerance, quantum sensing, and scientific computing, among others. A day in the life Throughout your journey, you'll have access to unparalleled resources, including state-of-the-art computing infrastructure, cutting-edge research papers, and mentorship from industry luminaries. This immersive experience will not only sharpen your technical skills but also cultivate your ability to think critically, communicate effectively, and thrive in a fast-paced, innovative environment where bold ideas are celebrated. Diverse Experiences AWS values diverse experiences. Even if you do not meet all of the qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying. Why AWS? Amazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating — that’s why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses. Inclusive Team Culture Here at AWS, it’s in our nature to learn and be curious. Our employee-led affinity groups foster a culture of inclusion that empower us to be proud of our differences. Ongoing events and learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon (gender diversity) conferences, inspire us to never stop embracing our uniqueness. Mentorship & Career Growth We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional. Work/Life Balance We value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why we strive for flexibility as part of our working culture. When we feel supported in the workplace and at home, there’s nothing we can’t achieve in the cloud. Hybrid Work We value innovation and recognize this sometimes requires uninterrupted time to focus on a build. We also value in-person collaboration and time spent face-to-face. Our team affords employees options to work in the office every day or in a flexible, hybrid work model near one of our U.S. Amazon offices. This is not a remote internship opportunity. About the team Amazon Web Services (AWS) Center for Quantum Computing (CQC) is a multi-disciplinary team of theoretical and experimental physicists, materials scientists, and hardware and software engineers on a mission to develop a fault-tolerant quantum computer.
US, CA, Pasadena
The Amazon Web Services (AWS) Center for Quantum Computing in Pasadena, CA, is looking to hire a Research Scientist specializing in hardware design for cryogenic environements. The candidate should have expertise in 3D CAD (SolidWorks), thermal and structural FEA (Ansys/COMSOL), hardware design for cryogenic applications, design for manufacturing, and mechanical engineering principles. The candidate must have demonstrated driving designs through full product development cycles (requirements, conceptual design, detailed design, manufacturing, integration, and testing). Candidates must have a strong background in both cryogenic mechanical engineering theory and implementation. Working effectively within a cross-functional team environment is critical. Key job responsibilities Our scientists and engineers collaborate across diverse teams and projects to offer state of the art, cost effective solutions for scaling the signal delivery to AWS quantum processor systems at cryogenic temperatures. Equally important is the ability to scale the thermal performance and improve EMI mitigation of the cryogenic environment. You'll bring passion, enthusiasm, and innovation to work on the following: - High density novel packaging solutions for quantum processor units. - Cryogenic mechanical design for novel cryogenic signal conditioning sub-assemblies. - Cryogenic mechanical design for signal delivery systems. - Simulation driven designs (shielding, filtering, etc.) to reduce sources of EMI within the qubit environment. - Own end-to-end product development through requirements, design reports, design reviews, assembly/testing documentation, and final delivery. A day in the life As you design and implement cryogenic hardware solutions, from requirements definition to deployment, you will also: - Participate in requirements, design, and test reviews and communicate with internal stakeholders. - Work cross-functionally to help drive decisions using your unique technical background and skill set. - Refine and define standards and processes for operational excellence. - Work in a high-paced, startup-like environment where you are provided the resources to innovate quickly. About the team Why AWS? Amazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating — that’s why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses. AWS Utility Computing (UC) provides product innovations — from foundational services such as Amazon’s Simple Storage Service (S3) and Amazon Elastic Compute Cloud (EC2), to consistently released new product innovations that continue to set AWS’s services and features apart in the industry. As a member of the UC organization, you’ll support the development and management of Compute, Database, Storage, Internet of Things (Iot), Platform, and Productivity Apps services in AWS. Within AWS UC, Amazon Dedicated Cloud (ADC) roles engage with AWS customers who require specialized security solutions for their cloud services. Inclusive Team Culture AWS values curiosity and connection. Our employee-led and company-sponsored affinity groups promote inclusion and empower our people to take pride in what makes us unique. Our inclusion events foster stronger, more collaborative teams. Our continual innovation is fueled by the bold ideas, fresh perspectives, and passionate voices our teams bring to everything we do. Diverse Experiences AWS values diverse experiences. Even if you do not meet all of the qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying. Mentorship & Career Growth We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional. Work/Life Balance We value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why we strive for flexibility as part of our working culture. When we feel supported in the workplace and at home, there’s nothing we can’t achieve in the cloud. Export Control Requirement: Due to applicable export control laws and regulations, candidates must be either a U.S. citizen or national, U.S. permanent resident (i.e., current Green Card holder), or lawfully admitted into the U.S. as a refugee or granted asylum, or be able to obtain a US export license. If you are unsure if you meet these requirements, please apply and Amazon will review your application for eligibility.
US, CA, Santa Clara
Amazon Web Services (AWS) is assembling an elite team of world-class scientists and engineers to pioneer the next generation of AI-driven development tools. Join the Amazon Kiro LLM-Training team and help create groundbreaking generative AI technologies including Kiro IDE and Amazon Q Developer that are transforming the software development landscape. Key job responsibilities As a key member of our team, you'll be at the forefront of innovation, where cutting-edge research meets real-world application: - Push the boundaries of reinforcement learning and post-training methodologies for large language models specialized in code intelligence - Invent and implement state-of-the-art machine learning solutions that operate at unprecedented Amazon scale - Deploy revolutionary products that directly impact the daily workflows of millions of developers worldwide - Break new ground in AI and machine learning, challenging what's possible in intelligent code assistance - Publish and present your pioneering work at premier ML and NLP conferences (NeurIPS, ICML, ICLR , ACL, EMNLP) - Accelerate innovation by working directly with customers to rapidly transition research breakthroughs into production systems About the team The AWS Developer Agents and Experiences (DAE) team is reimagining the builder experience through generative AI and foundation models. We're leveraging the latest advances in AI to transform how engineers work from IDE environments to web-based tools and services, empowering developers to tackle projects of any scale with unprecedented efficiency. Broadly, AWS Utility Computing (UC) provides product innovations — from foundational services such as Amazon’s Simple Storage Service (S3) and Amazon Elastic Compute Cloud (EC2), to consistently released new product innovations that continue to set AWS’s services and features apart in the industry. As a member of the UC organization, you’ll support the development and management of Compute, Database, Storage, Internet of Things (Iot), Platform, and Productivity Apps services in AWS. Within AWS UC, Amazon Dedicated Cloud (ADC) roles engage with AWS customers who require specialized security solutions for their cloud services. Why AWS? Amazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating — that’s why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses. Inclusive Team Culture Here at AWS, it’s in our nature to learn and be curious. Our employee-led affinity groups foster a culture of inclusion that empower us to be proud of our differences. Ongoing events and learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon conferences, inspire us to never stop embracing our uniqueness. Mentorship & Career Growth We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional. Work/Life Balance We value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why we strive for flexibility as part of our working culture. When we feel supported in the workplace and at home, there’s nothing we can’t achieve in the cloud. Hybrid Work We value innovation and recognize this sometimes requires uninterrupted time to focus on a build. We also value in-person collaboration and time spent face-to-face. Our team affords employees options to work in the office every day or in a flexible, hybrid work model near one of our U.S. Amazon offices.
IN, KA, Bengaluru
Alexa+ is Amazon’s next-generation, AI-powered virtual assistant. Building on the original Alexa, it uses generative AI to deliver a more conversational, personalised, and effective experience. Alexa Sensitive Content Intelligence (ASCI) team is developing responsible AI (RAI) solutions for Alexa+, empowering it to provide useful information responsibly. The team is currently looking for Senior Applied Scientists with a strong background in NLP and/or CV to design and develop ML solutions in the RAI space using generative AI across all languages and countries. A Senior Applied Scientist will be a tech lead for a team of exceptional scientists to develop novel algorithms and modeling techniques to advance the state of the art in NLP or CV related tasks. You will work in a dynamic, fast-paced organization where scientists, engineers, and product managers work together to build customer facing experiences. You will collaborate with and mentor other scientists to raise the bar of scientific research in Amazon. Your work will directly impact our customers in the form of products and services that make use of speech, language, and computer vision technologies. We are looking for a leader with strong technical experiences a passion for building scientific driven solutions in a fast-paced environment. You should have good understanding of Artificial Intelligence (AI), Natural Language Understanding (NLU), Machine Learning (ML), Dialog Management, Automatic Speech Recognition (ASR), and Audio Signal Processing where to apply them in different business cases. You leverage your exceptional technical expertise, a sound understanding of the fundamentals of Computer Science, and practical experience of building large-scale distributed systems to creating reliable, scalable, and high-performance products. In addition to technical depth, you must possess exceptional communication skills and understand how to influence key stakeholders. You will be joining a select group of people making history producing one of the most highly rated products in Amazon's history, so if you are looking for a challenging and innovative role where you can solve important problems while growing as a leader, this may be the place for you. Key job responsibilities 1. Define and own the scientific vision and roadmap for ML solutions for building end-to-end Responsible AI solutions 2. Lead and grow a high-performing team of Applied Scientists, providing technical guidance, mentorship, and career development. 3. Guide model and system design to build innovative ML solutions at Alexa scale using state-of-the-art NLP and CV techniques. 4. Ensure models are production-ready, scalable, and robust through close partnership with stakeholders. Partner with Product, Operations, and Engineering leaders to enable proactive decision-making and corrective actions. 5. Own end-to-end business metrics, directly influencing customer experience and trust. 6. Help contribute to the broader ML community through publications, conference submissions, and internal knowledge sharing. A day in the life As an Applied Science Manager on the Alexa Sensitive Content team, you'll lead a team of scientists and ML engineers building AI systems that keep Alexa safe and trustworthy for millions of users worldwide. Your role combines technical leadership with strategic decision-making and collaborating with product teams and policy experts to deliver engaging and safe experiences across Amazon devices. You'll stay current with advances in generative AI to design, develop, and own state-of-the-art NLP solutions. You will be coaching scientists to identify and mitigate risks early, building more robust ML systems. You'll balance near-term delivery with long-term innovation, ensuring solutions are robust, interpretable, and scalable. Your work directly impacts delivery reliability, cost efficiency, and customer experience at massive scale. About the team The mission of the Alexa Sensitive Content Intelligence (ASCI) team is to (1) minimize negative surprises to customers caused by sensitive content, (2) detect and prevent potential brand-damaging interactions, and (3) build customer trust through appropriate interactions on sensitive topics. The term “sensitive content” includes within its scope a wide range of categories of content such as offensive content (e.g., hate speech, racist speech), profanity, content that is suitable only for certain age groups, politically polarizing content, and religiously polarizing content. The term “content” refers to any material that is exposed to customers by Alexa (including both 1P and 3P experiences) and includes text, speech, audio, and video.
US, MA, Boston
**This is an experimental role to support a business pilot and can potentially span up to 12 months** Embark on a transformative journey as our Sr. Domain Expert Lead, where intellectual rigor meets technological innovation. As a Sr. Domain Expert Lead, you will blend your advanced analytical skills and domain expertise to provide strategic oversight to our human-in-the-loop and model-in-the-loop data pipelines. You will also provide mentorship and guidance to junior team members. Your responsibilities will ensure data excellence through strategic oversight of high-quality data output, while delivering expert consultation throughout the pipeline and fostering iterative development. This position directly impacts the effectiveness and reliability of our AI solutions by maintaining the highest standards of data quality throughout the development process while building capability within the broader team. Key job responsibilities • Serve as a trusted domain advisor to cross-functional teams, providing strategic direction and specialized problem-solving support • Champion domain knowledge sharing across multiple channels and teams to maintain data quality excellence and standardization • Drive collaborative efforts with science teams to optimize output of complex data collections in your domain expertise, ensuring data excellence through iterative feedback loops • Foster team excellence through mentorship and motivation of peers and junior team members • Make informed decisions on behalf of our customers, ensuring that selected code meets industry standards, best practices, and specific client needs • Collaborate with AI teams to innovate model-in-the-loop and human-in-the-loop approaches, to ensure the collection of high-quality data, safeguarding data privacy and security for LLM training, and more. • Stay abreast of the latest developments in how LLMs and GenAI can be applied to your area of expertise to ensure our evaluations remain cutting-edge. • Develop and write demonstrations to illustrate "what good data looks like" in terms of meeting benchmarks for quality and efficiency • Provide detailed feedback and explanations for your evaluations, helping to refine and improve the LLM's understanding and output
US, MA, Boston
**This is an experimental role to support a business pilot and can potentially span up to 12 months** Embark on a transformative journey as our Sr. Domain Expert Lead, where intellectual rigor meets technological innovation. As a Sr. Domain Expert Lead, you will blend your advanced analytical skills and domain expertise to provide strategic oversight to our human-in-the-loop and model-in-the-loop data pipelines. You will also provide mentorship and guidance to junior team members. Your responsibilities will ensure data excellence through strategic oversight of high-quality data output, while delivering expert consultation throughout the pipeline and fostering iterative development. This position directly impacts the effectiveness and reliability of our AI solutions by maintaining the highest standards of data quality throughout the development process while building capability within the broader team. Key job responsibilities • Serve as a trusted domain advisor to cross-functional teams, providing strategic direction and specialized problem-solving support • Champion domain knowledge sharing across multiple channels and teams to maintain data quality excellence and standardization • Drive collaborative efforts with science teams to optimize output of complex data collections in your domain expertise, ensuring data excellence through iterative feedback loops • Foster team excellence through mentorship and motivation of peers and junior team members • Make informed decisions on behalf of our customers, ensuring that selected code meets industry standards, best practices, and specific client needs • Collaborate with AI teams to innovate model-in-the-loop and human-in-the-loop approaches, to ensure the collection of high-quality data, safeguarding data privacy and security for LLM training, and more. • Stay abreast of the latest developments in how LLMs and GenAI can be applied to your area of expertise to ensure our evaluations remain cutting-edge. • Develop and write demonstrations to illustrate "what good data looks like" in terms of meeting benchmarks for quality and efficiency • Provide detailed feedback and explanations for your evaluations, helping to refine and improve the LLM's understanding and output