Pittsburgh skyline at night
Pittsburgh is home to the University of Pittsburgh Medical Center, the University of Pittsburgh, and Carnegie Mellon University. Researchers from these institutions are using machine learning techniques to study breast cancer risk, identify depression markers, and understand what drives tumor growth, among other projects.
Credit: Rudolf Balasko

AWS and PHDA collaborate to produce more accurate machine learning models for breast cancer screening and depression

One year later: AWS collaboration with Pittsburgh Health Data Alliance begins to pay dividends with new machine learning innovation.

In August of 2019, the Pittsburgh Health Data Alliance (PHDA) and Amazon Web Services (AWS) announced a new collaboration to advance innovation in areas such as cancer diagnostics, precision medicine, electronic health records, and medical imaging.

Researchers from the University of Pittsburgh Medical Center (UPMC), the University of Pittsburgh, and Carnegie Mellon University (CMU), who were already supported by the PHDA, received additional support from Amazon Research Awards to use machine learning techniques to study breast cancer risk, identify depression markers, and understand what drives tumor growth, among other projects.

In work funded through the PHDA-AWS collaboration, a research team led by Shandong Wu, an associate professor in the University of Pittsburgh Department of Radiology, is using deep-learning systems to analyze mammograms in order to predict the short‐term risk of developing breast cancer.  A team of experts in computer vision, deep learning, bioinformatics, and breast cancer imaging are working together to develop a more personalized approach for patients undergoing breast cancer screening.

Shandong Wu, associate professor, University of Pittsburgh Department of Radiology
Shandong Wu, associate professor, University of Pittsburgh Department of Radiology
Credit: UPMC

Wu and his colleagues collected 452 de-identified normal screening mammogram images from 226 patients, half of whom later developed breast cancer and half of whom did not. Leveraging AWS tools, such as Amazon SageMaker, they used two different machine learning models to analyze the images for characteristics that could help predict breast cancer risk. As they reported in the American Association of Physicists in Medicine, both models consistently outperformed the simple measure of breast density, which today is the primary imaging marker for breast cancer risk. The team’s models demonstrated between 33% and 35% improvement over these existing models, based on metrics that incorporate sensitivity and specificity.

“This preliminary work demonstrates the feasibility and promise of applying deep-learning methodologies for in-depth interpretation of mammogram images to enhance breast cancer risk assessment,” Wu said. “Identifying additional risk factors for breast cancer, including those that can lead to a more personalized approach to screening, may help patients and providers take more appropriate preventive measures to reduce the likelihood of developing the disease or catching it early on when interventions are most effective. “

Tools that could provide more accurate predictions from screening images could be used to guide clinical decision making related to frequency of follow-up imaging and other forms of preventative monitoring. This could reduce unnecessary imaging examinations or clinical procedures, decreasing patients’ anxiety resulting from inaccurate risk assessments, and cutting costs.

Moving forward, researchers at the University of Pittsburgh and UPMC will pursue studies with more training samples and longitudinal imaging data to further evaluate the models. They also plan to combine deep learning with known clinical risk factors to improve upon the ability to diagnose and treat breast cancer earlier.

In a second project, Louis-Philippe Morency, associate professor of computer science at CMU, and Eva Szigethy, a clinical researcher at UPMC and professor of psychiatry, medicine, and pediatrics at the University of Pittsburgh, are developing sensing technologies that can automatically measure subtle changes in individuals’ behavior — such as facial expressions and use of language — that can act as biomarkers for depression.

Eva Szigethy, clinical researcher, University of Pittsburgh Medical Center, and professor of psychiatry, medicine, and pediatrics, University of Pittsburgh
Eva Szigethy, clinical researcher, University of Pittsburgh Medical Center, and professor of psychiatry, medicine, and pediatrics, University of Pittsburgh
Credit: UPMC

These biomarkers will later be compared with the results of traditional clinical assessments, allowing investigators to evaluate the performance of their technology and make improvements where necessary. This machine learning technology is intended to complement the ability of a clinician to make decisions about diagnosis and treatment. The team is working with a gastrointestinal-disorder clinic at UPMC, due to the high rate of depression observed in patients with functional gastrointestinal disorders.

This work involves training machine learning models on tens of thousands of examples across multiple modalities, including language (the spoken word), acoustic (prosody), and visual (facial expressions). The computational load is heavy, but by running experiments in parallel on multiple GPUS AWS services have allowed the researchers to train their models in a few days instead of weeks.

A quick and objective marker of depression could help clinicians more efficiently assess patients at baseline, identify patients who would otherwise go undiagnosed, and more accurately measure patients’ responses to interventions. The team presented a paper on the work, “Integrating Multimodal Information in Large Pretrained Transformers”, at the July 2020 meeting of the Association for Computational Linguistics.

“Depression is a disease that affects more than 17 million adults in the United States, up to two-thirds of all depression cases are left undiagnosed and therefore untreated,” Morency said. “New insights to increase the accuracy, efficiency, and adoption of depression screening have the potential to impact millions of patients, their families, and the healthcare system as a whole.”

Louis-Philippe Morency, associate professor of computer science, Carnegie Mellon University
Louis-Philippe Morency, associate professor of computer science, Carnegie Mellon University
Credit: CMU

The research projects on breast cancer and depression represent just the tip of the iceberg when it comes to the research and insights the collaboration across PHDA and AWS will ultimately deliver to improve patient care. Teams of researchers, health-care professionals, and machine learning experts across the PHDA continue to make progress on key research topics, from the risk of aneurysms and predicting how cancer cells progress, to improving the complex electronic-health-records system.

”Amazon is excited and encouraged by the progress these researchers are making and how machine learning is central to their work,” says An Luo, senior technical program manager for academic programs, Amazon AI. “We look forward to continuing to share how this unique collaboration between the PHDA and AWS is enabling new discoveries to help patients on a global scale.”

Research areas

Related content

IN, KA, Bengaluru
Alexa International is looking for passionate, talented, and inventive Senior Applied Scientists to help build industry-leading technology with Large Language Models (LLMs) and multimodal systems, requiring strong deep learning and generative models knowledge. Senior applied scientists will drive cross-team scientific strategy, influence partner teams, and deliver solutions that have broad impact across Alexa's international products and services. Key job responsibilities As a Applied Scientist with II the Alexa International team, you will work with talented peers to develop novel algorithms and modeling techniques to advance the state of the art with LLMs, particularly delivering industry-leading scientific research and applied AI for multi-lingual applications — a challenging area for the industry globally. Your work will directly impact our global customers in the form of products and services that support Alexa+. You will leverage Amazon's heterogeneous data sources and large-scale computing resources to accelerate advances in text, speech, and vision domains. The ideal candidate possesses a solid understanding of machine learning, speech and/or natural language processing, modern LLM architectures, LLM evaluation & tooling, and a passion for pushing boundaries in this vast and quickly evolving field. They thrive in fast-paced environment, like to tackle complex challenges, excel at swiftly delivering impactful solutions while iterating based on user feedback, and are able to influence and align multiple teams around a shared scientific vision. A day in the life * Analyze, understand, and model customer behavior and the customer experience based on large-scale data. * Build novel online & offline evaluation metrics and methodologies for multimodal personal digital assistants. * Fine-tune/post-train LLMs using advanced and innovative techniques like SFT, DPO, Reinforcement Learning (RLHF and RLAIF) for supporting model performance specific to a customer’s location and language. * Quickly experiment and set up experimentation framework for agile model and data analysis or A/B testing. * Contribute through industry-first research to drive innovation forward. * Drive cross-team scientific strategy and influence partner teams on LLM evaluation frameworks, post-training methodologies, and best practices for international speech and language systems. * Lead end-to-end delivery of scientifically complex solutions from research to production, including reusable science components and services that resolve architecture deficiencies across teams. * Serve as a scientific thought leader, communicating solutions clearly to partners, stakeholders, and senior leadership. * Actively mentor junior scientists and contribute to the broader internal and external scientific community through publications and community engagement.
US, NY, New York
About the Role In this role, you will own the science strategy and technical vision for this intelligence layer, leading a team of applied scientists working across GenAI and predictive modeling. You will shape how heterogeneous signals — text, behavioral, network, temporal — come together to power talent applications at Amazon scale, from workforce forecasting to personalized development to compensation strategy. You will identify opportunities where science investment can have material impact on long-term objectives or annual goals and build consensus around needed investments, working comfortably across different modeling paradigms and data modalities to guide principal and senior scientists in their most challenging and strategic decisions while serving as the strategic science advisor to PXT leaders operating at the Director, VP, and SVP levels. As a hands-on leader, you will personally own development and delivery of the most complex science problems at the intersection of multiple ML disciplines, stay current with emergent AI/ML science and engineering trends to influence focus areas in a rapidly evolving landscape, and participate in organizational planning, hiring, mentorship, and leadership development. Key job responsibilities • Lead technical initiatives in people science models, driving breakthrough approaches through hands-on research and development in areas like foundation models for predictive modeling, efficient multi-modal LLMs, and zero-shot learning • Design and implement novel ML architectures that push the boundaries of how workforce signals are represented, fused, and predicted at scale • Guide technical direction for research initiatives across the team, ensuring robust performance in production environments serving hundreds of thousands of employees • Mentor and develop senior scientists while maintaining strong individual technical contributions on the most complex cross-domain problems • Collaborate with engineering teams to optimize and scale models for real-world talent applications • Influence technical decisions and implementation strategies across teams, shaping the long-term platform architecture About the team The People eXperience and Technology (PXT) Core Science Team uses science, engineering, and customer-obsessed problem solving to proactively identify mechanisms, process improvements, and products that simultaneously improve Amazon and Amazonians' lives, wellbeing, and value of work. As an interdisciplinary team combining talents from machine learning, statistics, economics, behavioral science, engineering, and product development, the Core Science team develops and delivers measurable solutions through innovation and rapid prototyping to accelerate informed, accurate, and reliable decision-making backed by science and data.
US, MA, N.reading
Amazon is seeking exceptional talent to help develop the next generation of advanced robotics systems that will transform automation at Amazon's scale. We're building revolutionary robotic systems that combine cutting-edge AI, sophisticated control systems, and advanced mechanical design to create adaptable automation solutions capable of working safely alongside humans in dynamic environments. This is a unique opportunity to shape the future of robotics and automation at an unprecedented scale, working with world-class teams pushing the boundaries of what's possible in robotic dexterous manipulation, locomotion, and human-robot interaction. This role presents an opportunity to shape the future of robotics through innovative applications of deep learning and large language models. At Amazon we leverage advanced robotics, machine learning, and artificial intelligence to solve complex operational challenges at an unprecedented scale. Our fleet of robots operates across hundreds of facilities worldwide, working in sophisticated coordination to fulfill our mission of customer excellence. The ideal candidate will contribute to research that bridges the gap between theoretical advancement and practical implementation in robotics. You will be part of a team that's revolutionizing how robots learn, adapt, and interact with their environment. Join us in building the next generation of intelligent robotics systems that will transform the future of automation and human-robot collaboration. Key job responsibilities - Design and implement whole body control methods for balance, locomotion, and dexterous manipulation - Utilize state-of-the-art in methods in learned and model-based control - Create robust and safe behaviors for different terrains and tasks - Implement real-time controllers with stability guarantees - Collaborate effectively with multi-disciplinary teams to co-design hardware and algorithms for loco-manipulation - Mentor junior engineer and scientists
IN, KA, Bengaluru
Have you ever ordered a product on Amazon and when that box with the smile arrived you wondered how it got to you so fast? Have you wondered where it came from and how much it cost Amazon to deliver it to you? If so, the WW Amazon Logistics, Business Analytics team is for you. We manage the delivery of tens of millions of products every week to Amazon’s customers, achieving on-time delivery in a cost-effective manner. We are looking for an enthusiastic, customer obsessed, Applied Scientist with good analytical skills to help manage projects and operations, implement scheduling solutions, improve metrics, and develop scalable processes and tools. The primary role of an Operations Research Scientist within Amazon is to address business challenges through building a compelling case, and using data to influence change across the organization. This individual will be given responsibility on their first day to own those business challenges and the autonomy to think strategically and make data driven decisions. Decisions and tools made in this role will have significant impact to the customer experience, as it will have a major impact on how the final phase of delivery is done at Amazon. Ideal candidates will be a high potential, strategic and analytic graduate with a PhD in (Operations Research, Statistics, Engineering, and Supply Chain) ready for challenging opportunities in the core of our world class operations space. Great candidates have a history of operations research, and the ability to use data and research to make changes. This role requires robust program management skills and research science skills in order to act on research outcomes. This individual will need to be able to work with a team, but also be comfortable making decisions independently, in what is often times an ambiguous environment. Responsibilities may include: - Develop input and assumptions based preexisting models to estimate the costs and savings opportunities associated with varying levels of network growth and operations - Creating metrics to measure business performance, identify root causes and trends, and prescribe action plans - Managing multiple projects simultaneously - Working with technology teams and product managers to develop new tools and systems to support the growth of the business - Communicating with and supporting various internal stakeholders and external audiences
GB, London
Come build the future of entertainment with us. Are you interested in shaping the future of movies and television? Do you want to define the next generation of how and what Amazon customers are watching? Prime Video is a premium streaming service that offers customers a vast collection of TV shows and movies - all with the ease of finding what they love to watch in one place. We offer customers thousands of popular movies and TV shows including Amazon Originals and exclusive licensed content to exciting live sports events. We also offer our members the opportunity to subscribe to add-on channels which they can cancel at anytime and to rent or buy new release movies and TV box sets on the Prime Video Store. Prime Video is a fast-paced, growth business - available in over 200 countries and territories worldwide. The team works in a dynamic environment where innovating on behalf of our customers is at the heart of everything we do. If this sounds exciting to you, please read on. The Insights team is looking for an Applied Scientist for our London office experienced in generative AI and large models. This is a wide impact role working with development teams across the UK, India, and the US. This greenfield project will deliver features that reduce the operational load for internal Prime Video builders and for this, you will need to develop personalized recommendations for their services. You will have strong technical ability, excellent teamwork and communication skills, and a strong motivation to deliver customer value from your research. Our position offers opportunities to grow your technical and non-technical skills and make a global impact immediately. Key job responsibilities - Develop machine learning algorithms for high-scale recommendations problems - Rapidly design, prototype and test many possible hypotheses in a high-ambiguity environment, making use of both quantitative analysis and business judgement - Collaborate with software engineers to integrate successful experimental results into Prime Video wide processes - Communicate results and insights to both technical and non-technical audiences, including through presentations and written reports A day in the life You will lead the design of machine learning models that scale to very large quantities of data across multiple dimensions. You will embody scientific rigor, designing and executing experiments to demonstrate the technical effectiveness and business value of your methods. You will work alongside other scientists and engineering teams to deliver your research into production systems. About the team Our team owns Prime Video observability features for development teams. We consume PBs of data daily which feed into multiple observability features focussed on reducing the customer impact time.
CN, 31, Shanghai
You will be working with a unique and gifted team developing exciting products for consumers. The team is a multidisciplinary group of engineers and scientists engaged in a fast paced mission to deliver new products. The team faces a challenging task of balancing cost, schedule, and performance requirements. You should be comfortable collaborating in a fast-paced and often uncertain environment, and contributing to innovative solutions, while demonstrating leadership, technical competence, and meticulousness. Your deliverables will include development of thermal solutions, concept design, feature development, product architecture and system validation through to manufacturing release. You will support creative developments through application of analysis and testing of complex electronic assemblies using advanced simulation and experimentation tools and techniques. Key job responsibilities * Evaluate and optimize thermal solution requirements of consumer electronic products * Use simulation tools like Star-CCM+ or FloTherm XT/EFD for analysis and design of products * Validate design modifications for thermal concerns using simulation and actual prototypes * Establish temperature thresholds for user comfort level and component level considering reliability requirements * Have intimate knowledge of various materials and heat spreaders solutions to resolve thermal issues * Use of programming languages like Python and Matlab for analytical/statistical analyses and automation * Collaborate as part of device team to iterate and optimize design parameters of enclosures and structural parts to establish and deliver project performance objectives * Design and execute of tests using statistical tools to validate analytical models, identify risks and assess design margins * Create and present analytical and experimental results * Develop and apply design guidelines based on project learnings
US, CA, San Francisco
MULTIPLE POSITIONS AVAILABLE Employer: AMAZON DEVELOPMENT CENTER U.S., INC., Offered Position: Research Scientist II Job Location: San Francisco, California Job Number: AMZ9674001 Position Responsibilities: Design research studies to obtain scientific information. Develop theories or models of physical phenomena encountered in quantum computing, superconducting qubit device physics, materials or process development and characterization. Collaborate with others to determine design specifications, including of superconducting quantum processor chips, microwave chip packages, and associated electrical and mechanical components. Develop scientific or mathematical models to predict physical device behavior and performance, and verify the implementation of computational models. Apply mathematical principles or statistical approaches to solve problems, for example to validate modeling predictions under experimental uncertainty using statistical methods. Operate laboratory or field equipment and scientific instrumentation for device fabrication, device characterization, or advanced materials research. Develop new algorithms or methods for designing, simulating, or measuring quantum computers. Develop performance metrics or standards related to quantum information technology. Recommend technical design or process changes to improve quality or performance of superconducting quantum processors and efficiency of their design, manufacture, and testing. Collaborate on research activities with scientists or technical specialists. Prepare scientific or technical reports or presentations and present research results to others. 40 hours / week, 8:00am-5:00pm, Salary Range $168,126/year to $212,800/year. Amazon is a total compensation company. Dependent on the position offered, equity, sign-on payments, and other forms of compensation may be provided as part of a total compensation package, in addition to a full range of medical, financial, and/or other benefits. For more information, visit: https://www.aboutamazon.com/workplace/employee-benefits. Amazon.com is an Equal Opportunity-Affirmative Action Employer – Minority / Female / Disability / Veteran / Gender Identity / Sexual Orientation.#0000
US, WA, Seattle
This role leads the science function in WW Stores Finance as part of the IPAT organization (Insights, Planning, Analytics and Technology), driving transformative innovations in financial analytics through AI and machine learning across the global Stores finance organization. The successful candidate builds and directs a multidisciplinary team of data scientists, applied scientists, economists, and product managers to deliver scalable solutions that fundamentally change how finance teams generate insights, automate workflows, and make decisions. As part of the WW Stores Finance leadership team, this leader partners with engineering, product, and finance stakeholders to translate emerging AI capabilities into production systems that deliver measurable improvements in speed, accuracy, and efficiency. The role's outputs directly inform VP/SVP/CFO/CEO leadership decisions and drive impact across the entire Stores P&L. Success requires translating complex technical concepts for finance domain experts and business leaders while maintaining deep technical credibility with science and engineering teams. The role demands both strategic vision—identifying high-impact opportunities where AI can transform finance operations—and execution excellence in coordinating project planning, resource allocation, and delivery across multiple concurrent initiatives. This leader establishes methodologies and models that enable Amazon finance to achieve step-change improvements in both the speed and quality of business insights, directly supporting critical processes including month-end reporting, quarterly guidance, annual planning cycles, and financial controllership. Key job responsibilities Transformation of Finance Workflows — Lead development of agentic AI solutions that automate routine finance tasks and transform how teams communicate business insights. Deploy these solutions across financial analysis, narrative generation, and dynamic table creation for month-end reporting and planning cycles. Partner with engineering and product teams to integrate these capabilities into production systems that directly support Stores Finance and FGBS automation goals, delivering measurable reductions in manual effort and cycle time. Science-Based Forecasting — Develop and deploy machine learning forecasts that integrate into existing planning processes including OP1, OP2, and quarterly guidance cycles. Partner with finance teams across WW Stores to iterate on forecast accuracy, applying these models either as alternative viewpoints to complement bottoms-up forecasts or as hands-off replacements for manual forecasting processes. Establish evaluation frameworks that demonstrate forecast performance against business benchmarks and drive adoption across critical planning workflows. Financial Controllership — Scale AI capabilities across controllership workstreams to improve reporting accuracy and automate manual processes. Leverage generative AI to identify financial risk through systematic pattern recognition in transaction data, account reconciliations, and variance analysis. Develop production systems that enhance decision-making speed and quality in financial close, audit preparation, and compliance reporting, delivering quantifiable improvements in error detection rates and process efficiency. About the team IPAT (Insights, Planning, Analytics, and Technology) is a team in the Worldwide Amazon Stores Finance organization composed of leaders across engineering, finance, product, and science. Our mission is to reimagine finance using technology and science to provide fast, efficient, and accurate insights that drive business decisions and strengthen governance. We are dedicated to improving financial operations through innovative applications of technology and science. Our work focuses on developing adaptive solutions for diverse financial use cases, applying AI to solve complex financial challenges, and conducting financial data analysis. Operating globally, we strive to develop adaptable solutions for diverse markets. We aim to advance financial science, continually improving accuracy, efficiency, and insight generation in support of Amazon's mission to be Earth's most customer-centric company.
US, NY, New York
Do you want to lead the Ads industry and redefine how we measure the effectiveness of Amazon Ads business? Are you passionate about causal inference, Deep Learning/DNN, raising the science bar, and connecting leading-edge science research to Amazon-scale implementation? If so, come join Amazon Ads to be an Economist leader within our Advertising Incrementality Measurement science team! Our work builds the foundations for providing customer-facing experimentation tools, furthering internal research & development on Econometrics, and building out Amazon's advertising measurement offerings. Incrementality is a lynchpin for the next generation of Amazon Advertising measurement solutions and this role will play a key role in the release and expansion of these offerings. Key job responsibilities As an Economist leader within the Advertising Incrementality Measurement (AIM) science team, you are responsible for defining and executing on key workstreams within our overall causal measurement science vision. In particular, you can lead the development of experimental methodologies to measure ad effectiveness, and also build observational models that lay the foundations for understanding the impact of individual ad touchpoints for billions of daily ad interactions. You will work on a team of Applied Scientists, Economists, and Data Scientists, alongside a dedicated Engineering team, to work backwards from customer needs and translate product ideas into concrete science deliverables. You will be a thought leader for inventing scalable causal measurement solutions that support highly accurate and actionable insights--from defining and executing hundreds of thousands of RCTs, to developing an exciting science R&D agenda. You will be working with massive data and industry-leading partner scientists, while also interfacing with leadership to define our future vision. Your work will help shape the future of Amazon Advertising. About the team AIM is a cross disciplinary team of engineers, product managers, economists, data scientists, and applied scientists with a charter to build scientifically-rigorous causal inference methodologies at scale. Our job is to help customers cut through the noise of the modern advertising landscape and understand what actions, behaviors, and strategies actually have a real, measurable impact on key outcomes. The data we produce becomes the effective ground truth for advertisers and partners making decisions affecting millions in advertising spend.
US, NY, New York
The Measurement Intelligence Science Team (MIST) in the Measurement, Ad Tech, and Data Science (MADS) organization of Amazon Ads serves a centralized role developing solutions for a multitude of performance measurement products. We create solutions which measure the comprehensive impact of their ad spend, including sales impacts both online and offline and across timescales, and provide actionable insights that enable our advertisers to optimize their media portfolios. We leverage a host of scientific technologies to accomplish this mission, including Generative AI, classical ML, Causal Inference, Natural Language Processing, and Computer Vision. As an Applied Science Manager on the team, you will lead a team of scientists to define and execute a transformative vision for holistic measurement and reporting insights for ad effectiveness. Your team will own the science solutions for foundational experimentation platforms, foundational customer journey understanding technologies, state of the art attribution algorithms to measure the role of advertising in driving observed retail outcomes, and/or agentic AI solutions that help advertisers get quick access to custom insights that inform how to get the most out of their ad spend. Key job responsibilities You independently manage a team of scientists. You identify the needs of your team and effectively grow, hire, and promote scientists to maintain a high-performing team. You have a broad understanding of scientific techniques, several of which may fall out of your specific job function. You define the strategic vision for your team. You establish a roadmap and successfully deliver scientific solutions that execute that vision. You define clear goals for your team and effectively prioritize, balancing short-term needs and long-term value. You establish clear and effective metrics and scientific process to enforce consistent, high-quality artifact delivery. You proactively identify risks and bring them to the attention of your manager, customers, and stakeholders with plans for mitigation before they become roadblocks. You know when to escalate. You communicate ideas effectively, both verbally and in writing, to all types of audiences. You author strategic documentation for your team. You communicate issues and options with leaders in such a way that facilitates understanding and that leads to a decision. You work successfully with customers, leaders, and engineering teams. You foster a constructive dialogue, harmonize discordant views, and lead the resolution of contentious issues. About the team We are a team of scientists across Applied, Research, Data Science and Economist disciplines. You will work with colleagues with deep expertise in ML, NLP, CV, Gen AI, and Causal Inference with a diverse range of backgrounds. We partner closely with top-notch engineers, product managers, sales leaders, and other scientists with expertise in the ads industry and on building scalable modeling and software solutions.