
Garegin Papoian’s quest to model an elusive class of proteins

With the support of an Amazon Research Award, Papoian’s team is deciphering the dynamics of intrinsically disordered proteins.

How do molecules come together and start to behave like a living system? This is the type of question that drives Garegin Papoian’s research. At the University of Maryland, where he is the Monroe Martin Professor, he focuses on computational modeling of biological molecules such as proteins and DNA. His Papoian Lab, a theoretical physical chemistry group at the university, is also working toward developing fundamental molecular models of the whole cell, a concept still in its infancy.

Papoian’s path into science was determined early on. Growing up in Armenia, then a part of the Soviet Union, he went to a special school of physics and mathematics, where he was introduced to Science Olympiads. While in high school, he won first place in the Republic of Armenia in separate Olympiads in chemistry, physics, mathematics and biology. “Science Olympiads were a big reason why I got drawn into science, in particular to chemistry and physics,” he says.

Because of his success in the competitions, he was invited to study at an advanced chemistry college in Moscow established specifically for Olympiad winners.

“I was 16,” he says, “but it was assumed that we already knew all university level chemistry. So, they would start immediately with a very high-level training.” The program included an internship in the United States, at the University of Kansas. From there he eventually enrolled as a graduate student at Cornell University, where he pursued his PhD in quantum chemistry, working under Nobel laureate Roald Hoffmann.

During his postdoc, he turned to classical physics with a particular emphasis on biophysics. “I was interested in bringing concepts of physical chemistry to understand biological phenomena from the molecular perspective,” he says. “And my long-term career goal is to develop concepts both for proteins and cells.”

Predicting a protein’s shape

A protein is a large molecule essential to all living things. The sequence of amino acids that form a protein determines its three-dimensional structure. Each protein has a unique shape that dictates its function. Being able to predict what a protein structure looks like from its amino acid sequence has been a long-standing scientific challenge and one of the research interests of Papoian’s group, for which he received an AWS Machine Learning Research Award in 2018.

This animation shows the structure of a protein called linker histone H1, including its disordered tails, predicted by Papoian’s team. “We discovered that interactions of those disordered tails with DNA help to structurally position H1 with respect to the nucleosome. In terms of the bigger picture, the H1-nucleosome interactions regulate epigenetic processes, determining for example which particular genes should be turned on or off,” says Papoian.

One of the applications of protein structure prediction is drug design. “When you design a drug, you need to know what the target looks like,” says Papoian. If you know that the target protein has a certain pocket, for example, you can develop a molecule that will fit nicely into that pocket. While identifying genes associated with diseases has become easier, the sequence of a gene doesn’t tell you what the protein expressed by it looks like, and experimental methods to determine the protein shape are lengthy and expensive.


Even in the wake of DeepMind demonstrating that AlphaFold is capable of predicting protein structures with an unprecedented level of accuracy, challenges still remain.

It turns out that a large proportion of human proteins are not completely structured in neat three-dimensional shapes. These are called intrinsically disordered proteins (IDPs). “They are much more dynamic and mostly never fall into a single structure,” says Papoian. “They are more like this crazy spaghetti. It’s very hard to deal with them both experimentally and computationally because they are so elusive.” He notes that about a third of human proteins are like that, including many important disease-causing proteins.

Papoian’s AWS Machine Learning Research Award enabled his team to advance the development of a system that is better suited to simulating these proteins.

Tackling disordered proteins

For the past few years, Papoian Lab has been working with a protein modeling framework called AWSEM-MD (pronounced “awesome”), which stands for associative memory, water-mediated, structure and energy model, with MD denoting molecular dynamics. It was developed jointly with Peter Wolynes, Papoian’s former postdoctoral advisor, who is currently at Rice University and with whom Papoian has continued to collaborate over the years.

Using the AWS Machine Learning Research Award, Papoian and his colleagues developed AWSEM-IDP, an AWSEM branch specifically designed to simulate intrinsically disordered proteins.

AWSEM-IDP draws on a database of protein fragment structures obtained experimentally, for example through nuclear magnetic resonance (NMR) spectroscopy, a technique that determines the structure and dynamics of proteins. “These fragments serve as structural memories that guide the IDP to undergo structural transformations that are informed by the experiment,” Papoian explains. “This allows simulating more realistic IDP dynamics.”

The fragment database may also contain structures from atomistic simulations — a type of simulation where every atom of a protein is present. “The reason why we prefer not to do those in general is that they’re very expensive, so we cannot do very big simulations. But we can do atomistic simulations of short fragments to give us good fragment memories, again improving the accuracy of IDP’s structural exploration in AWSEM simulations,” he says.
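To make the idea of fragment memories concrete, here is a minimal toy sketch in Python. It is not AWSEM-IDP code: the function name, the Gaussian-well form with its `strength` and `width` parameters, and the example distances are all illustrative assumptions. It only shows the general principle that a conformation scores a lower (more favorable) energy the more closely its pairwise distances resemble those stored in the memories, and that several different memories can coexist.

```python
import math


def fragment_memory_energy(distances, memories, strength=1.0, width=1.0):
    """Toy associative-memory bias (hypothetical, for illustration only).

    `distances` maps a residue pair (i, j) to its current distance.
    Each memory maps residue pairs to the distance seen in a stored fragment.
    Every memory contributes a Gaussian well centred on its own distances,
    so closer matches give a lower (more favorable) energy.
    """
    energy = 0.0
    for memory in memories:
        for pair, r_mem in memory.items():
            r = distances.get(pair)
            if r is None:
                continue  # pair absent from this conformation, skip it
            energy -= strength * math.exp(-((r - r_mem) ** 2) / (2.0 * width ** 2))
    return energy


# Two hypothetical memories for the same three-residue fragment.
# Distances are illustrative values in angstroms, not taken from any database.
helix_like = {(0, 1): 3.8, (1, 2): 3.8, (0, 2): 5.5}
extended = {(0, 1): 3.8, (1, 2): 3.8, (0, 2): 7.0}
memories = [helix_like, extended]

# One conformation that matches a memory and one that matches neither well.
compact_conf = {(0, 1): 3.8, (1, 2): 3.8, (0, 2): 5.5}
odd_conf = {(0, 1): 3.8, (1, 2): 3.8, (0, 2): 9.5}

e_compact = fragment_memory_energy(compact_conf, memories)
e_odd = fragment_memory_energy(odd_conf, memories)
print(e_compact < e_odd)  # the memory-like conformation is favored
```

Because each memory adds its own well, a conformation can be stabilized by resembling any one of the stored fragments, which is one simple way to picture a chain that prefers several structures rather than a single fold.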

An IDP will prefer multiple structures, not just one.

“That’s the key difference from regular proteins: IDPs are multi-faceted in essence. But they still prefer certain structures over others. And the AWSEM-IDP model allows you to correctly describe those preferences,” Papoian explains. This model was described in a 2018 article published in the Journal of Physical Chemistry B.

In another study published earlier this year and supported by the AWS Machine Learning Research Award, Papoian and his colleagues applied AWSEM-IDP to a protein called linker histone H1, which plays an essential role in regulating many important biological processes. This protein has two intrinsically disordered regions, parts of its structure that are not well folded and resemble two tails. Because they are disordered, it’s much harder to understand what they do and how they interact.

Proteins like linker histone H1 regulate histone complexes, which act like a spool around which the DNA wraps to create structures called nucleosomes. “In this paper, we used AWSEM-IDP to model the nucleosome with linker histone H1, in particular with these disordered tails. And that allowed us to understand how the linker histone and the nucleosome come together and interact, and what's the role of these disordered tails,” says Papoian. Understanding proteins’ interactions with nucleosomes may give important insights into epigenetics, which is one of Papoian Lab’s interests.

Future challenges

Because making sense of IDPs is such a difficult process, Papoian says that AWSEM-IDP is an ongoing program with room for improvement. “What we have currently works better in some classes of proteins, and not so much in others. So next we’ll explore what are the challenges for what we currently have in AWSEM-IDP and try to come up with new advances to overcome them.”

In addition to IDPs, Papoian Lab will also continue to pursue the use of deep learning for structure prediction of well-folded proteins. Although there is some conceptual overlap with AlphaFold, Papoian believes that AWSEM-MD is a powerful tool and has advantages over other approaches when it comes to molecular dynamics.


“Proteins are not frozen objects,” he says. “Some of them are well structured, but many are not structured at all, and they are dynamic and move and shape-shift incessantly. So, to understand how these proteins function, you must model their dynamics and that’s what AWSEM-MD can do best.”

Papoian thinks one exciting area to be explored in coming decades will be combining machine learning and physics to work on protein structure prediction, protein dynamics, multiprotein complexes, and epigenetics.

“There are lots of things that still remain to be understood in our models. And I think that probably neither physics nor machine learning by themselves can tackle them. But a program that brings them together in a productive way can be very powerful,” he said.

Modeling an entire cell

Another ambitious project that Papoian and his colleagues are pursuing is to develop a computational model of an entire cell. “We still don’t have a blueprint of a cell the way we have a blueprint of a car or a Boeing airplane.”

To do that, his group develops their own software from scratch.


“We basically do the science, the physics, and biophysics of what is needed to model our cells. We derive the needed algorithms from scratch based on the laws of physics and chemistry and then we program that into a computer and run simulations on a supercomputer,” he explained. This has to be done at a single molecule resolution, he adds, meaning that they have to track every single molecule within a cell.

To achieve that, the Papoian Lab developed a model called MEDYAN.

“We can already model some number of proteins, the membrane, we can model rich chemistry. We have developed some of the fundamental chemistry and physics components of what needs to be done,” he says. The next step is to scale it. “We usually do simulations with several types of proteins. So instead of several, you will need maybe hundreds or thousands of different types of proteins, so it just brings more complexity.”

When that happens, it will be a huge revolution in biomedicine, he says. “Then lots of things that people laboriously spend years doing in the laboratory could just run on AWS servers. And you could do your experiments and search for treatments computationally, which would be much cheaper and faster.”
