Garegin Papoian is the Monroe Martin Professor at the University of Maryland. Within the Papoian Lab, a theoretical physical chemistry group located at the university, his team is working toward developing fundamental molecular models of the whole cell, a concept still in its infancy.
Courtesy of Garegin Papoian

Garegin Papoian’s quest to model an elusive class of proteins

With the support of an Amazon Research Award, Papoian’s team is deciphering the dynamics of intrinsically disordered proteins.

How do molecules come together and start to behave like a living system? This is the type of question that drives Garegin Papoian’s research. At the University of Maryland, where he is the Monroe Martin Professor, he focuses on computational modeling of biological molecules like proteins and DNA. His group, the Papoian Lab, a theoretical physical chemistry group at the university, is also working toward developing fundamental molecular models of the whole cell, a concept still in its infancy.

Papoian’s path into science was determined early on. Growing up in Armenia, then a part of the Soviet Union, he went to a special school of physics and mathematics, where he was introduced to Science Olympiads. While in high school, he won first place in the Republic of Armenia in separate Olympiads in chemistry, physics, mathematics and biology. “Science Olympiads were a big reason why I got drawn into science, in particular to chemistry and physics,” he says.

Because of his success in the competitions, he was invited to study at an advanced chemistry college in Moscow established specifically for Olympiad winners.

“I was 16,” he says, “but it was assumed that we already knew all university level chemistry. So, they would start immediately with a very high-level training.” The program included an internship in the United States, at the University of Kansas. From there he eventually enrolled as a graduate student at Cornell University, where he pursued his PhD in quantum chemistry, working under Nobel laureate Roald Hoffmann.

During his postdoc, he turned to classical physics with a particular emphasis on biophysics. “I was interested in bringing concepts of physical chemistry to understand biological phenomena from the molecular perspective,” he says. “And my long-term career goal is to develop concepts both for proteins and cells.”

Predicting a protein’s shape

A protein is a large molecule essential to all living things. The sequence of amino acids that form a protein determines its three-dimensional structure. Each protein has a unique shape that dictates its function. Being able to predict what a protein structure looks like from its amino acid sequence has been a long-standing scientific challenge and one of the research interests of Papoian’s group, for which he received an AWS Machine Learning Research Award in 2018.

This animation shows the structure of a protein called linker histone H1, including its disordered tails, predicted by Papoian’s team. “We discovered that interactions of those disordered tails with DNA help to structurally position H1 with respect to the nucleosome. In terms of the bigger picture, the H1-nucleosome interactions regulate epigenetic processes, determining for example which particular genes should be turned on or off,” says Papoian.

One of the applications of protein structure prediction is drug design. “When you design a drug, you need to know what the target looks like,” says Papoian. If you know that the target protein has a certain pocket, for example, you can develop a molecule that will fit nicely into that pocket. While identifying genes associated with diseases has become easier, the sequence of a gene doesn’t tell you what the protein expressed by it looks like, and experimental methods to determine the protein shape are lengthy and expensive.


Even after DeepMind demonstrated that AlphaFold can predict protein structures with an unprecedented level of accuracy, challenges remain.

It turns out that a large proportion of human proteins are not completely structured in neat three-dimensional shapes. These are called intrinsically disordered proteins (IDPs). “They are much more dynamic and mostly never fall into a single structure,” says Papoian. “They are more like this crazy spaghetti. It’s very hard to deal with them both experimentally and computationally because they are so elusive.” He notes that about a third of human proteins are like that, including many important disease-causing proteins.

Papoian’s AWS Machine Learning Research Award enabled his team to advance the development of a system that is better suited to simulating these proteins.

Tackling disordered proteins

For the past few years, the Papoian Lab has been working with a protein modeling framework called AWSEM-MD (pronounced “awesome”), which stands for associative memory, water-mediated, structure and energy model, with MD for molecular dynamics. It was developed jointly with Peter Wolynes, Papoian’s former postdoctoral advisor, who is currently at Rice University and with whom he has continued to collaborate over the years.

Using the AWS Machine Learning Research Award, Papoian and his colleagues developed AWSEM-IDP, an AWSEM branch specifically designed to simulate intrinsically disordered proteins.

This system uses a database of protein fragment structures obtained experimentally, for example, through nuclear magnetic resonance (NMR) spectroscopy, a technique that determines the structure and dynamics of proteins. “These fragments serve as structural memories that guide the IDP to undergo structural transformations that are informed by the experiment,” Papoian explains. “This allows simulating more realistic IDP dynamics.”

The fragment database may also contain structures from atomistic simulations — a type of simulation where every atom of a protein is present. “The reason why we prefer not to do those in general is that they’re very expensive, so we cannot do very big simulations. But we can do atomistic simulations of short fragments to give us good fragment memories, again improving the accuracy of IDP’s structural exploration in AWSEM simulations,” he says.

An IDP will prefer multiple structures, not just one.

“That’s the key difference from regular proteins: IDPs are multi-faceted in essence. But they still prefer certain structures over others. And the AWSEM-IDP model allows you to correctly describe those preferences,” Papoian explains. This model was described in a 2018 article published in the Journal of Physical Chemistry B.

In another study published earlier this year that was supported by the AWS Machine Learning Research Award, Papoian and his colleagues applied AWSEM-IDP to study a protein called linker histone H1, which plays an essential role in regulating many important biological processes. This protein has two intrinsically disordered regions, parts of its structure that are not well folded and resemble two tails. Because they are disordered, it’s much harder to understand what they do and how they interact.

Proteins like linker histone H1 regulate histone complexes, which act like a spool around which the DNA wraps to create structures called nucleosomes. “In this paper, we used AWSEM-IDP to model the nucleosome with linker histone H1, in particular with these disordered tails. And that allowed us to understand how the linker histone and the nucleosome come together and interact, and what’s the role of these disordered tails,” says Papoian. Understanding proteins’ interactions with nucleosomes may give important insights into epigenetics, which is one of the Papoian Lab’s interests.

Future challenges

Because making sense of IDPs is such a difficult process, Papoian says that AWSEM-IDP is an ongoing program with room for improvement. “What we have currently works better in some classes of proteins, and not so much in others. So next we’ll explore what are the challenges for what we currently have in AWSEM-IDP and try to come up with new advances to overcome them.”

In addition to IDPs, the Papoian Lab will also continue to pursue the use of deep learning for structure prediction of well-folded proteins. Although there is some conceptual overlap with AlphaFold, Papoian believes that AWSEM-MD is a powerful tool and has advantages over other approaches when it comes to molecular dynamics.


“Proteins are not frozen objects,” he says. “Some of them are well structured, but many are not structured at all, and they are dynamic and move and shape-shift incessantly. So, to understand how these proteins function, you must model their dynamics and that’s what AWSEM-MD can do best.”

Papoian thinks one exciting area to be explored in coming decades will be combining machine learning and physics to work on protein structure prediction, protein dynamics, multiprotein complexes, and epigenetics.

“There are lots of things that still remain to be understood in our models. And I think that probably neither physics nor machine learning by themselves can tackle them. But a program that brings them together in a productive way can be very powerful,” he says.

Modeling an entire cell

Another ambitious project that Papoian and his colleagues are pursuing is to develop a computational model of an entire cell. “We still don’t have a blueprint of a cell the way we have a blueprint of a car or a Boeing airplane.”

To do that, his group develops their own software from scratch.

Garegin Papoian: How do cells move? Chemistry meets mechanics

“We basically do the science, the physics, and biophysics of what is needed to model our cells. We derive the needed algorithms from scratch based on the laws of physics and chemistry and then we program that into a computer and run simulations on a supercomputer,” he explains. This has to be done at single-molecule resolution, he adds, meaning that they have to track every single molecule within a cell.

To achieve that, the Papoian Lab developed a model called MEDYAN.

“We can already model some number of proteins, the membrane, we can model rich chemistry. We have developed some of the fundamental chemistry and physics components of what needs to be done,” he says. The next step is to scale it. “We usually do simulations with several types of proteins. So instead of several, you will need maybe hundreds or thousands of different types of proteins, so it just brings more complexity.”

When that happens, it will be a huge revolution in biomedicine, he says. “Then lots of things that people laboriously spend years doing in the laboratory could just run on AWS servers. And you could do your experiments and search for treatments computationally, which would be much cheaper and faster.”
