Garegin Papoian, the Monroe Martin Professor at the University of Maryland, is seen sitting at a desk with an open laptop in front of him. He has turned around in his seat to face the camera.
Garegin Papoian is the Monroe Martin Professor at the University of Maryland. Within his Papoian Lab, a theoretical physical chemistry group located at the university, his team is working toward developing fundamental molecular models of the whole cell, a concept still in its infancy
Courtesy of Garegin Papoian

Garegin Papoian’s quest to model an elusive class of proteins

With the support of an Amazon Research Award, Papoian’s team is deciphering the dynamics of intrinsically disordered proteins.

How do molecules come together and start to behave like a living system? This is the type of question that drives Garegin Papoian’s research. At the University of Maryland, where he is the Monroe Martin Professor, he has been focusing on computational modeling of biological molecules like proteins and DNA. Within his Papoian Lab, a theoretical physical chemistry group also located at the university, his team is also working toward developing fundamental molecular models of the whole cell, a concept still in its infancy.

Papoian’s path into science was determined early on. Growing up in Armenia, then a part of the Soviet Union, he went to a special school of physics and mathematics, where he was introduced to Science Olympiads. While in high school, he won the first place in the Republic of Armenia in separate Olympiads in chemistry, physics, mathematics and biology. “Science Olympiads were a big reason why I got drawn into science, in particular to chemistry and physics”, he says.

Because of his success in the competitions, he was invited to study at an advanced chemistry college in Moscow established specifically for Olympiad winners.

“I was 16,” he says, “but it was assumed that we already knew all university level chemistry. So, they would start immediately with a very high-level training.” The program included an internship in the United States, at the University of Kansas. From there he eventually enrolled as a graduate student at Cornell University, where he pursued his PhD in quantum chemistry, working under the Nobel Laureate, Roald Hoffmann.

During his postdoc, he turned to classical physics with a particular emphasis on biophysics. “I was interested in bringing concepts of physical chemistry to understand biological phenomena from the molecular perspective,” he says. “And my long-term career goal is to develop concepts both for proteins and cells.”

Predicting a protein’s shape

A protein is a large molecule essential to all living things. The sequence of amino acids that form a protein determines its three-dimensional structure. Each protein has a unique shape that dictates its function. Being able to predict what a protein structure looks like from its amino acid sequence has been a long-standing scientific challenge and one of the research interests of Papoian’s group, for which he received an AWS Machine Learning Research Award in 2018.

This animation shows the structure of a protein called linker histone H1
This animation shows the structure of a protein called linker histone H1, including its disordered tails, predicted by Papoian's team. "We discovered that interactions of those disordered tails with DNA help to structurally position H1 with respect to the nucleosome. In terms of the bigger picture, the H1-nucleosome interactions regulate epigenetic processes, determining for example which particular genes should be turned on or off,” says Papoian.

One of the applications of protein structure prediction is drug design. “When you design a drug, you need to know what the target looks like,” says Papoian. If you know that the target protein has a certain pocket, for example, you can develop a molecule that will fit nicely into that pocket. While identifying genes associated with diseases has become easier, the sequence of a gene doesn’t tell you what the protein expressed by it looks like, and experimental methods to determine the protein shape are lengthy and expensive.

IDPs ... are more like this crazy spaghetti. It's very hard to deal with them both experimentally and computationally.
Garegin Papoian

Even in the wake of DeepMind demonstrating that AlphaFold is capable of predicting protein structures with an unprecedented level of accuracy, challenges still remain.

It turns out that a large proportion of human proteins are not completely structured in neat three-dimensional shapes. These are called the intrinsically disordered proteins (IDPs). “They are much more dynamic and mostly never fall into a single structure,” says Papoian. “They are more like this crazy spaghetti. It's very hard to deal with them both experimentally and computationally because they are so elusive.” He notes that about a third of human proteins are like that, including many important disease-causing proteins.

Papoian’s AWS Machine Learning Research Award enabled his team to advance the development of a system that is better suited to simulating these proteins.

Tackling disordered proteins

For the past few years, Papoian Lab has been working with a protein modeling framework called AWSEM-MD (pronounces “awesome”), which stands for associative memory, water-mediated, structure and energy model — molecular dynamics. It has been developed jointly with Peter Wolynes, Papoian’s former postdoctoral advisor who is currently at Rice University and with whom he continued to collaborate over the years.

Using the AWS Machine Learning Research Award, Papoian and his colleagues developed AWSEM-IDP, an AWSEM branch specifically designed to simulate intrinsically disordered proteins.

This system uses a database of protein fragment structures obtained experimentally, for example, through nuclear magnetic resonance (NMR) spectroscopy — a technique that determines the structure and dynamics of proteins. "These fragments serve as structural memories that guide the IDP to undergo structural transformations that are informed by the experiment,” Papoian explains. “This allows simulating more realistic IDP dynamics.”

The fragment database may also contain structures from atomistic simulations — a type of simulation where every atom of a protein is present. “The reason why we prefer not to do those in general is that they’re very expensive, so we cannot do very big simulations. But we can do atomistic simulations of short fragments to give us good fragment memories, again improving the accuracy of IDP’s structural exploration in AWSEM simulations,” he says.

An IDP will prefer multiple structures, not just one.

“That's the key difference from regular proteins: IDPs are multi-faceted in essence. But they still prefer certain structures over others. And the AWSEM-IDP model allows you to correctly describe those preferences,” Papoian explained. This model was described in a 2018 article published at the Journal of Physical Chemistry B.

In another work published earlier this year that was supported by the AWS Machine Learning Award, Papoian and his colleagues applied AWSEM-IDP to study a protein called linker histone H1, which plays an essential role in regulating many important biological processes. This protein has two intrinsically disordered regions, parts of its structure that are not well folded and resemble two tails. Because they are disordered, it’s much harder to understand what they do and how they interact.

Proteins like linker histone H1 regulate histone complexes, which act like a spool around which the DNA wraps to create structures called nucleosomes. “In this paper, we used AWSEM-IDP to model the nucleosome with linker histone H1, in particular with these disordered tails. And that allowed us to understand how the linker histone and the nucleosome come together and interact, and what's the role of these disordered tails,” says Papoian. Understanding proteins’ interactions with nucleosomes may give important insights on epigenetics, which is one of Papoian Lab’s interests.

Future challenges

Because making sense of IDPs is such a difficult process, Papoian says that AWSEM-IDP is an ongoing program with room for improvement. “What we have currently works better in some classes of proteins, and not so much in others. So next we’ll explore what are the challenges for what we currently have in ASWEM-IDP and try to come up with new advances to overcome them.”

In addition to IDPs, Papoian Lab will also continue to pursue the use of deep learning for structure prediction of well-folded proteins. Although there is some conceptual overlap with AlphaFold, Papoian believes that AWSEM-MD is a powerful tool and has advantages to other approaches when it comes to molecular dynamics.

Proteins are not frozen objects. Some of them are well structured, but many are not structured at all, and they are dynamic and move and shape-shift incessantly.
Garegin Papoian

“Proteins are not frozen objects,” he says. “Some of them are well structured, but many are not structured at all, and they are dynamic and move and shape-shift incessantly. So, to understand how these proteins function, you must model their dynamics and that’s what AWSEM-MD can do best.”

Papoian thinks one exciting area to be explored in coming decades will be combining machine learning and physics to work on protein structure prediction, protein dynamics, multiprotein complexes, and epigenetics.

“There are lots of things that still remain to be understood in our models. And I think that probably neither physics nor machine learning by themselves can tackle them. But a program that brings them together in a productive way can be very powerful,” he said.

Modeling an entire cell

Another ambitious project that Papoian and his colleagues are pursuing is to develop a computational model of an entire cell. “We still don’t have a blueprint of a cell the way we have a blueprint of a car or a Boeing airplane.”

To do that, his group develops their own software from scratch.

Garegin Papoian: How do cells move? Chemistry meets mechanics

“We basically do the science, the physics, and biophysics of what is needed to model our cells. We derive the needed algorithms from scratch based on the laws of physics and chemistry and then we program that into a computer and run simulations on a supercomputer,” he explained. This has to be done at a single molecule resolution, he adds, meaning that they have to track every single molecule within a cell.

To achieve that, the Papoian Lab developed a model called MEDYAN.

“We can already model some number of proteins, the membrane, we can model rich chemistry. We have developed some of the fundamental chemistry and physics components of what needs to be done,” he says. The next step is to scale it. “We usually do simulations with several types of proteins. So instead of several, you will need maybe hundreds or thousands of different types of proteins, so it just brings more complexity.”

When that happens, it will be a huge revolution in biomedicine, he says. “Then lots of things that people laboriously spend years doing in the laboratory could just run on AWS servers. And you could do your experiments and search for treatments computationally, which would be much cheaper and faster.”

View from space of a connected network around planet Earth representing the Internet of Things.
Sign up for our newsletter

Research areas

Related content

US, WA, Seattle
Job summaryPrime Video is an industry leading, high-growth business and a critical driver of Amazon Prime subscriptions, which contribute to customer loyalty and lifetime value. Prime Video is a digital video streaming and download service that offers Amazon customers the ability to rent, purchase or subscribe to a huge catalog of videos. The Prime Video Economist team works on disruptive ideas in the Prime Video space.We are looking for a truly innovative Data Scientist to work on disruptive ideas within the Prime Video space. Examples of problem spaces you may be working on include video product pricing, ecosystem effects (how streaming affects rentals or purchases), and forecasting demand for new content on the platform.On our team you will work with a diverse scientific team including engineers and economists as well as other data scientist to build statistical models using world-class data systems and partner directly with the business to implement the solutions.Key job responsibilities· Implement code (Python, R, Scala, etc.) for analyzing data and building machine learning/econometric models to solve specific business problems. Work with software engineering teams to productionize algorithms where appropriate.· Lead the development of the scientific roadmap, guide and develop junior engineers in designing and implementing scientific solutions.· Translate analytic insights into concrete, actionable recommendations for business or product improvement. Develop and present these as reports to senior stakeholders with ranging levels of technical knowledge.· Create, enhance, and maintain technical documentation, and present to other scientists, engineers and business leaders.· Demonstrate thorough technical knowledge on feature engineering of massive datasets, effective exploratory data analysis, and model building to deliver accurate and effective business insights.· Innovate by researching, learning, and adapting new modeling techniques and procedures to existing business problems.· Manage and execute entire project from start to finish including problem solving, data gathering and manipulation, predictive modeling, and stakeholder engagement.
US, WA, Bellevue
Job summaryDo you enjoy solving challenging problems and driving innovations in research? Are you seeking for an environment with a group of motivated and talented scientists like yourself? Do you want to create scalable optimization models and apply machine learning techniques to guide real-world decisions? Do you want to play a key role in the future of Amazon transportation and operations? Come and join us at Amazon's Modeling and Optimization team (MOP).Key job responsibilitiesAn Applied Scientist in the Modeling and Optimization (MOP) team· provides analytical decision support to Amazon planning teams via applying advanced mathematical and statistical techniques.· collaborates effectively with Amazon internal business customers, and is their trusted partner· is proactive and autonomous in discovering and resolving business pain-points within a given scope· is able to identify a suitable level of sophistication in resolving the different business needs· is confident in leveraging existing solutions to new problems where appropriate and is independent in designing and implementing new solutions where needed· is aware of the limitations of his/her proposed solutions and is proactive in communicating them to the business, and advances the application of sciences towards Amazon business problems by bringing new methods, ideas, and practices to the team and scientific community.A day in the life· Your will be developing model-based optimization, simulation, and/or predictive tools to identify and evaluate opportunities to improve customer experience, network speed, cost, and efficiency of capital investment.· You will quantify the improvements resulting from the application of these tools and you will evaluate the trade-offs between potentially competing objectives.· You will develop good communication skills and ability to speak at a level appropriate for the audience, will collaborate effectively with fellow scientists, software development engineers, and product managers, and will deliver business value in a close partnership with many stakeholders from operations, finance, IT, and business leadership.About the team· At the Modeling and Optimization (MOP) team, we use mathematical optimization, algorithm design, statistics, and machine learning to improve decision-making capabilities across WW Operations and Amazon Logistics.· We focus on transportation topology, labor and resource planning for fulfillment centers (FC), routing science, visualization research, data science and development, and process optimization.· We create models to simulate, optimize, and control the fulfillment network with the objective of reducing cost while improving speed and reliability.· We support multiple business lanes, therefore maintain a comprehensive and objective view, coordinating solutions across organizational lines where possible.
US, WA, Seattle
Job summaryAt Amazon, we're working to be the most customer-centric company on earth. To get there, we need exceptionally talented, bright, result oriented, and driven people. Amazon is seeking a Data Scientist - Simulation to assist in designing and optimizing the fulfillment network concepts and process improvements using discrete event simulations for our World Wide Design Engineering Team. Successful candidates will be natural self-starters who have the drive to design, model, and simulate new fulfillment center concepts and processes. The Simulation Data Scientist will be expected to deep dive problems and drive relentlessly towards creative solutions. This individual needs to be comfortable interfacing and driving various functional teams and individuals at all levels of the organization in order to be successful. Perform process modelling and simulation using discrete event simulation software’s, process optimization, statistical data analysis, and Design of Experiments (DOE) etc. to drive decisions on process and designs. Need based remote work option is available.Responsibilities:· Lead system level complex Discrete Event Simulation (DES) projects to build , simulate, and optimize the fulfillment center operational process flow models using FlexSim, Demo 3D, AnyLogic or any other Discrete Event Simulation (DES) software packages· Understand process flows , analyze data, perform Design of Experiments and effectively represent in simulation model to achieve better correlation and process improvements· Manage multiple DES simulation projects and tasks simultaneously and effectively influence, negotiate, and communicate with internal and external business partners, contractors and vendors.· Facilitate process improvement initiatives among site operations, engineering, and corporate systems groups.· Utilize code (python or another object oriented language) for data analysis and modeling algorithms· Analyze historical data to identify trends and support decision making using Statistical Techniques· Lead and coordinate simulation efforts between internal teams and outside vendors to develop optimal solutions for the network, including equipment specification, material flow, process design, and site layout.· Deliver results according to project schedules and quality· Provide written and verbal presentations to share insights and recommendations to audiences of varying levels of technical sophistication.· Make technical trade-offs for long term/short-term needs considering challenges in business area by applying relevant data science disciplines, and interactions among systems.
US, WA, Seattle
Job summaryAmazon is seeking an outstanding Data Scientist to uncover key insights on how customers engage with live sports events on Prime Video globally. With prestigious US sporting matches on Prime Video from NFL’s Thursday Night Football, the WNBA, AVP, the New York Yankees, and the Seattle Sounders, as well as global events like the English Premiere League (UK), UEFA Champions League (Italy, Germany), Ligue 1 (France), US Open Tennis (UK), Roland Garros (France), Autumn Nations Cup Rugby (UK) and more, live sports are an integral and growing component of Prime Video. As our selection of events expands, the Prime Video Content Analytics team is looking to enable agile decision making on live sports by developing key insights into customer engagement with live sport and translating these insights into large scale predictive modeling and analytics solutions.Key job responsibilitiesYou will have the following responsibilities within the scope of our global Prime Video business:· Drive analytics in an uncharted field that is not only developing at a fast pace but also becoming increasingly important to the Prime Video business· Support the analytical needs of stakeholders in the sports, advertising, finance, and live events teams, inclusive of statistical inference, demand modeling, and feature engineering· Build profitability models for new sports rights and partner with finance on business use cases· Think outside the box to use novel data and methodological approaches· Create new metrics that effectively guide the business and deploy dashboards to surface them to senior leadership· Ensure that the quality and timeliness of analytic deliverables meet business expectationsAbout the teamThe Prime Video Content Analytics team uses machine learning, econometrics, and data science to optimize Amazon’s streaming-video catalogue, driving customer engagement and Prime member acquisition. We generate insights to guide Amazon’s digital-video strategy, and we provide direct support to the content-acquisition process. We use detailed customer behavioral data (e.g. streaming history) and detailed information about content (e.g. IMDb-sourced characteristics) to predict and understand what customers like to watch.
ES, M, Madrid
Job summaryAmazon is looking for creative Applied Scientists to tackle some of the most interesting problems on the leading edge of machine learning (ML), search, natural language processing (NLP), and related areas with our Amazon Books team. At Amazon Books we believe that books are not only needed to work, education and entertainment, but are also required for a healthy society. As such, we aim to create an unmatched book discovery experience for our customers worldwide. We enable customers to discover new books, authors and genres through sophisticated recommendation engines, smart search tools and through social interaction, and we need your help to keep innovating in this space.If you are looking for an opportunity to solve deep technical problems and build innovative solutions in a fast-paced environment working within a smart and passionate team, this might be the role for you. You will develop and implement novel algorithms and modeling techniques to advance the state-of-the-art in technology areas at the intersection of ML, search, NLP, and deep learning. You will innovate, help move the needle for applied research in these exciting areas and build cutting-edge and scalable technologies that enable delightful experiences for hundreds of millions of people.In this role you will:· Work collaboratively with other scientists and developers to design and implement scalable models for improving our customers' experience discovering and getting the most out of their books;· Have the opportunity to work with a variety of technologies in a variety of use cases;· Drive scalable solutions from the business to prototyping, production testing and through engineering directly to production;· Drive best practices on the team, deal with ambiguity and competing objectives, and mentor and guide other members to achieve their career growth potential.About the teamWe aspire to be experts at the forefront of AI, machine learning and data science and their application to books e-commerce to help engineering teams innovate for readers, authors and publishers.As an Applied Scientist, you'll help us translate customer problems into tractable technical problems, and find ways to solve them by combining your expertise and that of other scientists and team members. You will work with partner engineering and business teams to ensure solutions have a real impact.
US, WA, Seattle
Job summaryAre you inspired by building new technologies to benefit customers? Do you dream of being at the forefront of robotics and autonomous system technology? Would you enjoy working in a fast paced, highly collaborative, start-up like environment? If you answered yes to any of these then you've got to check out the Amazon Scout team.We’ve been hard at work developing a new, fully-electric delivery system – Amazon Scout – designed to get packages to customers using autonomous delivery devices. These devices were created by Amazon, are the size of a small cooler, and roll along sidewalks at a walking pace. We developed Amazon Scout at our research and development lab in Seattle, ensuring the devices can safely and efficiently navigate around pets, pedestrians and anything else in their path.The Amazon Scout team shares a passion for innovation using advanced technologies, a love of solving complex challenges, and a desire to delight customers. We're looking for people who like dealing with ambiguity, solving hard, large scale problems, and working in a startup like environment. To learn more about Amazon Scout, check out our Amazon Day One Blog here: http://amazon.com/scoutAs a part of the localization team you will:· Collaborate closely with engineers, applied researchers and hardware teams to develop computer vision and machine learning algorithms and software for robots.· Take responsibility for technical problem solving, including creatively meeting product objectives and developing best practices.· Interact with teammates in variety of roles to accomplish your goals· Identify and initiate investigations of new technologies, prototype and test solutions for product features, and design and validate designs that deliver an exceptional user experience.· Recruit, hire and develop other applied scientists.
US, WA, Bellevue
Job summaryThe People eXperience and Technology Central Science Team (PXTCS) uses economics, behavioral science, statistics, and machine learning to proactively identify mechanisms and process improvements which simultaneously improve Amazon and the lives, wellbeing, and the value of work to Amazonians. We are an interdisciplinary team that combines the talents of science and engineering to develop and deliver solutions that measurably achieve this goal.We are looking for economists who are able to work with business partners to hone complex problems into specific, scientific questions, and test those questions to generate insights. The ideal candidate will work with engineers and computer scientists to estimate models and algorithms on large scale data, design pilots and measure their impact, and transform successful prototypes into improved policies and programs at scale. We are looking for creative thinkers who can combine a strong technical economic toolbox with a desire to learn from other disciplines, and who know how to execute and deliver on big ideas as part of an interdisciplinary technical team.Ideal candidates will work closely with business partners to develop science that solves the most important business challenges. They will work in a team setting with individuals from diverse disciplines and backgrounds. They will serve as an ambassador for science and a scientific resource for business teams, so that scientific processes permeate throughout the HR organization to the benefit of Amazonians and Amazon. Ideal candidates will own the data analysis, modeling, and experimentation that is necessary for estimating and validating models. They will work closely with engineering teams to develop scalable data resources to support rapid insights, and take successful models and findings into production as new products and services. They will be customer-centric and will communicate scientific approaches and findings to business leaders, listening to and incorporate their feedback, and delivering successful scientific solutions.Key job responsibilitiesUse causal inference methods to evaluate the impact of policies on employee outcomes. Examine how external labor market and economic conditions impact Amazon's ability to hire and retain talent. Use scientifically rigorous methods to develop and recommend career paths for employees.A day in the lifeWork with teammates to apply economic methods to business problems. This might include identifying the appropriate research questions, writing code to implement a DID analysis or estimate a structural model, or writing and presenting a document with findings to business leaders. Our economists also collaborate with partner teams throughout the process, from understanding their challenges, to developing a research agenda that will address those challenges, to help them implement solutions.About the teamWe are a multidisciplinary team that combines the talents of science and engineering to develop innovative solutions to make Amazon Earth's Best Employer.
US, Virtual
Job summaryAmazon’s Global Reliability Team is seeking a Principal Research Scientist to help envision, design and build the next generation of predictive maintenance capabilities and inventory management optimization behind Amazon’s Fulfillment Centers, Transportation Services, and Global Specialty Fulfillment.Key job responsibilitiesThe Principal Research Scientist will partner with senior leadership to develop long term strategic products/solutions and will represent and advocate them to leaders in our organization and other partner organizations such as Amazon Fulfillment Technologies, Workplace Health and Safety, amongst others. They will interact with Amazon scholars and universities among other research institutions to ensure that our team and our senior executives are up to speed on important trends, tools and technologies and how they can be used to impact the business.A day in the lifeIn this role, you will participate and lead the brainstorming sessions and review other scientists’ research. They will actively participate in the science community through presenting their research at the internal and external conference. They will mentor senior scientists for their career development and growth and help the company to identify and acquire scientists with the right skillset.About the teamWe are seeking high-energy individuals that are passionate about working with real-time machine and sensor data to build automated systems aimed to improve equipment availability.This position is perfect for someone who has a deep and broad analytic background and is passionate about using mathematical modeling and statistical analysis to make a real difference. Experience in applied analytics is essential, and they should be familiar with modern tools for data science and business analysis. We are particularly interested in candidates with research background in reliability engineering, econometrics, statistical inference, and time series modeling.
US, MA, Cambridge
Job summaryAmazon Lab126 is an inventive research and development company that designs and engineers high-profile consumer electronics. Lab126 began in 2004 as a subsidiary of Amazon.com, Inc., originally creating the best-selling Kindle family of products. Since then, we have produced groundbreaking devices like Fire tablets, Fire TV and Amazon Echo. What will you help us create?The Role:We are looking for a high caliber Applied Scientist Lead to join our team. As part of the larger technology team working on new consumer technology, your work will have a large impact to hardware, internal software developers, ecosystem, and ultimately the lives of Amazon customers. In this role, you will:• Lead a team of talented audio scientists and SW developers to bring a new and innovative audio products and services to delight customers• Propose new research projects, get buy-in from stakeholders, plan and budget the project and lead the team for successful execution• Work closely with an inter-disciplinary product development team including outside partners to bring the prototype algorithm into commercialization• Mentor team on music/speech/acoustic processing technology development• Manage small team of world class scientists and SW engineers in audio• Take a big part in the mission to create earth's best employerBe a respectable team leader in an open and collaborative environment
US, MA, Boston
Job summaryAre you inspired by invention? Is problem solving through teamwork in your DNA? Do you like the idea of seeing how your work impacts the bigger picture? Answer yes to any of these and you’ll fit right in here at Amazon Robotics. We are a smart team of doers that work passionately to apply cutting edge advances in robotics and software to solve real-world challenges that will transform our customers’ experiences in ways we can’t even image yet. We invent new improvements every day. We are Amazon Robotics and we will give you the tools and support you need to invent with us in ways that are rewarding, fulfilling and fun.We seek a talented and motivated engineer to tackle broad challenges in system-level analysis. You will work in a small team to quantify system performance at scale and to expand the breadth and depth of our analysis (e.g. increase the range of software components and warehouse processes covered by our models, develop our library of key performance indicators, construct experiments that efficiently root cause emergent behaviors). You will engage with growing teams of software development and warehouse design engineers to drive evolution of the AR system and of the simulation engine that supports our work.This role is a 6 month co-op to join AR full time (40 hours/week) from July-December 2022. Come join us in North Reading, MA, or in our newly expanded innovation hub in Westborough, MA!Both campuses provide a unique opportunity for co-ops to have direct access to robotics testing labs and manufacturing facilities. Remote and hybrid flexibility is available for this role.