News Release

Protein behavior can be predicted with simple maths

Mutations affect protein stability following remarkably simple rules, according to study in Nature

Peer-Reviewed Publication

Center for Genomic Regulation

GRB2-SH3

image: 

GRB2-SH3 is a protein with 34 amino acids, each which can be mutated into 19 different amino acids. This can result in 17 billion different combinations of the protein if only a single change allowed at each position.

view more 

Credit: Aina Martí Aranda/Wellcome Sanger Institute

In a study published today in the journal Nature, researchers from the Centre for Genomic Regulation (CRG) and the Wellcome Sanger Institute have discovered that mutations affect protein stability following remarkably simple rules. The discovery has profound implications for accelerating the development of new treatments for diseases or the design of new proteins with industrial applications.

Proteins are chains made up of twenty different types of smaller units called amino acids. A single mutation swaps one amino acid for another, changing the protein's shape. This can mark the difference between health and disease. Many diseases, including cancer and neurodegenerative disorders, are caused by more than one mutation in a protein.

Predicting how mutations alter a protein’s shape is critical for understanding their contribution to disease. However, with so many amino acids in a protein, there are an astronomical number of ways mutations can combine. Experimentally testing each possible combination to see how they affect a protein is practically impossible.

“There are 17 billion different combinations of a protein that is 34 amino acids in length with only a single change allowed at each position. If it took just one second to test a single combination, we’d need a total of 539 years to try them all. It’s not a feasible experiment,” says Aina Martí Aranda, co-author of the study who began the project at the CRG and is currently a PhD student at the Wellcome Sanger Institute in the UK.

As proteins get longer, the different number of combinations rises exponentially. For a protein one hundred amino acids long, there are more possible combinations than there are atoms in the entire universe. The vast majority of known proteins, especially those contributing to human disease, are much longer.

Even in this vast landscape of possibilities, research led by Dr. André Faure at the Centre for Genomic Regulation in Barcelona and ICREA Research Professor Ben Lehner, with dual affiliation between the CRG and the Wellcome Sanger Institute, have discovered that the impact of mutations on protein stability is more predictable than previously thought.

For years, there has been an underlying assumption that two mutations might interact with each other in unexpected ways, enhancing or suppressing each other’s effects. “The fear that two mutations interacting can unpredictably affect the whole structure made us use incredibly complex models,” says Martí Aranda.

The study found that while mutations do interact, it is a relatively rare occurrence, and the vast majority affect a protein independently of each other. “Our discovery turns an old understanding on its head, showing that the endless possibilities of protein mutations boil down to straightforward rules. We don't need supercomputers to predict a protein’s behaviour – just good measurements and simple maths will do,” says Dr. Lehner.

The researchers made the discovery by generating thousands of protein variants, each with different combinations of mutations that could produce functional proteins. They then tested the stability of the proteins, generating a vast amount of data on how each mutation and combination of mutations affect proteins. The experimental outcomes closely matched models which assume that the total effect of multiple mutations can be calculated by simply adding up the effects of each individual mutation.

The findings can help better understand and target genetic diseases. For example, some genetic disorders are caused by many mutations in one protein. Patients may have different combinations of mutations, making it challenging to predict disease severity and response to treatments.

With the new understanding that most mutations act independently, clinicians can find new ways of predicting how various mutation combinations affect a protein's stability and function. This can lead to more accurate prognoses and personalised treatment plans, improving patient outcomes.

The study can also lead to more efficient drug development. Some drugs correct misfolded proteins, such as in Alzheimer's disease, where the changing shape of amyloid-beta proteins form plaques in the brain. Researchers can now better predict which mutations are most destabilising and design molecules that specifically stabilise these regions.

The study also has implications for biotechnologists using protein design to tackle different types of problems. For example, some enzymes have the ability to break down plastics in the environment. Researchers could design new enzymes with enhanced activity and stability by adding beneficial mutations together.

While the discovery is a significant advance, the researchers raise some limitations in the study. For example, they did not capture more complex interactions involving three or more mutations. In some proteins, these higher-order interactions could significantly impact stability and are not predicted by simply adding up individual effects.

Also, while the findings can dramatically reduce the number of experiments needed, some level of experimental validation is still necessary to confirm predictions, especially for critical applications like drug development where there may be unforeseen effects or rare interactions that the models do not capture.


Disclaimer: AAAS and EurekAlert! are not responsible for the accuracy of news releases posted to EurekAlert! by contributing institutions or for the use of any information through the EurekAlert system.