Strategic Communications and Marketing News Bureau

Researchers use new approach to predict protein function

A research team, led by biochemistry professor John A. Gerlt, was the first to use a computational approach to accurately predict a protein's function from its amino acid sequence.

A research team, led by biochemistry professor John A. Gerlt, was the first to use a computational approach to accurately predict a protein’s function from its amino acid sequence.

CHAMPAIGN, Ill. – In a paper published online this month in the journal Nature Chemical Biology, researchers report that they have developed a way to determine the function of some of the hundreds of thousands of proteins for which amino acid sequence data are available, but whose structure and function remain unknown.

The research team, led by University of Illinois biochemistry professor John A. Gerlt, is the first to use a computational approach to accurately predict a protein’s function from its amino acid sequence. Their “in silico” (computer-aided) predictions were validated in the laboratory by means of enzyme assays and X-ray crystallography.

The new approach involved searching databases of known proteins for those with amino acid sequences that had the greatest homology to the unknown proteins. The researchers then used the three-dimensional structures of the most closely matched known proteins in their analyses of protein function.

Using the structural data obtained from this homology modeling, the team performed computerized docking experiments to quickly evaluate whether the unknown proteins were likely to bind to any of a vast library of potential target molecules, or substrates. Determining which substrate binds to a given protein is vital to understanding the protein’s function.

“This study describes an integrated approach using experimental techniques, computational techniques and X-ray crystallography for predicting the function of a protein of previously unknown function,” Gerlt said.

These methods will speed the task of identifying the biological roles of some of the hundreds of thousands of proteins whose functions have not yet been discovered.

Homology modeled complex (in cyan) of the enolase enzyme, BC0371, with its substrate, N-succinyl-L-arginine, matches very closely the experimentally determined structure (in yellow) via X-ray crystallography.

Homology modeled complex (in cyan) of the enolase enzyme, BC0371, with its substrate, N-succinyl-L-arginine, matches very closely the experimentally determined structure (in yellow) via X-ray crystallography.

“Rather than trying to do (laboratory) experiments on 30,000 compounds to determine if they are substrates, with this approach you might do experiments on 10,” Gerlt said.

The study involved a family of proteins within the large and diverse enolase superfamily. Enolases are enzymes that catalyze the breakdown of glucose and related compounds into other molecules as needed for metabolism.

The enzymes within the enolase superfamily utilize similar reaction mechanisms to one another but catalyze different reactions, complicating the task of discovering their function. There are more than 3,000 proteins in the enolase superfamily, and a majority of them have not yet been fully – or accurately – characterized. (The new study also revealed that one family of enolase proteins had been misclassified.)

Gerlt and his colleagues expect that the computational approach they pioneered will help scientists more efficiently tackle the problem of understanding these – and other – unknown proteins.

“There are 4 1/2 million protein sequences in the sequence databanks, and maybe the functions are known for, can be assigned to, half of those with some reliability,” Gerlt said. “That tells you that there is a lot of biology to be discovered.”

The research team included scientists from the University of California, San Francisco, and the Albert Einstein College of Medicine.

This study was supported by the National Institute of General Medical Sciences at the National Institutes of Health.

John Gerlt is Gutgsell Chair professor of biochemistry, chemistry, biophysics and computational biology, and a professor of basic medical science.

Editor’s note: To reach John A. Gerlt, call 217- 244-7414; e-mail: j-gerlt@illinois.edu.

To view or subscribe to the RSS feed for Science News at Illinois, please go to: http://illinois.edu/lb/rss/608/text.xml.

Read Next

Life sciences Photo of Michael Ward standing in tall grass on a riverbank.

How are migrating wild birds affected by H5N1 infection in the U.S.?

Each spring, roughly 3.5 billion wild birds migrate from their warm winter havens to their breeding grounds across North America, eating insects, distributing plant seeds and providing a variety of other ecosystem services to stopping sites along the way. Some also carry diseases like avian influenza, a worry for agricultural, environmental and public health authorities. […]

Announcements Marcelo Garcia, professor of civil and environmental engineering at The Grainger College of Engineering.

Illinois faculty member elected to National Academy of Engineering

Champaign, Ill. — Marcelo Garcia, a professor of civil and environmental engineering in The Grainger College of Engineering, has been elected to the National Academy of Engineering.

Social sciences Male and female student embracing on the quad with flowering redbud tree and the ACES library in the background. Photo by Michelle Hassel

Dating is not broken, but the trajectories of relationships have changed

CHAMPAIGN, Ill. — According to some popular culture writers and online posts by discouraged singles lamenting their inability to find romantic partners, dating is “broken,” fractured by the social isolation created by technology, pandemic lockdowns and potential partners’ unrealistic expectations. Yet two studies of college students conducted a decade apart found that their ideas about […]

Strategic Communications and Marketing News Bureau

507 E. Green St
MC-426
Champaign, IL 61820

Email: stratcom@illinois.edu

Phone (217) 333-5010