Strategic Communications and Marketing News Bureau

AI predicts enzyme function better than leading tools

CHAMPAIGN, Ill. — A new artificial intelligence tool can predict the functions of enzymes based on their amino acid sequences, even when the enzymes are unstudied or poorly understood. The researchers said the AI tool, dubbed CLEAN, outperforms the leading state-of-the-art tools in accuracy, reliability and sensitivity. Better understanding of enzymes and their functions would be a boon for research in genomics, chemistry, industrial materials, medicine, pharmaceuticals and more.

“Just like ChatGPT uses data from written language to create predictive text, we are leveraging the language of proteins to predict their activity,” said study leader Huimin Zhao, a University of Illinois Urbana-Champaign professor of chemical and biomolecular engineering. “Almost every researcher, when working with a new protein sequence, wants to know right away what the protein does. In addition, when making chemicals for any application – biology, medicine, industry – this tool will help researchers quickly identify the proper enzymes needed for the synthesis of chemicals and materials.”

The researchers published their findings in the journal Science and have made CLEAN accessible online.

With advances in genomics, many enzymes have been identified and sequenced, but scientists have little or no information about what those enzymes do, said Zhao, a member of the Carl R. Woese Institute for Genomic Biology at Illinois.

Other computational tools try to predict enzyme functions. Typically, they attempt to assign an enzyme commission number – an ID code that indicates what kind of reaction an enzyme catalyzes – by comparing a queried sequence with a catalog of known enzymes and finding similar sequences. However, these tools don’t work as well with less-studied or uncharacterized enzymes, or with enzymes that perform multiple jobs, Zhao said. 

“We are not the first one to use AI tools to predict enzyme commission numbers, but we are the first one to use this new deep-learning algorithm called contrastive learning to predict enzyme function. We find that this algorithm works much better than the AI tools that are used by others,” Zhao said. “We cannot guarantee everyone’s product will be correctly predicted, but we can get higher accuracy than the other two or other three methods.”

The researchers verified their tool experimentally with both computational and in vitro experiments. They found that not only could the tool predict the function of previously uncharacterized enzymes, it also corrected enzymes mislabeled by the leading software and correctly identified enzymes with two or more functions. 

Zhao’s group has made CLEAN accessible online for other researchers seeking to characterize an enzyme or determine whether an enzyme could catalyze a desired reaction.

“We hope that this tool will be used widely by the broad research community,” Zhao said. “With the web interface, researchers can just enter the sequence in a search box, like a search engine, and see the results.”

Zhao said the group plans to expand the AI behind CLEAN to characterize other proteins, such as binding proteins. The team also hopes to further develop the machine-learning algorithms so that a user could search for a desired reaction and the AI would point to a proper enzyme for the job. 

“There are a lot of uncharacterized binding proteins, such as receptors and transcription factors. We also want to predict their functions as well,” Zhao said. “We want to predict the functions of all proteins so that we can know all the proteins a cell has and better study or engineer the whole cell for biotechnology or biomedical applications.” 

The National Science Foundation supported this work through the Molecule Maker Lab Institute, an AI Research Institute Zhao leads. 

Zhao also is a U. of I. professor of bioengineering, of chemistry, and of biomedical and translational sciences in the Carle Illinois College of Medicine

Editor’s notes: To reach Huimin Zhao, email: zhao5@illinois.eduThe paper “Enzyme function prediction using contrastive learning” is available online. CLEAN is accessible online

DOI: 10.1126/science.adf2465



This article was imported from a previous version of the News Bureau website. Please email news@illinois.edu to report missing photos and/or photo credits.

Read Next

Health and Medicine Sara Gerke, the Richard W. & Marie L. Corman Scholar at the University of Illinois Urbana-Champaign and expert in legal issues surrounding cutting-edge medical developments.

New paper urges caution as FDA plans to phase out animal testing in drug development

Replacing animal testing with alternate methodologies in preclinical drug trials holds potential for the development of cheaper, safer pharmaceuticals, but such a novel approach needs to be implemented judiciously and with caution.

Humanities Diptych image with the book cover of "Indifferent Cities" and a photo of Angel Garcia.

Illinois English professor explores his lost family history in new poetry collection

CHAMPAIGN, Ill. — Poet Ángel García examines his disrupted family lineage in his new collection of poetry, seeking answers about where he came from and trying to fill the many gaps in his family’s story. “Indifferent Cities” is the second book by García, a University of Illinois Urbana-Champaign English professor. The book was the winner […]

Arts Photo of a woman in a long, pink-colored dress posing onstage with a man aiming a box fan at her.

January Dance performances continue celebration of Black dance

CHAMPAIGN, Ill. — The January Dance performance of the University of Illinois Urbana-Champaign dance department features works that are inspired by supermodels, reflect the struggle for racial justice and imagine the spell a group of women are conjuring in a forest. January Dance is Jan. 29-31 at Krannert Center for the Performing Arts. It is […]

Strategic Communications and Marketing News Bureau

507 E. Green St
MC-426
Champaign, IL 61820

Email: stratcom@illinois.edu

Phone (217) 333-5010