AI Predicts Enzyme Function Better than Leading Tools

A new artificial intelligence (AI) tool can predict the functions of enzymes based on their amino acid sequences, even when the enzymes are unstudied or poorly understood. The researchers said the AI tool, dubbed CLEAN, outperforms the leading state-of-the-art tools in accuracy, reliability and sensitivity. Better understanding of enzymes and their functions would be a boon for research in genomics, chemistry, industrial materials, medicine, pharmaceuticals and more.

"Just like ChatGPT uses data from written language to create predictive text, we are leveraging the language of proteins to predict their activity," said study leader Huimin Zhao, a University of Illinois Urbana-Champaign professor of chemical and biomolecular engineering. "Almost every researcher, when working with a new protein sequence, wants to know right away what the protein does. In addition, when making chemicals for any application - biology, medicine, industry - this tool will help researchers quickly identify the proper enzymes needed for the synthesis of chemicals and materials."

The researchers will publish their findings in the journal Science and make CLEAN accessible online March 31.

With advances in genomics, many enzymes have been identified and sequenced, but scientists have little or no information about what those enzymes do, said Zhao, a member of the Carl R. Woese Institute for Genomic Biology at Illinois.

Other computational tools try to predict enzyme functions. Typically, they attempt to assign an enzyme commission number - an ID code that indicates what kind of reaction an enzyme catalyzes - by comparing a queried sequence with a catalog of known enzymes and finding similar sequences. However, these tools don’t work as well with less-studied or uncharacterized enzymes, or with enzymes that perform multiple jobs, Zhao said.

"We are not the first one to use AI tools to predict enzyme commission numbers, but we are the first one to use this new deep-learning algorithm called contrastive learning to predict enzyme function. We find that this algorithm works much better than the AI tools that are used by others," Zhao said. "We cannot guarantee everyone's product will be correctly predicted, but we can get higher accuracy than the other two or other three methods."

The researchers verified their tool experimentally with both computational and in vitro experiments. They found that not only could the tool predict the function of previously uncharacterized enzymes, it also corrected enzymes mislabeled by the leading software and correctly identified enzymes with two or more functions.

Zhao's group is making CLEAN accessible online for other researchers seeking to characterize an enzyme or determine whether an enzyme could catalyze a desired reaction.

"We hope that this tool will be used widely by the broad research community," Zhao said. "With the web interface, researchers can just enter the sequence in a search box, like a search engine, and see the results."

Zhao said the group plans to expand the AI behind CLEAN to characterize other proteins, such as binding proteins. The team also hopes to further develop the machine-learning algorithms so that a user could search for a desired reaction and the AI would point to a proper enzyme for the job.

"There are a lot of uncharacterized binding proteins, such as receptors and transcription factors. We also want to predict their functions as well," Zhao said. "We want to predict the functions of all proteins so that we can know all the proteins a cell has and better study or engineer the whole cell for biotechnology or biomedical applications."

The National Science Foundation supported this work through the Molecule Maker Lab Institute, an AI Research Institute Zhao leads.

Tianhao Yu, Haiyang Cui, Jianan Canal Li, Yunan Luo, Guangde Jiang, Huimin Zhao.
Enzyme function prediction using contrastive learning.
Science, 2023. doi: 10.1126/science.adf2465

Most Popular Now

Is AI in Medicine Playing Fair?

As artificial intelligence (AI) rapidly integrates into health care, a new study by researchers at the Icahn School of Medicine at Mount Sinai reveals that all generative AI models may...

Generative AI's Diagnostic Capabili…

The use of generative AI for diagnostics has attracted attention in the medical field and many research papers have been published on this topic. However, because the evaluation criteria were...

New System for the Early Detection of Au…

A team from the Human-Tech Institute-Universitat Politècnica de València has developed a new system for the early detection of Autism Spectrum Disorder (ASD) using virtual reality and artificial intelligence. The...

Diagnoses and Treatment Recommendations …

A new study led by Prof. Dan Zeltzer, a digital health expert from the Berglas School of Economics at Tel Aviv University, compared the quality of diagnostic and treatment recommendations...

AI Tool can Track Effectiveness of Multi…

A new artificial intelligence (AI) tool that can help interpret and assess how well treatments are working for patients with multiple sclerosis (MS) has been developed by UCL researchers. AI uses...

Dr Jason Broch Joins the Highland Market…

The Highland Marketing advisory board has welcomed a new member - Dr Jason Broch, a GP and director with a strong track record in the NHS and IT-enabled transformation. Dr Broch...

Surrey and Sussex Healthcare NHS Trust g…

Surrey and Sussex Healthcare NHS Trust has marked an important milestone in connecting busy radiologists across large parts of South East England, following the successful go live of Sectra's enterprise...

Multi-Resistance in Bacteria Predicted b…

An AI model trained on large amounts of genetic data can predict whether bacteria will become antibiotic-resistant. The new study shows that antibiotic resistance is more easily transmitted between genetically...

AI-Driven Smart Devices to Transform Hea…

AI-powered, internet-connected medical devices have the potential to revolutionise healthcare by enabling early disease detection, real-time patient monitoring, and personalised treatments, a new study suggests. They are already saving lives...

DMEA 2025 Ends with Record Attendance an…

8 - 10 April 2025, Berlin, Germany. DMEA 2025 came to a successful close with record attendance and an impressive program. 20,500 participants attended Europe's leading digital health event over the...

A Novel AI-Based Method Reveals How Cell…

Researchers from Tel Aviv University have developed an innovative method that can help to understand better how cells behave in changing biological environments, such as those found within a cancerous...