Machine Learning Promises to Accelerate Metabolism Research

A new study shows that it is possible to use machine learning and statistics to address a problem that has long hindered the field of metabolomics: large variations in the data collected at different sites.

"We don't always know the source of the variation," said Daniel Raftery, professor of anesthesiology and pain medicine at the University of Washington School of Medicine in Seattle. "It could be because the subjects are different with different genetics, diets and environmental exposures. Or it could be the way samples were collected and processed."

Raftery and his research colleagues wanted to see if machine learning - a form of artificial intelligence that uses computer algorithms to process large volumes of historical data and to identify data patterns - could reduce this variation between data from different sites without obscuring important differences.

"We wanted to bring these mismatched datasets together so the findings of different studies could be compared or combined for further analysis," Raftery said.

He led the project with Dabao Zhang and Min Zhang, formerly at Purdue University and now professors of epidemiology & biostatistics at University of California, Irvine Public Health. Danni Liu, a Ph.D. student at Purdue, was lead author of the paper, which appears in the Feb.12 issue of PNAS, the Proceedings of the National Academy of Sciences.

Raftery is an investigator at the UW Mitochondria and Metabolism Center, based at UW Medicine South Lake Union in Seattle.

The term metabolomics relates to metabolism, a word that describes chemical reactions our cells perform to maintain life. These include reactions that break down food to harvest energy and obtain the raw materials cells need for growth and repair, reactions that involve the assembly of cellular components needed for life, and reactions involved in the disassembly of damaged or unneeded components so they can be recycled, discarded or used as fuel.

The small chemicals produced by these metabolic processes are called metabolites. Metabolite levels reveal what chemical reactions are going on within a cell, tissue, organ or organism at a given moment and how those reactions may change over time.

Metabolomics is the study of metabolites and the processes that produce them.

This information helps medical scientists better understand not only how cells maintain normal function but also what might be going wrong when people fall ill. This knowledge could lead to new ways to diagnose, prevent and treat disease, Raftery said.

In the new study, the researchers built machine-learning models to identify factors that were driving the differences between datasets. The models accounted for demographic differences in the study populations, such as age and sex, and used the information contained in other metabolites to explain the observed differences.

The researchers found that their approach reduced the variation between datasets by more than 95% without obscuring meaningful differences, such as those that naturally occur between men and women.

"We've shown that our approach has the potential to reduce unwanted variance seen in metabolomic data while retaining metabolomic signals of interest," Raftery said.

The group plans to expand its studies with the aim of providing a deeper understanding of normal metabolism and identifying biomarkers of abnormal metabolism that can be a sign of disease.

Liu D, Nagana Gowda GA, Jiang Z, Alemdjrodo K, Zhang M, Zhang D, Raftery D.
Modeling blood metabolite homeostatic levels reduces sample heterogeneity across cohorts.
Proc Natl Acad Sci U S A. 2024 Feb 20;121(8):e2307430121. doi: 10.1073/pnas.2307430121

Most Popular Now

Using Data and AI to Create Better Healt…

Academic medical centers could transform patient care by adopting principles from learning health systems principles, according to researchers from Weill Cornell Medicine and the University of California, San Diego. In...

AI Medical Receptionist Modernizing Doct…

A virtual medical receptionist named "Cassie," developed through research at Texas A&M University, is transforming the way patients interact with health care providers. Cassie is a digital-human assistant created by Humanate...

Northern Ireland Completes Nationwide Ro…

Go-lives at Western and Southern health and social care trusts mean every pathology service is using the same laboratory information management system; improving efficiency and quality. An ambitious technology project to...

AI Tool Set to Transform Characterisatio…

A multinational team of researchers, co-led by the Garvan Institute of Medical Research, has developed and tested a new AI tool to better characterise the diversity of individual cells within...

AI Detects Hidden Heart Disease Using Ex…

Mass General Brigham researchers have developed a new AI tool in collaboration with the United States Department of Veterans Affairs (VA) to probe through previously collected CT scans and identify...

Human-AI Collectives Make the Most Accur…

Diagnostic errors are among the most serious problems in everyday medical practice. AI systems - especially large language models (LLMs) like ChatGPT-4, Gemini, or Claude 3 - offer new ways...

MHP-Net: A Revolutionary AI Model for Ac…

Liver cancer is the sixth most common cancer globally and a leading cause of cancer-related deaths. Accurate segmentation of liver tumors is a crucial step for the management of the...

Highland Marketing Announced as Official…

Highland Marketing has been named, for the second year running, the official communications partner for HETT Show 2025, the UK's leading digital health conference and exhibition. Taking place 7-8 October...

Groundbreaking TACIT Algorithm Offers Ne…

Researchers at VCU Massey Comprehensive Cancer Center have developed a novel algorithm that could provide a revolutionary tool for determining the best options for patients - both in the treatment...