"Self-Taught" AI Tool Helps to Diagnose and Predict Severity of Common Lung Cancer

A computer program based on data from nearly a half-million tissue images and powered by artificial intelligence (AI) can accurately diagnose cases of adenocarcinoma, the most common form of lung cancer, a new study shows.

Researchers at NYU Langone Health's Perlmutter Cancer Center and the University of Glasgow developed and tested the program. They say that because it incorporates structural features of tumors from 452 adenocarcinoma patients, who are among the more than 11,000 patients in the United States National Cancer Institute's Cancer Genome Atlas, the program offers an unbiased, detailed, and reliable second opinion for patients and oncologists about the presence of the cancer and the likelihood and timing of its return (prognosis).

The research team also points out that the program is independent and "self-taught," meaning that it determined on its own which structural features were statistically most significant to gauging the severity of disease and had the greatest impact on tumor recurrence.

Publishing in the journal Nature Communications online June 11, the study program, also called an algorithm, or specifically, histomorphological phenotype learning (HPL), was found to accurately distinguish between similar lung cancers, adenocarcinoma and squamous cell cancers, 99% of the time. The HPL program was also found to be 72% accurate at predicting the likelihood and timing of cancer’s return after therapy, bettering the 64% accuracy in the predictions made by pathologists who directly examined the same patients’ tumor images, researchers say.

"Our new histomorphological phenotype learning program has the potential to offer cancer specialists and their patients a quick and unbiased diagnostic tool for lung adenocarcinoma that, once further testing is complete, can also be used to help validate and even guide their treatment decisions," said study lead investigator Nicolas Coudray, PhD, a bioinformatics programmer at NYU Grossman School of Medicine and Perlmutter Cancer Center.

"Patients, physicians, and researchers know they can rely on this predictive modeling because it is self-taught, provides explainable decisions, and is based only on the knowledge drawn specifically from each patient's tissue, including such features as its proportion of dying cells, tumor-fighting immune cells, and how densely packed the tumor cells are, among other features," said Coudray.

"Lung tissue samples can now be analyzed in minutes by our computer program to provide fairly accurate predictions of whether their cancer will return, predictions that are better than current standards of care for making a prognosis in lung adenocarcinoma," said study co-senior investigator Aristotelis Tsirigos, PhD. Tsirigos is a professor in the Departments of Pathology and Medicine at NYU Grossman School of Medicine and Perlmutter Cancer Center, where he also serves as co-director of precision medicine and director of its Applied Bioinformatics Laboratories.

Tsirigos says that thanks to such tools and other advances in the lung cancer biology, pathologists will be examining tissue scans on their computer screens, and less and less on microscopes, and then using their AI program to analyze the image and produce its own image of the scan. The new image, or "landscape," they add, will offer a detailed breakdown of the tissue’s content. It might note, for example, that there is 5% necrosis and 10% tumor infiltration and what that means in terms of survival. That reading may statistically equate to an 80% chance of remaining cancer-free for two years or more, based on information from all the patient data in the program.

To develop the HPL program, the researchers first analyzed lung adenocarcinoma tissue slides from the Cancer Genome Atlas. Adenocarcinoma was chosen for the test model because the disease is known for characteristic features. As an example, they note that its tumor cells tend to group in so-called acinar, or saclike patterns and spread predictably along the surface lining of lung cells.

From their analysis of the slides, whose visual images were digitally scanned and broken into 432,231 small quadrants or tiles, researchers found 46 key characteristics, what they term histomorphological phenotype clusters, from both normal and diseased tissue, a subset of which were statistically linked to either cancer’s early return or to long-term survival. The findings were then confirmed by further and separate testing on tissue images from 276 men and women who were treated for adenocarcinoma at NYU Langone from 2006 to 2021.

Researchers say their goal is to use the HPL algorithm to assign to each patient a score between 0 and 1 that reflects their statistical chance of survival and tumor recurrence for up to five years. Because the program is self-learning, they stress HPL will become increasingly more accurate as more data is added over time. To build public trust, researchers have posted their programming code online and have plans to make the new HPL tool freely available upon completion of further testing.

Characteristics linked to tumors recurring included high tile percentages of dead cancer cells and tumor-fighting immune cells called lymphocytes, and the dense clustering of tumor cells in the outer linings of the lungs. Features tied to increased likelihood for survival were high percentages of unchanged or preserved lung sac tissue, and lack of or mild presence of inflammatory cells.

Tsirigos says the team next plans to look at developing HPL-like programs for other cancers, such as breast, ovarian, and colorectal, that are similarly based on distinctive and key morphological features and additional molecular data. The team also has plans to expand and improve the accuracy of the current adenocarcinoma HPL program by including other data from hospital electronic health records about other illnesses and diseases, or even income and home ZIP code.

Funding support for the new study was provided by National Institutes of Health grant P30CA016087, United Kingdom Research Council grants Ep/R018634/1 and BB/V016067/1, and European Union Horizon 2020 grant no. 101016851.

Claudio Quiros A, Coudray N, Yeaton A, Yang X, Liu B, Le H, Chiriboga L, Karimkhan A, Narula N, Moore DA, Park CY, Pass H, Moreira AL, Le Quesne J, Tsirigos A, Yuan K.
Mapping the landscape of histomorphological cancer phenotypes using self-supervised learning on unannotated pathology slides.
Nat Commun. 2024 Jun 11;15(1):4596. doi: 10.1038/s41467-024-48666-7

Most Popular Now

Using Data and AI to Create Better Healt…

Academic medical centers could transform patient care by adopting principles from learning health systems principles, according to researchers from Weill Cornell Medicine and the University of California, San Diego. In...

AI Medical Receptionist Modernizing Doct…

A virtual medical receptionist named "Cassie," developed through research at Texas A&M University, is transforming the way patients interact with health care providers. Cassie is a digital-human assistant created by Humanate...

Northern Ireland Completes Nationwide Ro…

Go-lives at Western and Southern health and social care trusts mean every pathology service is using the same laboratory information management system; improving efficiency and quality. An ambitious technology project to...

AI Tool Set to Transform Characterisatio…

A multinational team of researchers, co-led by the Garvan Institute of Medical Research, has developed and tested a new AI tool to better characterise the diversity of individual cells within...

AI Detects Hidden Heart Disease Using Ex…

Mass General Brigham researchers have developed a new AI tool in collaboration with the United States Department of Veterans Affairs (VA) to probe through previously collected CT scans and identify...

Human-AI Collectives Make the Most Accur…

Diagnostic errors are among the most serious problems in everyday medical practice. AI systems - especially large language models (LLMs) like ChatGPT-4, Gemini, or Claude 3 - offer new ways...

MHP-Net: A Revolutionary AI Model for Ac…

Liver cancer is the sixth most common cancer globally and a leading cause of cancer-related deaths. Accurate segmentation of liver tumors is a crucial step for the management of the...

Highland Marketing Announced as Official…

Highland Marketing has been named, for the second year running, the official communications partner for HETT Show 2025, the UK's leading digital health conference and exhibition. Taking place 7-8 October...

Groundbreaking TACIT Algorithm Offers Ne…

Researchers at VCU Massey Comprehensive Cancer Center have developed a novel algorithm that could provide a revolutionary tool for determining the best options for patients - both in the treatment...

The Many Ways that AI Enters Rheumatolog…

High-resolution computed tomography (HRCT) is the standard to diagnose and assess progression in interstitial lung disease (ILD), a key feature in systemic sclerosis (SSc). But AI-assisted interpretation has the potential...