AI System may Accelerate Search for Cancer Discoveries

Searching through the mountains of published cancer research could be made easier for scientists, thanks to a new AI system. The system, called LION LBD and developed by computer scientists and cancer researchers at the University of Cambridge, has been designed to assist scientists in the search for cancer-related discoveries. It is the first literature-based discovery system aimed at supporting cancer research. The results are reported in the journal Bioinformatics.

Global cancer research attracts massive amounts of funding worldwide, and the scientific literature is now so huge that researchers are struggling to keep up with it: critical hypothesis-generating evidence is now often discovered long after it was published.

Cancer is a complex class of diseases that are not completely understood and are the second-leading cause of death worldwide. Cancer development involves changes in numerous chemical and biochemical molecules, reactions and pathways, and cancer research is being conducted across a wide variety of scientific fields, which have variability in the way that they describe similar concepts.

"As a cancer researcher, even if you knew what you were looking for, there are literally thousands of papers appearing every day," said Professor Anna Korhonen, Co-Director of Cambridge's Language Technology Lab who led the development of LION LBD in collaboration with Dr Masashi Narita at Cancer Research UK Cambridge Institute and Professor Ulla Stenius at Karolinska Institutet in Sweden. "LION LBD uses AI to help scientists keep up-to-date with published discoveries in their field, but could also help them make new discoveries by combining what is already known in the literature by making connections between sources that may appear to be unrelated."

The 'LBD' in LION LBD stands for Literature-Based Discovery, a concept developed in the 1980s which seeks to make new discoveries by combing pieces of information from disconnected sources. The key idea behind the original version of LBD is that concepts that are never explicitly linked in the literature may be indirectly linked through intermediate concepts.

The design of the LION LBD system allows real-time search to discover indirect associations between entities in a database of tens of millions of publications while preserving the ability of users to explore each mention in its original context.

"For example, you may know that a cancer drug affects the behaviour of a certain pathway, but with LION LBD, you may find that a drug developed for a totally different disease affects the same pathway," said Korhonen.

LION LBD is the first system developed specifically for the needs of cancer research. It has a particular focus on the molecular biology of cancer and uses state-of-the-art machine learning and natural language processing techniques, in order to detect references to the hallmarks of cancer in the text. Evaluations of the system have demonstrated its ability to identify undiscovered links and to rank relevant concepts highly among potential connections.

The system is built using open data, open source and open standards, and is available as an interactive web-based interface or a programmable API.

The researchers are currently working on extending the scope of LION-LBD to include further concepts and relations. They are also working closely with cancer researchers to help and improve the technology for end users.

The system was developed in collaboration with University of Cambridge Language Technology Lab, Cancer Research UK Cambridge Institute, and Karolinska Institutet in Sweden, and was funded by the Medical Research Council.

Sampo Pyysalo, Simon Baker, Imran Ali, Stefan Haselwimmer, Tejas Shah, Andrew Young, Yufan Guo, Johan Högberg, Ulla Stenius, Masashi Narita, Anna Korhonen.
LION LBD: a literature-based discovery system for cancer biology.
Bioinformatics, doi: 10.1093/bioinformatics/bty845.

Most Popular Now

Is AI in Medicine Playing Fair?

As artificial intelligence (AI) rapidly integrates into health care, a new study by researchers at the Icahn School of Medicine at Mount Sinai reveals that all generative AI models may...

Generative AI's Diagnostic Capabili…

The use of generative AI for diagnostics has attracted attention in the medical field and many research papers have been published on this topic. However, because the evaluation criteria were...

New System for the Early Detection of Au…

A team from the Human-Tech Institute-Universitat Politècnica de València has developed a new system for the early detection of Autism Spectrum Disorder (ASD) using virtual reality and artificial intelligence. The...

AI Tool can Track Effectiveness of Multi…

A new artificial intelligence (AI) tool that can help interpret and assess how well treatments are working for patients with multiple sclerosis (MS) has been developed by UCL researchers. AI uses...

Diagnoses and Treatment Recommendations …

A new study led by Prof. Dan Zeltzer, a digital health expert from the Berglas School of Economics at Tel Aviv University, compared the quality of diagnostic and treatment recommendations...

Dr Jason Broch Joins the Highland Market…

The Highland Marketing advisory board has welcomed a new member - Dr Jason Broch, a GP and director with a strong track record in the NHS and IT-enabled transformation. Dr Broch...

Surrey and Sussex Healthcare NHS Trust g…

Surrey and Sussex Healthcare NHS Trust has marked an important milestone in connecting busy radiologists across large parts of South East England, following the successful go live of Sectra's enterprise...

Multi-Resistance in Bacteria Predicted b…

An AI model trained on large amounts of genetic data can predict whether bacteria will become antibiotic-resistant. The new study shows that antibiotic resistance is more easily transmitted between genetically...

DMEA 2025 Ends with Record Attendance an…

8 - 10 April 2025, Berlin, Germany. DMEA 2025 came to a successful close with record attendance and an impressive program. 20,500 participants attended Europe's leading digital health event over the...