Deep Machine Learning Completes Information about the Bioactivity of One Million Molecules

The Structural Bioinformatics and Network Biology laboratory, led by ICREA Researcher Dr. Patrick Aloy, has completed the bioactivity information for a million molecules using deep machine-learning computational models. It has also released a tool that predicts the biological activity of any molecule, even when no experimental data are available.

This new methodology is based on the Chemical Checker, the largest database of bioactivity profiles for drug-like molecules to date, developed by the same laboratory and published in 2020. The Chemical Checker collects information from 25 spaces of bioactivity for each molecule. These spaces are linked to the chemical structure of the molecule, the targets with which it interacts, or the changes it induces at the cellular or clinical level. However, this highly detailed information about the mechanism of action is incomplete for most molecules: for any given molecule there may be information for one or two spaces of bioactivity but not for all 25.
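As a rough illustration of what an incomplete profile looks like, the sketch below represents one molecule as a mapping from 25 bioactivity spaces to optional numeric vectors. The space labels (five levels, A to E, with five sublevels each), the vector values and the helper names are assumptions made for this example only; they are not taken from the laboratory's software.

```python
from typing import Dict, List, Optional

# Assumed labelling for illustration: 5 levels x 5 sublevels = 25 spaces.
LEVELS = "ABCDE"
SPACES: List[str] = [f"{level}{i}" for level in LEVELS for i in range(1, 6)]

# A profile maps each space to a numeric vector, or None when no
# experimental data exist for that space.
Profile = Dict[str, Optional[List[float]]]


def empty_profile() -> Profile:
    """Return a profile with every space marked as missing."""
    return {space: None for space in SPACES}


def missing_spaces(profile: Profile) -> List[str]:
    """List the spaces for which the molecule has no data."""
    return [space for space in SPACES if profile.get(space) is None]


def coverage(profile: Profile) -> float:
    """Fraction of the 25 spaces that have data."""
    return 1 - len(missing_spaces(profile)) / len(SPACES)


# Hypothetical molecule with data in only two spaces (values are made up).
profile = empty_profile()
profile["A1"] = [0.12, -0.40, 0.88]
profile["B4"] = [0.05, 0.31, -0.27]

print(f"Spaces with data: {len(SPACES) - len(missing_spaces(profile))} of {len(SPACES)}")
print(f"Coverage: {coverage(profile):.0%}")
```

In this toy case the molecule has data in two of the 25 spaces, which is the kind of gap the new models are meant to fill.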

With this new development, researchers integrate all the experimental information available with deep machine learning methods, so that all the activity profiles, from chemistry to clinical level, for all molecules can be completed.

"The new tool also allows us to forecast the bioactivity spaces of new molecules, and this is crucial in the drug discovery process as we can select the most suitable candidates and discard those that, for one reason or another, would not work," explains Dr. Aloy.

The software library is freely accessible to the scientific community at bioactivitysignatures.org and it will be regularly updated by the researchers as more biological activity data become available. With each update of experimental data in the Chemical Checker, artificial neural networks will also be revised to refine the estimates.

Predictions and reliability

The bioactivity data predicted by the model have a greater or lesser degree of reliability depending on various factors, including the volume of experimental data available and the characteristics of the molecule.

In addition to predicting aspects of activity at the biological level, the system developed by Dr. Aloy's team provides a measure of the degree of reliability of the prediction for each molecule. "All models are wrong, but some are useful! A measure of confidence allows us to better interpret the results and highlight which spaces of bioactivity of a molecule are accurate and in which ones an error rate can be contemplated," explains Dr. Martino Bertoni, first author of the work.
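The idea of reading predictions alongside a confidence measure can be sketched as follows. The 0-to-1 confidence scale, the threshold, the space labels and all names here are hypothetical and chosen for illustration; the article does not specify how the team's confidence score is defined or computed.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Prediction:
    space: str         # bioactivity space label (hypothetical)
    confidence: float  # assumed to lie between 0 (unreliable) and 1 (reliable)


def split_by_confidence(
    predictions: List[Prediction], threshold: float = 0.5
) -> Tuple[List[str], List[str]]:
    """Separate spaces whose predictions meet the threshold from the rest."""
    reliable = [p.space for p in predictions if p.confidence >= threshold]
    uncertain = [p.space for p in predictions if p.confidence < threshold]
    return reliable, uncertain


# Made-up confidence values for one molecule.
predictions = [
    Prediction("A1", 0.95),
    Prediction("C3", 0.40),
    Prediction("E5", 0.72),
]

reliable, uncertain = split_by_confidence(predictions, threshold=0.7)
print("Treat as reliable:", reliable)
print("Interpret with caution:", uncertain)
```

Where to place the threshold is a judgment call that depends on how costly a wrong prediction is for the application at hand.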

Testing the system with the IRB Barcelona compound library

To validate the tool, the researchers searched the IRB Barcelona compound library for good drug candidates to modulate the activity of SNAIL1, a cancer-related transcription factor whose activity is almost impossible to modulate through the direct binding of drugs (it is considered an 'undruggable' target). From a first set of 17,000 compounds, the deep machine learning models identified 131 whose predicted characteristics (in their dynamics, interaction with target cells and proteins, etc.) fit the target.

The ability of these compounds to degrade SNAIL1 was then tested experimentally, and for a high percentage of them the observed degradation capacity was consistent with what the models had predicted, thus validating the system.

This work was made possible by funding from the Government of Catalonia, the Spanish Ministry of Science and Innovation, the European Research Council, the European Commission, the State Research Agency and the European Regional Development Fund (ERDF).

Bertoni M, Duran-Frigola M, Badia-I-Mompel P, Pauls E, Orozco-Ruiz M, Guitart-Pla O, Alcalde V, Diaz VM, Berenguer-Llergo A, Brun-Heath I, Villegas N, de Herreros AG, Aloy P.
Bioactivity descriptors for uncharacterized chemical compounds.
Nat Commun. 2021 Jun 24;12(1):3932. doi: 10.1038/s41467-021-24150-4
