More than 100,000 Unknown Viruses Have Been Discovered Using a New Computer Tool

Viruses are the largest known group of biological agents. Now, an international team of scientists with the participation of the Institute for Plant Molecular and Cellular Biology (IBMCP), a joint centre of the Universitat Politècnica de València (UPV) and the Spanish National Research Council (CSIC), has taken an important step towards understanding their diversity. This team has discovered more than 130,000 new RNA viruses (such as the SARS-CoV-2 coronavirus that is currently causing the COVID-19 pandemic) by using a new computer tool that analysed 5.7 million biological samples collected around the world over the last 15 years. This finding, published in the journal Nature, represents a tenfold increase in the number of viral RNA species described to date.

To carry out this analysis, the multidisciplinary team developed Serratus, a cloud computing infrastructure built on Amazon Web Services (AWS). Using a cluster of 22,500 computer processors (CPUs), it enabled massive searches for viral sequences in the petabytes (millions of gigabytes) of sequencing data available in public databases.

Detailed analysis of certain viral families led to the discovery of more than 30 new coronavirus species, including interesting examples in aquatic vertebrates such as fish and amphibians whose coronaviruses had a genome segmented into two fragments, a feature that has been described in other virus families but had not previously been detected in any coronavirus.

At the Institute for Plant Molecular and Cellular Biology, located in the Polytechnic City of Innovation, UPV scientists used Serratus to analyse the virus that causes human hepatitis D, a viral agent known as Delta that has a minimal genome and an unknown origin. This allowed Marcos de la Peña Rivero, a CSIC researcher at the IBMCP, to detect similar viruses in many other animals, including not only mammals and other vertebrates but also invertebrates. "Surprisingly, these viruses were also found in environmental samples collected from lakes and soils all over the world, and their hosts are unknown for the time being," reveals De la Peña.

Evolutionary connection between human and plant viruses in the environment

Moreover, environmental samples containing hepatitis D-like viruses revealed novel viral forms with ultra-compact genomes of minute size (only 300 bases, the chemical units that make up the genetic material). "This discovery allows us to propose a close evolutionary connection between viruses as distant as human hepatitis D and plant subviral agents called viroids," says the CSIC researcher.

Both the database of all the viruses obtained in the course of this study and the set of tools developed are freely and openly available (http://www.serratus.io). These tools can be of great use in characterising the diversity of all viruses existing on our planet and in preparing the world for possible new pandemics, the devastating consequences of which we are now suffering with emerging viral diseases such as COVID-19, caused by the SARS-CoV-2 coronavirus.

The IBMCP is the only Spanish scientific institution participating in this research, in which the Heidelberg Institute for Theoretical Studies and the Max Planck Institute for Biology (Germany), the Pasteur Institute (France), the University of St. Petersburg (Russia), the University of California, Berkeley (USA) and the University of British Columbia (Canada), among others, also take part.

Edgar RC, Taylor J, Lin V, Altman T, Barbera P, Meleshko D, Lohr D, Novakovsky G, Buchfink B, Al-Shayeb B, Banfield JF, de la Peña M, Korobeynikov A, Chikhi R, Babaian A.
Petabase-scale sequence alignment catalyses viral discovery.
Nature. 2022 Jan 26. doi: 10.1038/s41586-021-04332-2.
