Mapping the 'Dark Matter' of Human DNA

Researchers from ERIBA, Radboud UMC, XJTU, Saarland University, CWI and UMC Utrecht have made a big step towards a better understanding of the human genome. By identifying large DNA variants in 250 Dutch families, the researchers have clarified part of the "dark matter", the great unknown, of the human genome. These new data enable researchers from all over the world to study the DNA variants and use the results to better understand genetic diseases.

The findings were published on October 6 in the scientific journal "Nature Communications".

Although our knowledge of the human DNA is extensive, it is nowhere near complete. For instance, our knowledge of exactly which changes in our DNA are responsible for a certain disease is often insufficient. This is related to the fact that no two people have exactly the same DNA. Even the DNA molecules of identical twins have differences, which occur during their development and ageing. Some differences ensure that not everybody looks exactly alike, while others determine our susceptibility to particular diseases. Knowledge about the DNA variants can therefore tell us a lot about potential health risks and is a first step towards personalized medicine. Many small variants in the human genome - the whole of genetic information in the cell - have already been documented. Although it is known that larger structural variants play an important role in many hereditary diseases, these variants are also more difficult to detect and are, therefore, much less investigated.

By comparing the DNA of 250 healthy Dutch families with the reference DNA database the researchers were able to identify 1.9 million variants affecting multiple DNA 'letters'. These variants include large sections of DNA that have disappeared, moved or even appear out of nowhere. When this happens in the middle of a gene that encodes a certain protein, it is likely that the functionality of the gene, and thus the production of the protein, is compromised. However, large structural variants often occur just before or after the coding part of a gene. The effect of this type of variation is hard to predict.

In the paper two occasions are described in which an extra piece of DNA was found just outside the coding region of a gene. In these occasions the variants had a demonstrable effect on the gene regulation. This proves that even structural variants that occur outside the coding regions need to be monitored closely in future DNA screenings. The catalogue of variants provided by this research enables other scientists to predict the occurrence of large structural variants from the known profile of the smaller ones. This technique opens new possibilities for studying the effects of large structural changes in our genomes.

Additionally, the research resulted in the discovery of large parts of DNA that were not included in the genome reference. This "extra" DNA does contain parts that could be involved in the production of proteins. One of the extra pieces of DNA that was described in the paper is a new "ZNF" gene that has previously never been found in humans. Nevertheless it appears to be present in roughly half of the Dutch population. This particular gene is a member of the ZNF gene family that was known from the reference genomes of several species of apes. The new variant will now be added to the human reference database. Authors subsequently showed that this gene is also present in genomes of several other human populations, however its function remains unknown. The fact that these and other pieces of "dark matter" now have been placed on the genetic map enables scientists worldwide to study them and use the results to better understand human genetic diseases.

This study is part of the Genome of the Netherlands (GoNL) project. One of the main goals of the study is to map the genome of the Dutch population and all its variants. Several teams of bio- informaticians from different countries work continuously on the development of new algorithms for data analysis, as well as on innovative ways to combine existing algorithms. The result: an accurate representation of the genomes of the Dutch population and thereby a solid base for the personalised medicine of the future.

Hehir-Kwa JY, Marschall T, Kloosterman WP, Francioli LC, Baaijens JA, Dijkstra LJ, Abdellaoui A, Koval V, Thung DT, Wardenaar R, Renkens I, Coe BP, Deelen P, de Ligt J, Lameijer EW, van Dijk F, Hormozdiari F; Genome of the Netherlands Consortium., Uitterlinden AG, van Duijn CM, Eichler EE, de Bakker PI, Swertz MA, Wijmenga C, van Ommen GB, Slagboom PE, Boomsma DI, Schönhuth A, Ye K, Guryev V.
A high-quality human reference panel reveals the complexity and distribution of genomic structural variants.
Nat Commun. 2016 Oct 6;7:12989. doi: 10.1038/ncomms12989.

Most Popular Now

SPARK TSL Acquires Sentean Group

SPARK TSL is acquiring Sentean Group, a Dutch company with a complementary background in hospital entertainment and communication, and bringing its Fusion Bedside platform for clinical and patient apps to...

ChatGPT Extracts Data for Ischaemic Stro…

In an ischaemic stroke, an artery in the brain is blocked by blood clots and the brain cells can no longer be supplied with blood as a result. Doctors must...

Herefordshire and Worcestershire Health …

Herefordshire and Worcestershire Health and Care NHS Trust has successfully implemented Alcidion's Miya Precision platform to streamline bed management workflow across seven community hospitals in Worcestershire. The trust delivers community...

A Shortcut for Drug Discovery

For most human proteins, there are no small molecules known to bind them chemically (so called "ligands"). Ligands frequently represent important starting points for drug development but this knowledge gap...

New Horizon Europe Funding Boosts Europe…

The European Commission has announced the launch of new Horizon Europe calls, with a substantial funding pool of over €112 million. These calls are aimed primarily at pioneering projects in...

Cleveland Clinic Study Finds AI can Deve…

Cleveland Clinic researchers developed an artficial intelligence (AI) model that can determine the best combination and timeline to use when prescribing drugs to treat a bacterial infection, based solely on...

New AI-Technology Estimates Brain Age Us…

As people age, their brains do, too. But if a brain ages prematurely, there is potential for age-related diseases such as mild-cognitive impairment, dementia, or Parkinson's disease. If "brain age...

Radboud University Medical Center and Ph…

Royal Philips (NYSE: PHG, AEX: PHIA), a global leader in health technology, and Radboud University Medical Center have signed a hospital-wide, long-term strategic partnership that delivers the latest patient monitoring...

With Huge Patient Dataset, AI Accurately…

Scientists have designed a new artificial intelligence (AI) model that emulates randomized clinical trials at determining the treatment options most effective at preventing stroke in people with heart disease. The model...

GPT-4, Google Gemini Fall Short in Breas…

Use of publicly available large language models (LLMs) resulted in changes in breast imaging reports classification that could have a negative effect on patient management, according to a new international...

ChatGPT fails at heart risk assessment

Despite ChatGPT's reported ability to pass medical exams, new research indicates it would be unwise to rely on it for some health assessments, such as whether a patient with chest...

Study Shows ChatGPT Failed when Challeng…

With artificial intelligence (AI) poised to become a fundamental part of clinical research and decision making, many still question the accuracy of ChatGPT, a sophisticated AI language model, to support...