AI Tool Offers Deep Insight into the Immune System

Researchers explore the human immune system by looking at the active components, namely the various genes and cells involved. But there is a broad range of these, and observations necessarily produce vast amounts of data. For the first time, researchers including those from the University of Tokyo built a software tool which leverages artificial intelligence to not only offer a more consistent analysis of these cells at speed but also categorizes them and aims to spot novel patterns people have not yet seen.

Our immune system is important - it’s impossible to imagine complex life existing without it. This system, comprising different kinds of cells, each playing a different role, helps to identify things that threaten our health, and take actions to defend us. They are both very effective, but also far from perfect; hence, the existence of diseases such as the notorious acquired immunodeficiency syndrome, or AIDS. And recent earth-shattering issues, such as the coronavirus pandemic, serve to highlight the importance of research around this intricate yet powerful system.

One key branch of research in immunology involves the identification of immune system components and ascertaining their function. Doing this through manual observation would be impossible due to the time it would take, and some automated tools exist, but have limitations around accuracy, consistency or flexibility. To this end, a team of researchers led by Professor Tatsuhiko Tsunoda from the University of Tokyo’s Department of Biological Sciences rose to the challenge and developed a system to boost immunology research.

"We present scHDeepInsight, an AI-based framework for rapidly and consistently identifying immune cells from the RNA of cells. Instead of viewing all cell types as unrelated, the system reflects the natural hierarchy of the immune system," said lead researcher Shangru Jia. "By turning cellular genetic profiles into images and applying a hierarchy-aware AI, known as a convolutional neural network, or CNN, it can distinguish both broad immune cell types and finer subtypes, and it can do so more consistently than previous attempts. In our benchmark, labeling about 10,000 cells only took a few minutes, whereas manual marker-based annotation can take many hours to days. In comparison with other automated methods, run time is in a similar range. The main advantages are the consistency of predictions across the hierarchy and the improved accuracy gained from incorporating hierarchical labels, rather than raw speed alone."

There are three main aspects to scHDeepInsight. Hierarchical learning, whereby the model mirrors the immune system’s ‘family tree,’ can distinguish both broad immune categories and finer subtypes. Image-based representation transforms gene data into 2D images so the CNN can capture subtle relationships between genes more effectively than by looking at tables of raw data. And analytics built into the system can highlight which genes contribute most to a behavior, and these can be checked against known markers to see how they align with past observations.

"A spreadsheet of gene numbers misses how genes relate to each other. When we map genes to pixels in an image so that related genes are placed nearby, the result is an image with meaningful structure. Image-recognition models such as CNNs are very good at detecting such patterns, allowing them to capture complex relationships between genes that are hard to learn from raw tables," said Jia. "The main challenge was balancing performance across both broad cell types and detailed subtypes, especially for rare cell populations. We addressed this by adapting the training process, so the model paid more attention to the categories that were harder to distinguish, reducing the risk of overlooking small but important subtypes."

scHDeepInsight is primarily a research tool rather than a full diagnostic system, partly due to its infancy, but mainly as the model is only trained on healthy cells. By applying it to patients’ samples, researchers can see where they deviate from a healthy baseline. Such deviations may provide clues for further study, but medical interpretation requires additional validation. So this development will aid in fundamental research throughout the field of immunology, but it might take time before descendants of scHDeepInsight find their way into diagnostic systems.

"Studies where immune changes are important, including cancer immunology, infections and autoimmune conditions, can benefit from more reliable cell labels. Since our model is trained on healthy immune cells, its immediate value is in providing a consistent healthy baseline for comparison. Disease-related shifts can then be measured relative to this baseline, but clinical interpretation requires validation in each context," said Jia. "Generalization and validation are key. Clinical samples are diverse, so the model must be tested across varied trials and protocols. Integration into clinical workflows, regulatory requirements for transparency and reproducibility are also essential before routine use. For research use today, scHDeepInsight is already available as a downloadable package - researchers can readily apply it in their own analyses. Broader validation and clinical integration remain goals for the future."

Work on scHDeepInsight has not finished. The team aims to improve its abilities and features, taking it beyond immune system-related cellular identification and into other biological domains. Ultimately, they hope to validate the system for use as a tool for clinical research by using precise immune system profiling to support studies of disease. And there’s also the matter of its capacity to spot novel cell types.

"For each cell, the model outputs probabilities at both the broad type and subtype levels. If confidence is high for the broad lineage but low for all known subtypes within that lineage, the cell may represent a potentially novel state. In test analyses of brain immune datasets, this probability pattern helped highlight regions that were rich in specialized microglia cells residing in the central nervous system," said Jia. "AI models reflect their training data. If a reference atlas is incomplete, some rare or context-specific populations can be misclassified or underrepresented. Predictions must therefore be interpreted with caution and validated experimentally. Our design emphasizes transparency to support careful, evidence-based use."

Jia S, Lysenko A, Boroevich KA, Sharma A, Tsunoda T.
scHDeepInsight: a hierarchical deep learning framework for precise immune cell annotation in single-cell RNA-seq data.
Brief Bioinform. 2025 Aug 31;26(5):bbaf523. doi: 10.1093/bib/bbaf523

Most Popular Now

AI Distinguishes Glioblastoma from Look-…

A Harvard Medical School–led research team has developed an AI tool that can reliably tell apart two look-alike cancers found in the brain but with different origins, behaviors, and treatments. The...

AI Body Composition Measurements can Pre…

Adiposity - or the accumulation of excess fat in the body - is a known driver of cardiometabolic diseases such as heart disease, stroke, type 2 diabetes, and kidney disease...

AI can Strengthen Pandemic Preparedness

How to identify the next dangerous virus before it spreads among people is the central question in a new Comment in The Lancet Infectious Diseases. In it, researchers discuss how...

'Future-Guided' AI Improves Se…

In the world around us, many things exist in the context of time: a bird’s path through the sky is understood as different positions over a period of time, and...

New AI Tool Scans Social Media for Hidde…

A new artificial intelligence tool can scan social media data to discover adverse events associated with consumer health products, according to a study published September 30th in the open-access journal...

Yousif's Story with Sectra and The …

Embarking on healthcare technology career after leaving his home as a refugee during his teenage years, Yousif is passionate about making a difference. He reflects on an apprenticeship in which...

New Antibiotic Targets IBD - and AI Pred…

Researchers at McMaster University and the Massachusetts Institute of Technology (MIT) have made two scientific breakthroughs at once: they not only discovered a brand-new antibiotic that targets inflammatory bowel diseases...

AI Tool Offers Deep Insight into the Imm…

Researchers explore the human immune system by looking at the active components, namely the various genes and cells involved. But there is a broad range of these, and observations necessarily...

Study Finds One-Year Change on CT Scans …

Researchers at National Jewish Health have shown that subtle increases in lung scarring, detected by an artificial intelligence-based tool on CT scans taken one year apart, are associated with disease...

Highland to Help Companies Seize 'N…

Health tech growth partner Highland has today revealed its new identity - reflecting a sharper focus as it helps health tech companies to find market opportunities, convince target audiences, and...

New AI Tools Help Scientists Track How D…

Artificial intelligence (AI) can solve problems at remarkable speed, but it’s the people developing the algorithms who are truly driving discovery. At The University of Texas at Arlington, data scientists...