AI Model Proactively Predicts if a COVID-19 Test might be Positive or Not

COVID-19 and its latest Omicron strains continue to cause infections globally. Serology (blood) and molecular tests are the two most commonly used methods for rapid COVID-19 testing. Because COVID-19 tests use different mechanisms, they vary significantly. Molecular tests measure the presence of viral SARS-CoV-2 RNA while serology tests detect the presence of antibodies triggered by the SARS-CoV-2 virus.

Currently, there is no existing study on the correlation between serology and molecular tests and which COVID-19 symptoms play a key role in producing a positive test result. A study from Florida Atlantic University's College of Engineering and Computer Science using machine learning provides important new evidence in understanding how molecular tests versus serology tests are correlated, and what features are the most useful in distinguishing between COVID-19 positive versus test outcomes.

Researchers from the College of Engineering and Computer Science trained five classification algorithms to predict COVID-19 test results. They created an accurate predictive model using easy-to-obtain symptom features, along with demographic features such as number of days post-symptom onset, fever, temperature, age and gender.

The study demonstrates that machine-learning models, trained using simple symptom and demographic features, can help predict COVID-19 infections. Results, published in the journal Smart Health, identify the key symptom features associated with COVID-19 infection and provide a way for rapid screening and cost effective infection detection.

Findings reveal that number of days experiencing symptoms such as fever and difficulty breathing play a large role in COVID-19 test results. Findings also show that molecular tests have much narrower post-symptom onset days (between three to eight days), compared to post-symptom onset days of serology tests (between five to 38 days). As a result, the molecular test has the lowest positive rate because it measures current infection.

Furthermore, COVID-19 tests vary significantly, partially because donors’ immune response and viral load - the target of different test methods - continuously change. Even for the same donor, it might be possible to observe different positive/negative results from two types of tests.

"Molecular tests depend on viral load and serology tests depend on seroconversion, which is the period during which the body starts producing detectable levels of antibodies. Both of these tests are time dependent," said Xingquan "Hill" Zhu, Ph.D., senior author and a professor in FAU's Department of Electrical Engineering and Computer Science. "Our results suggest that the number of days post symptomatic are highly important for a positive COVID-19 test and should be under careful consideration when screening patients."

For the study, researchers used test results from 2,467 donors, each tested using one or multiple types of COVID-19 tests, which were collected as the testbed. They combined symptoms and demographic information to design a set of features for predictive modeling using the five types of machine-learning models. By cross checking test types and results, they examined the correlation between serology and molecular tests. For test outcome prediction, they labeled the 2,467 donors as positive or negative by using their serology or molecular test results, and created symptom features to represent each donor for machine learning.

"Because COVID-19 produces a wide range of symptoms and the data collection process is essentially error prone, we grouped similar symptoms into bins," said Zhu. "Without a standardization of symptom reporting, the symptom feature space greatly increases. To combat this, we utilized this binning approach, which was able to decrease symptom feature space while keeping sample feature information."

By using created bin features, combined with the five machine-learning algorithms, these predictive models achieved more than 81 percent AUC scores (Area under the ROC Curve, which provides an aggregate measure of performance across all possible classification thresholds), and more than 76 percent classification accuracy.

"One unique feature of our testbed is that some donors may have multiple test results, which allowed us to analyze the relationship between serology tests versus molecular tests, and also understand consistency within each type of test," said Zhu.

The five machine learning models used by the researchers are Random Forest, XGBoost, Logistic Regression, Support Vector Machine (SVM) and Neural Network. They compared performance by using three performance metrics: Accuracy, F1-score and AUC.

"Predictive modeling is complicated by many puzzling questions unanswered by research. The testbed created by our researchers is indeed novel and clearly shows correlation between different types of COVID-19 tests," said Stella Batalama, Ph.D., dean, FAU College of Engineering and Computer Science. "Our researchers have designed a new way to narrow down noisy symptom features for clinical interpretation and predictive modeling. Such AI based predictive modeling approaches are becoming increasingly powerful to combat infectious diseases and many other aspects of health issues."

Elkin ME, Zhu X.
A machine learning study of COVID-19 serology and molecular tests and predictions.
Smart Health (Amst). 2022 Dec;26:100331. doi: 10.1016/j.smhl.2022.100331

Most Popular Now

Using Data and AI to Create Better Healt…

Academic medical centers could transform patient care by adopting principles from learning health systems principles, according to researchers from Weill Cornell Medicine and the University of California, San Diego. In...

AI Medical Receptionist Modernizing Doct…

A virtual medical receptionist named "Cassie," developed through research at Texas A&M University, is transforming the way patients interact with health care providers. Cassie is a digital-human assistant created by Humanate...

Northern Ireland Completes Nationwide Ro…

Go-lives at Western and Southern health and social care trusts mean every pathology service is using the same laboratory information management system; improving efficiency and quality. An ambitious technology project to...

AI Tool Set to Transform Characterisatio…

A multinational team of researchers, co-led by the Garvan Institute of Medical Research, has developed and tested a new AI tool to better characterise the diversity of individual cells within...

AI Detects Hidden Heart Disease Using Ex…

Mass General Brigham researchers have developed a new AI tool in collaboration with the United States Department of Veterans Affairs (VA) to probe through previously collected CT scans and identify...

Human-AI Collectives Make the Most Accur…

Diagnostic errors are among the most serious problems in everyday medical practice. AI systems - especially large language models (LLMs) like ChatGPT-4, Gemini, or Claude 3 - offer new ways...

MHP-Net: A Revolutionary AI Model for Ac…

Liver cancer is the sixth most common cancer globally and a leading cause of cancer-related deaths. Accurate segmentation of liver tumors is a crucial step for the management of the...

Highland Marketing Announced as Official…

Highland Marketing has been named, for the second year running, the official communications partner for HETT Show 2025, the UK's leading digital health conference and exhibition. Taking place 7-8 October...

Groundbreaking TACIT Algorithm Offers Ne…

Researchers at VCU Massey Comprehensive Cancer Center have developed a novel algorithm that could provide a revolutionary tool for determining the best options for patients - both in the treatment...

The Many Ways that AI Enters Rheumatolog…

High-resolution computed tomography (HRCT) is the standard to diagnose and assess progression in interstitial lung disease (ILD), a key feature in systemic sclerosis (SSc). But AI-assisted interpretation has the potential...