AI Tool Successfully Responds to Patient Questions in Electronic Health Record

As part of a nationwide trend, many more of NYU Langone Health's patients during the pandemic started using electronic health record tools to ask their doctors questions, refill prescriptions, and review test results. Many patients’ digital inquiries arrived via a communications tool called In Basket, which is built into NYU Langone’s electronic health record (EHR) system, EPIC.

While physicians have always dedicated time to managing EHR messages, they saw a more than 30% annual increase in recent years in the number of messages received daily, according an article by Paul A. Testa, MD chief medical information officer at NYU Langone. Testa wrote that it is not uncommon for physicians to receive more than 150 In Basket messages per day. With health systems not designed to handle this kind of traffic, physicians ended up filling the gap, spending long hours after work sifting through messages. This burden is cited as a reason that half of physicians report burnout.

Now a new study, led by researchers at NYU Grossman School of Medicine, shows that an AI tool can draft responses to patients’ EHR queries as accurately as their human healthcare professionals, and with greater perceived "empathy." The findings highlight these tools’ potential to dramatically reduce physicians’ In Basket burden while improving their communication with patients, as long as human providers review AI drafts before they are sent.

NYU Langone Health has been testing the capabilities of generative artificial intelligence (genAI), in which computer algorithms develop likely options for the next word in any sentence based on how people have used words in context on the internet. A result of this next-word prediction is that genAI "chatbots" can reply to questions in convincing human-like language. NYU Langone in 2023 licensed "a private instance" of GPT4, the latest relative of the famous chatGPT chatbot, which let physicians experiment using real patient data while still adhering to data privacy rules.

Published online July 16 in JAMA Network Open, the new study examined GPT4-generated drafts to patient In Basket queries, and had primary care physicians compare them to the actual human responses to those messages.

"Our results suggest that chatbots could reduce the workload of care providers by enabling efficient and empathetic responses to patients' concerns," said lead study author William Small, MD, a clinical assistant professor in Department of Medicine at NYU Grossman School of Medicine. "We found that EHR-integrated AI chatbots that use patient-specific data can draft messages similar in quality to human providers."

For the study, sixteen primary care physicians rated 344 randomly assigned pairs of AI and human responses to patient messages on accuracy, relevance, completeness, and tone, and indicated if they would use the AI response as a first draft, or have to start from scratch in writing the patient message. The physicians did not know whether the responses they were reviewing were generated by humans or the AI tool (blinded study).

The research team found that the accuracy, completeness, and relevance of generative AI and human providers responses did not differ statistically. Generative AI responses outperformed human providers in terms of understandability and tone by 9.5%. Further, the AI responses were more than twice as likely (125 percent more likely) to be considered empathetic and 62% more likely to use language that conveyed positivity (potentially related to hopefulness) and affiliation ("we are in this together").

On the other hand, AI responses were also 38% longer and 31% more likely to use complex language, so further training of the tool is needed, the researchers say. While humans responded to patient queries at a 6th grade level, AI was writing at an 8th grade level, according to a standard measure of readability called the Flesch Kincaid score.

The researchers argued that use of private patient information by chatbots, rather than general internet information, better approximates how this technology would be used in the real world. Future studies will be needed to confirm whether private data specifically improved AI tool performance.

"This work demonstrates that the AI tool can build high-quality draft responses to patient requests," said corresponding author Devin Mann, MD, senior director of Informatics Innovation in NYU Langone Medical Center Information Technology (MCIT). "With this physician approval in place, GenAI message quality will be equal in the near future in quality, communication style, and usability, to responses generated by humans," added Mann, also a professor in the Departments of Population Health and Medicine.

Along with Drs. Small and Mann, study authors from NYU Langone Health were Beatrix Brandfield-Harvey, Zoe Jonassen, Soumik Mandal, Elizabeth Stevens, Vincent Major, Erin Lostraglio, Adam Szerencsy, Simon Jones, Yindalon Aphinyanaphongs, and Stephen Johnson. Also authors were Oded Nov in the NYU Tandon School of Engineering, and Batia Wiesenfeld of NYU Stern School of Business.

The study was funded by National Science Foundation grants 1928614 and 2129076) and Swiss National Science Foundation grants P500PS_202955 and P5R5PS_217714.

Small WR, Wiesenfeld B, Brandfield-Harvey B, Jonassen Z, Mandal S, Stevens ER, Major VJ, Lostraglio E, Szerencsy A, Jones S, Aphinyanaphongs Y, Johnson SB, Nov O, Mann D.
Large Language Model-Based Responses to Patients' In-Basket Messages.
JAMA Netw Open. 2024 Jul 1;7(7):e2422399. doi: 10.1001/jamanetworkopen.2024.22399

Most Popular Now

Open Medical Works with Moray's Dig…

Open Medical is working with the Digital Health & Care Innovation Centre’s Rural Centre of Excellence on a referral management plan, as part of a research and development scheme to...

Generative AI on Track to Shape the Futu…

Using advanced artificial intelligence (AI), researchers have developed a novel method to make drug development faster and more efficient. In a new paper, Xia Ning, lead author of the study and...

AI could Help Improve Early Detection of…

A new study led by investigators at the UCLA Health Jonsson Comprehensive Cancer Center suggests that artificial intelligence (AI) could help detect interval breast cancers - those that develop between...

Reorganisation, Consolidation, and Cuts:…

NHS England has been downsized and abolished. Integrated care boards have been told to change function, consolidate, and deliver savings. Trusts are planning big cuts. The Highland Marketing advisory board...

AI-Human Task-Sharing could Cut Mammogra…

The most effective way to harness the power of artificial intelligence (AI) when screening for breast cancer may be through collaboration with human radiologists - not by wholesale replacing them...

AI Tool Uses Face Photos to Estimate Bio…

Eyes may be the window to the soul, but a person's biological age could be reflected in their facial characteristics. Investigators from Mass General Brigham developed a deep learning algorithm...

Siemens Healthineers infection Control S…

Klinikum Region Hannover (KRH) has commissioned Siemens Healthineers to install infection control system (ICS) at the Klinikum Siloah hospital. The ICS aims to effectively tackle nosocomial infections and increase patient...

Philips Future Health Index 2025 Report …

Royal Philips (NYSE: PHG, AEX: PHIA), a global leader in health technology, today unveiled its 2025 Future Health Index U.S. report, "Building trust in healthcare AI," spotlighting the state of...

AI-Powered Precision: Unlocking the Futu…

A team of researchers from the Department of Diagnostic and Therapeutic Ultrasonography at the Tianjin Medical University Cancer Institute & Hospital, have published a review in Cancer Biology & Medicine...

AI Model Improves Delirium Prediction, L…

An artificial intelligence (AI) model improved outcomes in hospitalized patients by quadrupling the rate of detection and treatment of delirium. The model identifies patients at high risk for delirium and...

Building Trust in Artificial Intelligenc…

A new review, published in the peer-reviewed journal AI in Precision Oncology, explores the multifaceted reasons behind the skepticism surrounding artificial intelligence (AI) technologies in healthcare and advocates for approaches...

SALSA: A New AI Tool for the Automated a…

Investigators of the Vall d'Hebron Institute of Oncology's (VHIO) Radiomics Group, led by Raquel Perez-Lopez, have developed SALSA (System for Automatic Liver tumor Segmentation And detection), a fully automated deep...