Could ChatGPT Help or Hurt Scientific Research Articles?

Since its introduction to the public in November 2022, ChatGPT, an artificial intelligence system, has substantially grown in use, creating written stories, graphics, art and more with just a short prompt from the user. But when it comes to scientific, peer-reviewed research, could the tool be useful?

"Right now, many journals do not want people to use ChatGPT to write their articles, but a lot of people are still trying to use it," said Melissa Kacena, PhD, vice chair of research and a professor of orthopaedic surgery at the Indiana University School of Medicine. "We wanted to study whether ChatGPT is able to write a scientific article and what are the different ways you could successfully use it."

The researchers chose three topics (fractures and the nervous system, Alzheimer's disease and bone health, and COVID-19 and bone health) and prompted the subscription version of ChatGPT ($20/month) to create scientific articles about them. They took three approaches to the original draft of the articles: all human, all ChatGPT, or a combination of the two. The study is published in a compilation of 12 articles in a new, special edition of Current Osteoporosis Reports.

"The standard way of writing a review article is to do a literature search, write an outline, start writing, and then faculty members revise and edit the draft," Kacena said. "We collected data about how much time it takes for this human method and how much time it takes for ChatGPT to write and then for faculty to edit the different articles."

In the articles written only by ChatGPT, up to 70% of the references were wrong. But when the researchers used an AI-assisted approach with more human involvement, they saw more plagiarism, especially when they gave the tool more references up front. Overall, using AI reduced the time spent writing the articles but required more extensive fact-checking.

Another concern is ChatGPT's writing style. Even though the tool was prompted to use a higher level of scientific writing, its words and phrases were not necessarily at the level a reader would expect from a researcher.

"It was repetitive writing and even if it was structured the way you learn to write in school, it was scary to know there were maybe incorrect references or wrong information," said Lilian Plotkin, PhD, professor of anatomy, cell biology and physiology at the IU School of Medicine and coauthor on five of the papers.

Jill Fehrenbacher, PhD, associate professor of pharmacology and toxicology at the school and coauthor on nine of the papers, said she believes that even though many scientific journals do not want authors to use ChatGPT, many people still will, especially non-native English speakers.

"People may still write everything themselves, but then put it into ChatGPT to fix their grammar or help with their writing, so I think we need to look at how do we shepherd people in using it appropriately and even helping them?" Fehrenbacher said. "We hope to provide a guide for the scientific community so that if people are going to use it, here are some tips and advice."

"I think it’s here to stay, but we need to understand how we can use it in an appropriate manner that won’t compromise someone’s reputation or spread misinformation," Kacena said.

Faculty and students from several departments and centers across the IU School of Medicine were involved, including orthopaedic surgery; anatomy, cell biology and physiology; pharmacology and toxicology; radiology and imaging sciences; anesthesia; the Stark Neuroscience Research Institute; the Indiana Center for Musculoskeletal Health; and the IU School of Dentistry. Authors are also affiliated with the Richard L. Roudebush Veterans Affairs Medical Center in Indianapolis, Eastern Virginia Medical School in Norfolk, Virginia, and Mount Holyoke College in South Hadley, Massachusetts.

Kacena MA, Plotkin LI, Fehrenbacher JC.
The Use of Artificial Intelligence in Writing Scientific Review Articles.
Curr Osteoporos Rep. 2024 Jan 16. doi: 10.1007/s11914-023-00852-0
