AI Outperforms Humans in Standardized Tests of Creative Potential

Score another one for artificial intelligence. In a recent study, 151 human participants were pitted against ChatGPT-4 in three tests designed to measure divergent thinking, which is considered to be an indicator of creative thought.

Divergent thinking is characterized by the ability to generate a unique solution to a question that does not have one expected solution, such as "What is the best way to avoid talking about politics with my parents?" In the study, GPT-4 provided more original and elaborate answers than the human participants.

The study, "The current state of artificial intelligence generative language models is more creative than humans on divergent thinking tasks," was published in Scientific Reports and authored by U of A Ph.D. students in psychological science Kent F. Hubert and Kim N. Awa, as well as Darya L. Zabelina, an assistant professor of psychological science at the U of A and director of the Mechanisms of Creative Cognition and Attention Lab.

The three tests utilized were the Alternative Use Task, which asks participants to come up with creative uses for everyday objects like a rope or a fork; the Consequences Task, which invites participants to imagine possible outcomes of hypothetical situations, like "what if humans no longer needed sleep?"; and the Divergent Associations Task, which asks participants to generate 10 nouns that are as semantically distant as possible. For instance, there is not much semantic distance between "dog" and "cat" while there is a great deal between words like "cat" and "ontology."

Answers were evaluated for the number of responses, length of response and semantic difference between words. Ultimately, the authors found that "Overall, GPT-4 was more original and elaborate than humans on each of the divergent thinking tasks, even when controlling for fluency of responses. In other words, GPT-4 demonstrated higher creative potential across an entire battery of divergent thinking tasks."

This finding does come with some caveats. The authors state, "It is important to note that the measures used in this study are all measures of creative potential, but the involvement in creative activities or achievements are another aspect of measuring a person’s creativity." The purpose of the study was to examine human-level creative potential, not necessarily people who may have established creative credentials.

Hubert and Awa further note that "AI, unlike humans, does not have agency" and is "dependent on the assistance of a human user. Therefore, the creative potential of AI is in a constant state of stagnation unless prompted."

Also, the researchers did not evaluate the appropriateness of GPT-4 responses. So while the AI may have provided more responses and more original responses, human participants may have felt they were constrained by their responses needing to be grounded in the real world.

Awa also acknowledged that the human motivation to write elaborate answers may not have been high, and said there are additional questions about "how do you operationalize creativity? Can we really say that using these tests for humans is generalizable to different people? Is it assessing a broad array of creative thinking? So I think it has us critically examining what are the most popular measures of divergent thinking."

Whether the tests are perfect measures of human creative potential is not really the point. The point is that large language models are rapidly progressing and outperforming humans in ways they have not before. Whether they are a threat to replace human creativity remains to be seen. For now, the authors continue to see "Moving forward, future possibilities of AI acting as a tool of inspiration, as an aid in a person's creative process or to overcome fixedness is promising."

Hubert KF, Awa KN, Zabelina DL.
The current state of artificial intelligence generative language models is more creative than humans on divergent thinking tasks.
Sci Rep. 2024 Feb 10;14(1):3440. doi: 10.1038/s41598-024-53303-w

Most Popular Now

Airwave Healthcare Expands Team with Fra…

Patient stimulus technology provider Airwave Healthcare has appointed Francesca McPhail, who will help health and care providers achieve more from their media and entertainment systems for people receiving care. Francesca McPhail...

Scientists Use AI to Detect Chronic High…

Researchers at Klick Labs unveiled a cutting-edge, non-invasive technique that can predict chronic high blood pressure (hypertension) with a high degree of accuracy using just a person's voice. Just published...

ChatGPT Outperformed Trainee Doctors in …

The chatbot ChatGPT performed better than trainee doctors in assessing complex cases of respiratory disease in areas such as cystic fibrosis, asthma and chest infections in a study presented at...

Former NHS CIO Will Smart Joins Alcidion

A former national chief information officer for health and social care in England, Will Smart will join the Alcidion Group board in a global role from October. He will provide...

The Darzi Review: The NHS "Is in Se…

Lyn Whitfield, content director at Highland Marketing, takes a look at Lord Darzi's review of the NHS, immediate reaction, and next steps. The review calls for a "tilt towards technology...

SPARK TSL Appoints David Hawkins as its …

SPARK TSL has appointed David Hawkins as its new sales director, to support take-up of the SPARK Fusion infotainment solution by NHS trusts and health boards. SPARK Fusion is a state-of-the-art...

Can Google Street View Data Improve Publ…

Big data and artificial intelligence are transforming how we think about health, from detecting diseases and spotting patterns to predicting outcomes and speeding up response times. In a new study analyzing...

Healthcare Week Luxembourg: Second Editi…

1 - 2 October 2024, Luxembourg.Save the date: Healthcare Week Luxembourg is back on 1 and 2 October 2024 at Luxexpo The Box. Acclaimed last year by healthcare professionals from...

AI Products Like ChatGPT can Provide Med…

The much-hyped AI products like ChatGPt may provide medical doctors and healthcare professionals with information that can aggravate patients' conditions and lead to serious health consequences, a study suggests. Researchers considered...

One in Five UK Soctors use AI Chatbots

A survey led by researchers at Uppsala University in Sweden reveals that a significant proportion of UK general practitioners (GPs) are integrating generative AI tools, such as ChatGPT, into their...

Specially Designed Video Games may Benef…

In a review of previous studies, a Johns Hopkins Children's Center team concludes that some video games created as mental health interventions can be helpful - if modest - tools...

AI may Enhance Patient Safety

Generative artificial intelligence (genAI) uses hundreds of millions, sometimes billions, of data points to train itself to produce realistic and innovative outputs that can mimic human-created content. Its applications include...