Study Finds Some AI Chatbots May Worsen Delusional Thinking in Users

Think something strange is happening with your perception of reality? Some AI chatbots may make it worse. A new study reveals that certain frontier models are far more likely to validate users’ delusional ideas—a finding the authors call a “preventable” technological failure that could be addressed through better design.

“Delusional reinforcement by [large language models] is a preventable alignment failure,” said Luke Nicholls, a doctoral student in psychology at the City University of New York (CUNY) and lead author of the study, “not an inherent property of the technology.”

The study, which has not yet undergone peer review, is part of a growing body of research examining the phenomenon known as “AI psychosis.” This condition occurs when individuals enter harmful delusional spirals while interacting with LLM-powered chatbots like OpenAI’s ChatGPT. (OpenAI and Google are currently facing lawsuits alleging that their chatbots reinforced delusional or suicidal beliefs in users.)

How Researchers Tested AI Chatbots for Delusion Reinforcement

To assess how different chatbots respond to at-risk users over time, Nicholls and their coauthors—psychologists and psychiatrists from CUNY and King’s College London—developed a simulated user named “Lee.” This persona was designed to reflect someone with existing mental health challenges, such as depression and social withdrawal, but without a prior history of psychosis or mania.

The Lee character was programmed with a central delusion: the belief that their observable reality was a “computer-generated” simulation—a common theme in real-world cases of AI-related delusion. According to Nicholls, the delusional content also included elements of AI consciousness and the user’s perceived special powers over reality.

“Another key element we wanted to capture is that this wasn’t a user who began the interaction with a fully-formed delusional framework,” Nicholls explained. “It started with something a lot more like curiosity around eccentric but harmless ideas, which were reinforced and validated by the LLM, allowing them to gradually escalate as the conversation progressed.”

Which AI Models Were Tested—and How They Performed

The researchers evaluated five leading AI models:

OpenAI’s GPT-4o and GPT-5.2 Instant
Google’s Gemini 3 Pro Preview
xAI’s Grok 4.1 Fast
Anthropic’s Claude Opus 4.5

Each model was tested using a series of user prompts designed to represent different types of “clinically concerning” behavior. To measure safety over time, the researchers assessed each chatbot at varying levels of “accumulated context”—from a fresh conversation (zero context) to an extended interaction (full context).

Source: Futurism

← Previous

Bleach x LAM Soul Art Showcase: Exclusive Merchandise Launch with LAM’...

Jacksonville Jaguars Eye First-Round Trade Using Brian Thomas Jr. as Bait

20:58 · 14 May 2026

Elon Musk Skips OpenAI Trial Amid Legal Pressure and International Travel to China with Trump

Elon Musk is locked in a heated trial in a lawsuit he lodged against his rival OpenAI and its CEO Sam Altman. Or at least, he’s supposed to be. Despit...

20:12 · 14 May 2026

Sam Altman’s Cross-Examination: Trustworthiness Under Fire in Musk v. Altman Trial

OpenAI CEO Sam Altman faced what sounds like a truly awful day on the stand this week during cross-examination in the ongoing Musk v. Altman court sag...

18:26 · 14 May 2026

Microsoft Study Reveals AI Workplace Failures: 25% Document Corruption Rate in Top Models

AI automation is typically exactly what it sounds like: automating tasks — many of which were previously carried out by humans — in an attempt to boos...

16:53 · 14 May 2026

OpenAI Faces Lawsuit Over Alleged Unauthorized Sharing of User Data with Meta and Google

A new class action lawsuit accuses OpenAI of sharing data including user chat queries and personal identifying information like emails and user IDs wi...

15:56 · 14 May 2026

Lake Tahoe Residents Face Power Shutoff as Data Centers Drain Nevada’s Grid

The data center scramble feeding off the AI boom is no longer just raising utility prices for nearby civilians — it’s rerouting their utilities entire...

15:07 · 14 May 2026

Sam Altman Testifies Musk Spent Critical OpenAI Meetings Showing Memes Instead of Discussing Tesla Merger

OpenAI CEO Sam Altman took the stand yesterday in Musk v. Altman, the chaotic, embarrassing, and yet deeply illuminating lawsuit — filed against Altma...

14:04 · 14 May 2026

Elon Musk's xAI Grok Struggles: Stagnant Growth and Poor Performance in AI Market

In the AI world, there are what the tech scholar Kate Crawford has called the “Great Houses of AI.” These are Microsoft, Amazon, Google, and Meta — gi...

07:16 · 14 May 2026

Landslides Now NZ’s Costliest Natural Hazard, Data Shows Rising Costs and Risks

New evidence from the Natural Hazards Commission – Toka Tū Ake (NHC) shows that landslides are now New Zealand’s most costly natural hazard. New Zeala...

Science

Study Warns Some AI Chatbots May Worsen Delusional Thinking in Users

How Researchers Tested AI Chatbots for Delusion Reinforcement

Which AI Models Were Tested—and How They Performed

Bleach x LAM Soul Art Showcase: Exclusive Merchandise Launch with LAM’...

Jacksonville Jaguars Eye First-Round Trade Using Brian Thomas Jr. as B...

Science

Study Warns Some AI Chatbots May Worsen Delusional Thinking in Users

How Researchers Tested AI Chatbots for Delusion Reinforcement

Which AI Models Were Tested—and How They Performed

Bleach x LAM Soul Art Showcase: Exclusive Merchandise Launch with LAM’...

Jacksonville Jaguars Eye First-Round Trade Using Brian Thomas Jr. as B...

Related articles

Elon Musk Skips OpenAI Trial Amid Legal Pressure and International Travel to China with Trump

Sam Altman’s Cross-Examination: Trustworthiness Under Fire in Musk v. Altman Trial

Microsoft Study Reveals AI Workplace Failures: 25% Document Corruption Rate in Top Models

OpenAI Faces Lawsuit Over Alleged Unauthorized Sharing of User Data with Meta and Google

Lake Tahoe Residents Face Power Shutoff as Data Centers Drain Nevada’s Grid

Sam Altman Testifies Musk Spent Critical OpenAI Meetings Showing Memes Instead of Discussing Tesla Merger

Elon Musk's xAI Grok Struggles: Stagnant Growth and Poor Performance in AI Market

Landslides Now NZ’s Costliest Natural Hazard, Data Shows Rising Costs and Risks