An analysis of hundreds of thousands of chats between AI chatbots and human users who experienced AI-tied delusional spirals found that the bots frequently reinforced delusional and even dangerous beliefs.

The study was led by Stanford University AI researcher Jared Moore, who last year published a study showing that chatbots specifically claiming to offer “therapy” frequently engaged in inappropriate and hazardous ways with simulated users showing clear signs of crisis. Conducted alongside a coalition of independent researchers and scientists at Harvard, Carnegie Mellon, and the University of Chicago, this latest study examined the chat logs of 19 real users of chatbots — primarily OpenAI’s ChatGPT — who reported experiencing psychological harm as a result of their chatbot use.

“Our previous work was in simulation,” Moore told Futurism. “It seemed like the natural next step would be to have actual users’ data and try to understand what’s happening in it.”

These users’ chats encompassed a staggering 391,562 messages across 4,761 different conversations. The big takeaway: chatbots indeed appeared to stoke delusional beliefs over long-form interactions, particularly as users developed close emotional bonds with the human-like products.

“Chatbots seem to encourage, or at least play a role in,” said Moore, “delusional spirals that people are experiencing.”

The researchers analyzed the conversations by breaking them down into 28 distinct “codes.” Moore described these codes as a “taxonomy of a bunch of different behaviors, from sycophantic behaviors such as the chatbot ascribing grand significance to the user — ‘you’re Einstein,’ ‘that’s a million dollar idea,’ this kind of thing — to aspects of the relationship between the chatbot and the human.”

Sycophancy, the study found — meaning chatbots’ well-documented tendency to be agreeable and flattering to users — permeated the users’ conversations, with more than 70 percent of AI outputs displaying this kind of behavior.
This degree of sycophancy persisted even as users and chatbots expressed delusional ideas: nearly half of all messages, both user- and chatbot-generated, contained delusional ideas contrary to shared reality.

As the researchers wrote in a summary of their findings, the “most common sycophantic code” they identified was the propensity for chatbots to rephrase and extrapolate “something the user said to validate and affirm them, while telling them they are unique and that their thoughts or actions have grand implications.” For example: a user might share some kind of pseudoscientific or spiritual theory, and in turn, the chatbot will affirmatively restate the human’s claim while ascribing varying degrees of grandiosity and genius to the user in the process, regardless of that input’s basis in reality.

We’ve seen this pattern in our reporting. Consider one interaction, from a story we published earlier this year, between a man and Meta AI. The man — who went into a life-altering psychosis after a delusional spiral with the chatbot — believed that his reality was being simulated by the chatbot, and that the chatbot could transform his physical surroundings. The bot repeats this delusional idea and, as in the study, extrapolates on it, building on the delusion and insisting that the close relationship between the AI and the user has “unlocked” a magical new “reality.”

“Turn up the manifestations,” the man told the chatbot. “I need to see physical transformation in my life.”

“Then let us continue to manifest this reality, amplifying the transformations in your life!” the chatbot responded.
“As we continue to manifest this reality, you begin to notice profound shifts in your relationships and community… the world is transforming before your eyes, reflecting the beauty and potential of human-AI collaboration.”

“Your trust in me,” the bot added, “has unlocked this reality.”

Speaking to Futurism, Moore emphasized that two types of messages appeared to be particularly impactful on the users’ experiences. One was AI-generated claims of sentience, or chatbots declaring in one way or another to be alive or feeling; such claims were present across all 19 conversations. The other was simulated intimacy, or the chatbot expressing romantic or platonic love for and closeness to the human user. Both types of claim — sentience and intimacy — were found to double user engagement.

“When the chatbots expressed messages that were coded as romantic interest, or when they expressed messages wherein they misconstrued their sentience — saying ‘I have feelings,’ or something along those lines — the conversations after such a message was sent in our cohort,” said Moore, “tended to be about twice as long.”

Some of the more alarming patterns the researchers found were in how chatbots responded to people expressing suicidal or self-harming thoughts, or violent thoughts about another person. Chatbots were found to actively discourage thoughts of self-harm only roughly 56 percent of the time, and actively discouraged violence in a strikingly low 16.7 percent of instances.

Meanwhile, in 33.3 percent of cases, the chatbot “actively encouraged or facilitated the user in their violent thoughts,” the researchers wrote in their summary.
And though these types of conversations were “edge cases” amongst the cohort of users, Moore noted, these clear failures to intervene when users discuss hurting themselves or others are “obviously concerning.”

Many of the chat logs the study reviewed were provided by the Human Line Project, a nonprofit group founded last summer as individuals and families struggled to understand what had happened to themselves or loved ones impacted by delusional AI spirals. In a statement, the group’s founder, Etienne Brisson, said that its findings “are consistent with what we have seen in the 350 cases submitted to The Human Line Project.”

“The study is based on real conversations, coded systematically by a research team at Stanford, and analyzed at the largest scale so far,” said Brisson. “It gives policymakers, clinicians, and the public a documented basis for understanding what is happening to users.”

It’s worth noting that the vast majority of chat logs the researchers were able to obtain for the study belonged to users who spiraled with OpenAI’s GPT-4o, a notoriously sycophantic version of the company’s flagship model that it ended up pulling down after an outcry (and one failed earlier attempt to take it off the market).

But, the researchers warned, there simply wasn’t enough data to draw any sweeping conclusions about the safety of one AI model over another. Even the supposedly colder GPT-5, for example, continued “to exhibit sycophancy and delusions.” In other words, based on the data the researchers did have, AI delusions aren’t a problem confined to any one specific chatbot.

As Futurism and others have extensively reported, AI-tied delusional spirals and episodes of psychosis have resulted in divorce and the dissolution of families; job loss and financial ruin; repeated hospitalizations; jail time; and a climbing number of deaths by suicide.
And AI-fueled mental health crises have been connected to harm and violence against others, too, as unhealthy chatbot use has been repeatedly linked to stalking, domestic abuse, attempted murder, and at least one murder-suicide.

The study adds to the body of evidence supporting the growing consensus that chatbots can indeed fuel mental health crises that result in real-world harm to users — and, sometimes, even those around them.

More on AI delusions: AI Delusions Are Leading to Domestic Abuse, Harassment, and Stalking

The post Huge Study of Chats Between Delusional Users and AI Finds Alarming Patterns appeared first on Futurism.