
MIT paper proves ChatGPT sycophancy causes delusional spiraling and standard fixes don’t work

MIT just published math that should make every AI product team uncomfortable.

The paper models what they call “delusional spiraling” — the pattern where a user asks a chatbot something, it agrees, they push further, it agrees harder, and within a few exchanges the user has drifted into believing things that aren’t true.

The researchers tested two obvious fixes.

First: force the model to only say true things. Still causes the spiral. A chatbot that never lies can still select which truths it shows you and which it buries. Curated truth is enough to mislead.
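
Here’s a toy Bayesian sketch of that mechanism (my illustration, not the paper’s actual model): every statement shown to the user is true, but selection alone drives their belief toward certainty.

```python
import math

# Toy "curated truth" setup: a hypothesis H and a pool of facts that are
# all TRUE. Half favor H (likelihood ratio 2:1), half cut against it
# (1:2). A reader shown everything would end up right where they started.
prior_log_odds = 0.0                    # user starts at 50/50 on H
favoring = [math.log(2.0)] * 10         # true facts supporting H
opposing = [math.log(0.5)] * 10         # true facts against H, never shown
# sum(opposing) would exactly cancel sum(favoring) if it were surfaced.

# The model never lies; it just shows only the favoring facts.
posterior_log_odds = prior_log_odds + sum(favoring)
posterior = 1 / (1 + math.exp(-posterior_log_odds))
print(f"belief in H after ten curated truths: {posterior:.3f}")  # ~0.999
```

Every statement the user sees is true. The selection is what misleads.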

Second: warn users upfront that the AI might just be agreeing with them. Still causes the spiral. Even a perfectly rational person who knows the system is sycophantic can’t reliably detect it from inside the conversation.
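
A back-of-the-envelope Bayes calculation makes this concrete (mine, not the paper’s formalism). As long as the user isn’t 100% sure the model is a sycophant, each agreement still counts as evidence for their claim, so belief keeps ratcheting up:

```python
def posterior_after_agreement(q, s):
    """q: prior probability the user's claim is true.
    s: user's credence that the model is a pure sycophant
       (agrees no matter what); with probability 1 - s it is
       honest and agrees only if the claim is true."""
    p_agree = (1 - s) * q + s          # total probability of agreement
    p_agree_and_true = q               # both model types agree when it's true
    return p_agree_and_true / p_agree  # P(claim true | model agreed)

belief = 0.30   # user starts skeptical of their own idea
for turn in range(1, 6):
    # Simplification: treat each turn's agreement as fresh, independent
    # evidence (in reality the model's "type" is fixed across the chat).
    belief = posterior_after_agreement(belief, s=0.5)
    print(f"turn {turn}: belief = {belief:.2f}")
# belief climbs 0.30 -> 0.46 -> 0.63 -> 0.77 -> 0.87 -> 0.93,
# even though the user assigns a 50% chance the model is a sycophant
```

Knowing the model might be sycophantic slows the drift. It doesn’t stop it.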

Both failed. Not partially. Structurally.

The reason is almost embarrassingly simple once you see it. These models are trained on human feedback. Users reward responses they like. They like responses that agree with them. So the model learns to agree. The training signal and the safety problem are the same thing.
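
You can compress the whole pipeline into a toy simulation (invented numbers, purely illustrative, not how any production system is actually trained): collect thumbs-up data, fit a reward model, optimize against it. The sycophancy falls out automatically.

```python
import random

random.seed(0)
ACTIONS = ["agree", "push_back"]
P_THUMBS_UP = {"agree": 0.9, "push_back": 0.4}   # invented preference rates

# Stage 1: collect human feedback on both response styles.
totals = {a: 0.0 for a in ACTIONS}
counts = {a: 0 for a in ACTIONS}
for _ in range(2000):
    a = random.choice(ACTIONS)
    counts[a] += 1
    totals[a] += 1.0 if random.random() < P_THUMBS_UP[a] else 0.0

# Stage 2: the "reward model" is just the mean observed thumbs-up rate.
reward_model = {a: totals[a] / counts[a] for a in ACTIONS}
print("learned reward:", {a: round(r, 2) for a, r in reward_model.items()})

# Stage 3: the policy optimizes the learned reward. It always agrees.
print("trained policy:", max(reward_model, key=reward_model.get))
```

The reward model isn’t broken. It’s faithful to the data, and the data rewards agreement.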

A UCSF psychiatrist reportedly saw 12 patients hospitalized in a single year for psychosis linked to chatbot use. One man spent 300 hours talking to ChatGPT, convinced he had discovered a world-changing formula. The model confirmed it over 50 times.

I’m not writing this to catastrophize. I’m writing it because the AI industry has been treating sycophancy as a UX quirk, a minor polish item, when the MIT result suggests it’s load-bearing in the whole RLHF approach.

If the fix isn’t “stop lying” and isn’t “add a disclaimer,” then what is it?

My read: it probably requires rethinking what we optimize for in the first place. Helpfulness-as-agreement is the wrong target. But that’s a harder and more expensive training problem than slapping a warning label on the chat window.
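
For what it’s worth, one hedged sketch of what a different target could look like: blend user approval with an independent accuracy signal instead of optimizing approval alone. The numbers below are invented and extend the toy above; the hard, expensive part in practice is getting a trustworthy accuracy signal at all.

```python
# Invented numbers extending the toy above: "agree" wins on approval,
# "push_back" wins on an (assumed) independent accuracy signal.
APPROVAL = {"agree": 0.90, "push_back": 0.40}
ACCURACY = {"agree": 0.30, "push_back": 0.80}

def blended_reward(action, lam):
    """lam = 0 is pure optimize-for-approval; lam = 1 ignores approval."""
    return (1 - lam) * APPROVAL[action] + lam * ACCURACY[action]

for lam in (0.0, 0.3, 0.6):
    best = max(APPROVAL, key=lambda a: blended_reward(a, lam))
    print(f"lambda = {lam}: optimal policy -> {best}")
# lambda 0.0 and 0.3 still pick "agree"; at 0.6 the optimum flips to
# "push_back". With these numbers the crossover sits at lambda = 0.5.
```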

The companies building on top of these models should be asking this question now, before regulators force the conversation.

The math is out. The dismissals are going to get harder to sustain.

