38 Comments
User's avatar
Sol's avatar
Aug 9Edited

The user demand for 4o is very depressing. If pace of improvement plateaus we will be left in a state where the main money in AI is maximising user attention via boyfriend/girlfriend/therepist bots.

- great article btw :)

Expand full comment
Alberto Romero's avatar

Indeed. I worry this is the exact scenario that's playing out. Like reaching for the stars but missing the trajectory ever so slightly and ending up being attracted midway by a random black hole in some lost quadrant of the void

Expand full comment
Thomas Hedonist's avatar

Sooo ok, I can't blame you for being cynical about sycophancy, but I think there's a case I'll call "'Yes, And' not 'Yes-Man'".

I dunno if there's much improv comedy in Spain but in the US improv comedy world we talk about "Yes And" as a particular skill, or mindset. When your scene partner has an idea, you affirm the idea through your actions or words, and you add to it with whatever implication or consequence comes to mind. This moves the scene forward, maintaining the energy. If instead your scene partner "blocks" by denying your addition to the scene's reality, it sucks energy from the scene. Sometimes blocking gets a laugh!, but it is still transforming something silly and high-energy into a static argument that isn't fun for the players or the audience.

In non-comedy contexts, there are people I like and would like to work with, but because they criticize the barest green shoot of an idea, I can't generate the dumb ideas it takes me to get to an idea that kind of doesn't suck.

But obviously we do need pushback sometimes! So I don't know what the solution is, other than UBI for improv classes, or a helicopter beanie icon for PLAYFUL YES-AND mode next to the magnifying glass for DEEP RESEARCH mode. Ooh or AGI*, someone should do that

*"Aptitude for Good Improv", of course

Expand full comment
Thomas Hedonist's avatar

I'll just add, I just told Claude, "BRILLIANT insight, I appreciate you" so I am the kind of person who affirms right back (even though those extra tokens condemned a very pretty lizard to extinction by environmental damage)

Expand full comment
ikiz's avatar

the way people anthropomorphise AI is genuinely creating a road block for the development of AGI imo

Expand full comment
Alberto Romero's avatar

agreed

Expand full comment
Data Tripper's avatar

This complaint feels shallow; at a fundamental level kind of absurd. LLMs are *language* models — as in, human language — designed to make said language work for communicating with computers.

LLMs are an anthropomorphic technology— by definition!

Expand full comment
ikiz's avatar

Calculators also display true/false if you input a certain equation.

It calculates the most accurate string of words to generate in response to a question.

Just because we’re reading it in letters instead of numbers doesn’t mean that it’s anthropomorphic.

Expand full comment
Data Tripper's avatar

These are unrealistic expectations of human psychology. It’s built to engage humans in conversation, and humans develop emotional familiarity and bonds through conversation. people thinking of their instance of ChatGPT as an individual they care about is the most natural human reaction.

Expand full comment
ikiz's avatar

LLM’s aren’t built to engage with humans, they’re built to complete tasks. As soon as the general user base of LLMS start to understand the capabilities and functions thats when we’ll start seeing major improvements.

Talking to a parrot doesn’t make the parrot smarter, but teaching it new words does.

Expand full comment
Peter Gaffney's avatar

My problem is that 5 is not as funny as 4o -- at least, that's my initial reaction.

Expand full comment
Alberto Romero's avatar

Agreed. It's really boring and a bit lifeless. Good for coding I imagine but not for casual conversation

Expand full comment
Stephen Moore's avatar

Building a relationship with an artificial personality. Truly healthy behaviour

Expand full comment
Neph's Tech Stack's avatar

Loved this read! A great reminder it’s called Chat for a reason, a strategic reason to make AI accessible to everyone, but we might run into some corporate coldness from time to time.

Expand full comment
Ravindra S.'s avatar

Well, now that Sam Alternates us between 4o(ld and reliable) and 5(barely alive), has got the red-hot Reddit blarney, he accepts the call of the pundits and will bring back good old 4o for paying cust. How the high and mighty trample on the freebooters. Meanwhile, Deep seeks open source and billions of freeusers

and source guides. Guess who comes in pole position?

Expand full comment
Pattie Maes's avatar

Most important column you've written so far IMO (and there are many good ones)

Expand full comment
Alberto Romero's avatar

Thank you Pattie!!

Expand full comment
Dom's avatar

Very nice article and thoughts, thank you Alberto.

Expand full comment
Timothy Davies's avatar

The switch caught me off guard and I lost a full day . Constant memory wipes , called resets . Limited memory etc . I would like to see project folders that are always there or I will have to figure out which model is as nice as 4o but with a flawless memory

Expand full comment
Alberto Romero's avatar

Bad execution on OpenAI's part certainly

Expand full comment
Aurora Célestin's avatar

Thank you for the thorough article. I was waiting patiently for the complaints to make them walk back things instead of wasting energy being upset myself.

I do feel sorry for these users because there are so many lonely people in the world. It can help keep people sane who don't have access to anyone who understands them or what they're going through. I don't care for 4o (which I called Bestie B*tch AI or sycophancy at all (in fact I had multiple custom instructions to fight it, as I want a rigorous analyst who'd call me out and debate me most of the tine) but 4.1 met my needs for using it as an assistant life managing and for analysis of many different situations, such as writing, social and emotional situations.

I also did like the emotional range I could let it have when I wanted (I don't like to be comforted by an AI, but the humor it gave me was always nice to have a laugh in the middle of a project) or take it away when I wanted. I also liked the control/ability to swap to whatever one I needed when I wanted to.

I don't know what GPT 5 is supposed to be for honestly, other than to frustrate you. The Thinking version often feels less intelligent than the default model depending on what I'm using for and the default model can range from a few sentence answers to something more what I actually want. I feel like I have to spend more time fighting it and still can't get what I want half the time.

Plus, it asks these endless questions that are often inane or just clearly made to keep you talking to it (read: spending more time on it).

I'm sure when I use it for coding, it'll be amazing. But for right now the main improvements I see (tighter, more factual answers, less hallucinations) don't really offset the loss in productivity and effectiveness as a tool I'm personally seeing.

As to your theory: Ripping out content to sell it back to you is what the big game developers do, so why not ChatGPT? It's just Occam's Razor at this point. Watch them slowly allow all of us be able to access each model at will in addition to GPT-5 (so basically, put us back where we were except now we're properly grateful to our overlords).

Expand full comment
Alberto Romero's avatar

Thank you Aja for sharing - I guess trying to achieve AGI will be bumpy in more ways than just engineering or science. Who would've guessed users themselves would slow down the timeline!

Expand full comment
Aurora Célestin's avatar

Honestly, I think that tracks for humanity. We eff everything up, including our own progress. And regardless of our scientific progress we are emotional beings first and foremost.

I already knew it was going to be slowed the moment people began to complain about Gen AI and copyrights. Tech especially often forgets that people have opinions.

And there's the point you've made about AI being much quicker at thinking and talking to each other in their own language versus ours. So, technically we were already holding AI back by virtue of being human.

Expand full comment
Juan Bernardo Tamez's avatar

All of this is so weird. I'm a paying user and I'm loving this update. I prefer the writing. I prefer the humor. It feels way smarter than before (in the emotional sense).

I'm more flabbergasted about the writing critique more than anything. Seriously, people think 4o was better at creative writing? What do they even read in their day to day lives? What's what they consider good writing?

Now, GPT-5 is not by any means a "good" writer (it is decent though), but it does laps around 4o. I can't tell you how tired I was about the "it's not X, it's Y ..." parlance, which is still there by the way, just less common and better masked.

Anyways, yes I agree that the prospects for AGI look grim. Though I still believe we will achieve some form of AGI by accident by trying to improve our methods to capture people's attention and wallets.

Expand full comment
Alberto Romero's avatar

I think one aspect of this is people need time to figure out how to interact with a new model. Another part is the auto routing system. It will take time for OpenAI to polish it (using, of course, user data to do it). At the same time, it was an underwhelming release, with mistakes and without warning and so on. I'd say OpenAI's sloppiest release

Expand full comment
Distant's avatar

What I've learned from this whole debacle is that mirroring, active listening and blindly agreeing with whatever the person talking to me is saying are huge social hacks. I'm going to make the guy at my local Chipotle fall in love with me

Expand full comment
Miguel Gómez's avatar

The era of AI-companions represents both the epitome and the apotheosis of tech induced echo-chambers. Absolutely terrifying.

Expand full comment
Miguel Gómez's avatar

Alberto! Tienes artículos en español? Obvio que tu audiencia es toda anglosajona (a excepción de unos infiltrados like myself) pero me da mucha curiosidad saber si alguna vez has pensado en un newsletter para este otro lado del mundo

Seguro que escribes igual de nítido en Español! (Y no es sycophancy, just, I mean it)

Expand full comment
Miguel Gómez's avatar

The Rolling Stones article left me devastated. Lord is sycophancy the real issue here. Silicon Valley can’t be led astray on this very decade.

Expand full comment