But the implausible does happen
This is one of your better posts. I don't fully agree with you on the "averaging" interpretation... that's a bit too simplistic, and it's the same error I made when first judging MidJourney. And yet I can say it has challenges with anomalies and outliers. You repeat the misconception -- *intentionally*, since you *know* this to be false -- that GPT has "memorized" the internet. Two clarifications:
a) The training dataset comprises significantly less than one-third of the internet, and certainly (at this point) does not include video, which is a massive store of untapped information.
b) It isn't, as we now understand it, memorization. It's fractal compression. It's pattern recognition. It's much more similar to the highly imperfect mechanism of human memory than to storing to a database or a hard drive with error correction and fault tolerance. From my understanding, GPT's method of "memory" is basically reconstructing context from patterns that were "burned in" to its neural net while digesting the training dataset and then reinforced by months of RLHF. So it's much more like reconstructive, symbolic human memory -- stories grown from "idea seeds," abstract relations between disparate concepts, strange triggers (a smell) expanding into massive sensory concepts (the day we met) -- than like literal bit-for-bit file storage.
Another great read, Alberto!
The way ChatGPT appears to fill in people's deviating life paths reminds me of the fact that our own brains act in a similar way when it comes to how we perceive the world. There's the famous fact that our eyes have "blind spots" where they literally can't see, which the brain helpfully fills in with what it predicts should be there.
Then there's this relatively recent research showing that our brain tends to first spot the borders of objects and then fill in--or "color in"--the surface area (https://www.sciencedaily.com/releases/2007/08/070820135833.htm).
This quote by one of the professors is telling: "...a lot of what you perceive is actually a construction in your brain of border information plus surface information—in other words, a lot of what you see is not accurate."
I just find it curious how a large language model that's said to mimic our reasoning process ends up inadvertently acting like our brains in yet another way.
You nailed it, Alberto!
Daniel's comment brings up a familiar subject for me, as I'm a cinematographer.
We naturally like things to make sense and be connected, which is why our brains work so hard when we watch a movie.
Interestingly, it's not the rational part of our brains that's doing the heavy lifting, but rather, it's more of a back burner activity.
Our brains turn a bunch of still pictures shown quickly one after another into what looks like real movement. It's kind of like a magic trick our brains play on us.
And not only that, but our brains also try to make sense of the story on the screen and find connections, even though it's all just pretend.
Humans crave coherence, and it seems that AI has inherited some of this trait.
This is a really interesting view. A large ML model with huge amounts of training data should indeed be exceptionally good at a large number of the most common cases, but will fail at outliers.
Perhaps that’s why something like ChatGPT will not replace Google. We still need storage and lookup for the unique things in the world. (It will surely take a big cut of Google revenue though...)
It’s also interesting to compare GPT to humans. We have a single massive instance that has seen and read through a significant chunk of what the world has ever produced. And then we have 8 billion instances, each of which has lived and observed its very own sliver of the world, forming unique experiences and thoughts, interacting with the others.
Is this chaos and uniqueness of humans the thing that will provide the most value in society in the next decades?
Human labor is, at any moment, a mental projection of something in the present onto something in the future, transformed by and adapted to the conditions arising between that present and that future.
How could a robot with access only to the past reach this future? No way.
The danger, imo, is that most people may come to believe they no longer need to learn how to embrace the conditions of the future, or how to adapt to them, once they can effortlessly get this future-looking shape that is really only a reshaping of the past.
Alberto! Best piece thus far. Just going to let this sit and re-read tonight.
Congrats and thanks.
Loved this read. It sharpened my insight.
ChatGPT compresses, in effect, all of the internet into a file. That compression is lossier for low-signal content than for well-documented things.
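A toy sketch of that intuition (purely illustrative, not how GPT actually stores anything; the mini-corpus and threshold are invented for this example): if you "compress" a corpus by keeping only word patterns that recur, frequently documented facts survive and rare outliers drop out.

```python
from collections import Counter

# Hypothetical mini-corpus: one well-documented fact repeated often,
# one low-signal fact that appears a single time.
corpus = (
    "paris is the capital of france . " * 50 +
    "zinder is a city in niger . " * 1
).split()

# Count adjacent word pairs (bigrams) across the corpus.
bigrams = Counter(zip(corpus, corpus[1:]))

# Lossy "compression": keep only bigrams seen more than once.
# The rare fact falls below the threshold and is lost.
kept = {pair for pair, n in bigrams.items() if n > 1}

print(("paris", "is") in kept)    # True  - frequent pattern retained
print(("zinder", "is") in kept)   # False - outlier discarded
```

The threshold stands in for finite model capacity: the more often a pattern occurs in training data, the more reliably it is "burned in" and reconstructable later.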
The hard part is to get the reasoning right; memorization can be solved via a simple web lookup.
With the number of tokens it can ingest going up, getting facts wrong is something currently deployed models suffer from, but it won't be long before it doesn't matter anymore.
The median is the message?