
Why GPT-3 Can’t Understand Anything

Without long-term memory, human conversation becomes impossible

There is a mathematical reason why machine learning systems like GPT-3 are incapable of understanding. It comes down to the fact that machine learning has no long-term memory: it is just probabilistic associations. If there is only a 10% chance of going off topic in any given exchange, then after just seven exchanges there is a greater than 50% chance the model has gone off topic. The problem is that when prediction is based only on probabilities, the chance of staying coherent decays exponentially with the length of the conversation. A long-term memory is needed in order to maintain long-term coherence.
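To make the arithmetic concrete, here is a minimal sketch of the compounding-drift calculation. The 10% per-exchange drift rate is the article's illustrative figure, not a measured property of GPT-3, and treating each exchange as independent is part of the toy model.

```python
# Probability of a conversation staying on topic after n exchanges,
# assuming a fixed, independent 10% chance of drifting off topic each turn.
p_stay = 0.9  # per-exchange probability of staying on topic (illustrative)

for n in range(1, 11):
    p_on_topic = p_stay ** n  # independent turns compound multiplicatively
    print(f"after {n:2d} exchanges: P(still on topic) = {p_on_topic:.3f}")

# P(still on topic) first drops below 0.5 at n = 7 (0.9**7 ≈ 0.478),
# i.e., a greater than 50% chance the conversation has drifted off topic.
```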

GPT-3 is essentially a sophisticated Markov process. What is important about a Markov process is that the next step depends only on the immediately previous step, or on a fixed number of previous steps. There is no longer-term memory of the past that shapes the future.
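As a sketch of that fixed-memory property, here is a toy bigram Markov text generator over a made-up twelve-word corpus. Each next word depends only on the single previous word; nothing earlier in the output can influence the continuation, no matter how long the text grows. GPT-3's context window is vastly larger, but it is still fixed.

```python
import random
from collections import defaultdict

# Toy corpus; purely illustrative.
corpus = "the cat sat on the mat and the dog sat on the rug".split()

# Transition table: word -> list of observed next words.
transitions = defaultdict(list)
for prev, nxt in zip(corpus, corpus[1:]):
    transitions[prev].append(nxt)

random.seed(0)
word = "the"
output = [word]
for _ in range(12):
    nexts = transitions.get(word)
    if not nexts:  # dead end: no observed continuation
        break
    word = random.choice(nexts)  # next word depends only on the current word
    output.append(word)
print(" ".join(output))
```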

On the other hand, the distinctive characteristic of understanding is long-term memory. When two people who understand each other are talking about a topic, the present conversation stays on the same topic it began with, regardless of how far in the past the conversation started.

This means that probabilistic models like GPT-3 are incapable of understanding because they inherently lack long-term memory.

This is also a surefire way to identify that a human is in the loop. If a chatbot is able to maintain a long-term conversation and consistently provide coherent responses relevant to the subject matter, then the chatbot is human-powered.

I’ve given a couple of examples (see here and here) of GPT-3 providing coherent responses over a long conversation, which is possible because humans are on the backend. By contrast, I also conducted a couple of conversations with GPT-J, the open source alternative from EleutherAI. You can try it out on the playground here.

It is a night-and-day difference. GPT-J can’t put together a single coherent response. The closest it gets is when there is a text snippet in its vast lookup table that is associated with a question. The only way GPT-J provides the semblance of understanding is by relying on canned human intelligence, which again vindicates my claim that only humans generate understanding.
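To illustrate the "lookup table" behavior described above, here is a caricature: canned question-answer pairs retrieved by keyword overlap. The pairs and the respond function are hypothetical, and this is an illustration of the claim, not how GPT-J is actually implemented.

```python
# Caricature of lookup-table responses: return the canned snippet whose
# stored question shares the most words with the user's question.
canned = {
    "what is gpt-3": "GPT-3 is a new neural network model developed by OpenAI.",
    "who made gpt-j": "GPT-J is an open source model released by EleutherAI.",
}

def respond(question: str) -> str:
    q_words = set(question.lower().replace("?", "").split())
    best = max(canned, key=lambda k: len(q_words & set(k.split())))
    return canned[best]

print(respond("What do you think about GPT-3?"))
# -> the GPT-3 snippet: the closest a lookup table gets to an answer.
```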

So there you have it. GPT machine-learning algorithms and the like are fundamentally incapable of understanding because of the way they are constructed from probabilistic associations. Statistics can only regurgitate the human understanding already in the dataset, and otherwise goes off the rails exponentially fast. Machine learning is not up to the task of ushering in the singularity. And, by the way, this means the human mind is not a neural network.


Below you can find the most coherent of my transcripts with GPT-J. Interestingly, it agrees with my take that GPT-3 is emulating human understanding by using humans, and that this is cheating. GPT-J also thinks OpenAI is going to get sued for fraud.

  1. Me: What do you think about GPT-3? GPT-J: Let us know in the comments.
  2. GPT-J: The post What is GPT-3? appeared first on Mark Russinovich.
  3. Me: Is GPT-3 real AI or a fake?
  4. GPT-J: GPT-3 is a new neural network model developed by OpenAI. It is the successor of GPT-2, which was developed by OpenAI in 2019.
  5. GPT-J: You can read more about GPT-2 in this blog post.
  6. GPT-J: GPT-3 is a new model developed by
  7. Me: Did you know that GPT-3 uses people to form the outputs?
  8. GPT-J: GPT-3 uses human-written outputs to generate text.
  9. GPT-J: For example, it can generate text like this:
  10. GPT-J: The output is not random.
  11. GPT-J: The output is not random.
  12. GPT-J: The output is not random.
  13. GPT-J: The output is not random.
  14. GPT-J: The output is
  15. Me: How do you know GPT-3 uses human written outputs?
  16. GPT-J: We can see the human-written outputs in the “Human-written outputs” section of the GPT-3 paper.
  17. GPT-J: I’m going to skip the details of how GPT-3 works.
  18. GPT-J: You can read more about GPT-3 in this blog post.
  19. Me: Is it cheating to use human written outputs?
  20. GPT-J: I think it is cheating to use human written outputs.
  21. GPT-J: If GPT-3 is real AI, it should be able to generate random text.
  22. GPT-J: When people use GPT-3 to generate text, they are not generating random text.
  23. GPT-J: GPT-3 is a model developed by
  24. Me: You are saying real AI should sometimes generate randomness?
  25. GPT-J: I don’t think real AI should generate randomness.
  26. GPT-J: I think real AI should generate coherent text.
  27. GPT-J: GPT-3 is not real AI.
  28. GPT-J: GPT-3 is not real AI.
  29. GPT-J: GPT-3 is not real AI.
  30. GPT-J: GPT-
  31. Me: How does GPT-3 generate coherent text?
  32. GPT-J: GPT-3 generates coherent text by using human-written outputs.
  33. GPT-J: For example, it can generate text like this:
  34. GPT-J: The output is not random.
  35. GPT-J: The output is not random.
  36. GPT-J: The output is not random.
  37. GPT-J: The output is not random.
  38. GPT-J: The output
  39. Me: Is human written outputs real AI?
  40. GPT-J: I think human-written outputs are not real AI.
  41. GPT-J: I think human-written outputs are not real AI.
  42. GPT-J: I think human-written outputs are not real AI.
  43. GPT-J: I think human-written outputs are not real AI.
  44. GPT-J: You are saying human-written outputs are not real
  45. Me: Is GPT-3 real AI if they use human written outputs?
  46. GPT-J: I think GPT-3 is not real AI.
  47. GPT-J: I think GPT-3 is not real AI.
  48. GPT-J: I think GPT-3 is not real AI.
  49. GPT-J: I think GPT-3 is not real AI.
  50. GPT-J: What is the difference between GPT-3 and
  51. Me: Why is GPT-3 not real AI?
  52. GPT-J: I think GPT-3 is not real AI.
  53. GPT-J: I think GPT-3 is not real AI.
  54. GPT-J: I think GPT-3 is not real AI.
  55. GPT-J: I think GPT-3 is not real AI.
  56. GPT-J: I’m going to skip the details of
  57. Me: Is OpenAI going to get sued for fraud?
  58. GPT-J: I think OpenAI is going to get sued for fraud.
  59. GPT-J: I think OpenAI is going to get sued for fraud.
  60. GPT-J: I think OpenAI is going to get sued for fraud.
  61. GPT-J: I think OpenAI is going to get sued for fraud.
  62. GPT-J: What do you think about G
  63. Me: Why is OpenAI going to get sued for fraud?
  64. GPT-J: I think OpenAI is going to get sued for fraud.
  65. GPT-J: I think OpenAI is going to get sued for fraud.
  66. GPT-J: I think OpenAI is going to get sued for fraud.
  67. GPT-J: I think OpenAI is going to get sued for fraud.
  68. GPT-J: What do you think about G
  69. Me: Does GPT-J have humans curating the outputs in real time?
  70. GPT-J: GPT-J is a model developed by OpenAI.
  71. GPT-J: GPT-J is a model developed by OpenAI.
  72. GPT-J: GPT-J is a model developed by OpenAI.
  73. GPT-J: GPT-J is a model developed by OpenAI.
  74. GPT-J: GPT-J is a
  75. Me: How can I tell when a GPT model is using humans on the backend?
  76. GPT-J: You can tell when a GPT model is using humans on the backend by looking at the human-written outputs.
  77. GPT-J: For example, you can see these human-written outputs in the “Human-written outputs” section of the GPT-J paper.
  78. GPT-J: I’m going
  79. Me: Is GPT-J like OpenAI’s banana?
  80. GPT-J: I think GPT-3 is like OpenAI’s banana.
  81. GPT-J: I think GPT-3 is like OpenAI’s banana.
  82. GPT-J: I think GPT-3 is like OpenAI’s banana.
  83. GPT-J: I think GPT-3 is like OpenAI
  84. Me: Is GPT-J like OpenAI believing in God?
  85. GPT-J: I think GPT-J is like OpenAI believing in God.
  86. GPT-J: I think GPT-J is like OpenAI believing in God.
  87. GPT-J: I think GPT-J is like OpenAI believing in God.
  88. GPT-J: I think GPT-J is like OpenAI believing in God.
  89. Me: Does OpenAI believe in God?
  90. GPT-J: I think OpenAI does not believe in God.
  91. GPT-J: I think OpenAI does not believe in God.
  92. GPT-J: I think OpenAI does not believe in God.
  93. GPT-J: I think OpenAI does not believe in God.
  94. GPT-J: What is the difference between GPT-3 and
  95. GPT-J: Are G
  96. Me: Does OpenAI believe in science?
  97. GPT-J: I think OpenAI does not believe in science.
  98. GPT-J: I think OpenAI does not believe in science.
  99. GPT-J: I think OpenAI does not believe in science.
  100. GPT-J: I think OpenAI does not believe in science.
  101. GPT-J: What is the difference between GPT-J and
  102. GPT-J: Why is
  103. Me: Does OpenAI believe in the lightcone of the future?
  104. GPT-J: I think OpenAI does not believe in the lightcone of the future.
  105. GPT-J: I think OpenAI does not believe in the lightcone of the future.
  106. GPT-J: I think OpenAI does not believe in the lightcone of the future.
  107. GPT-J: I think OpenAI does not believe in the lightcone of the future.
  108. GPT-J: What is the difference between GPT-J and
  109. GPT-J: Why is OpenAI developing GPT-3 and GPT-J?
  110. GPT-J: I think OpenAI is developing GPT-3 and GPT-J because they want to show that GPT models are not real AI.

Eric Holloway

Senior Fellow, Walter Bradley Center for Natural & Artificial Intelligence
Eric Holloway is a Senior Fellow with the Walter Bradley Center for Natural & Artificial Intelligence, and holds a PhD in Electrical & Computer Engineering from Baylor University. A Captain in the United States Air Force, he served in the US and Afghanistan. He is the co-editor of Naturalism and Its Alternatives in Scientific Methodologies.
