The mystery of GPT-2

Pranav Tiwari
1 min readApr 30, 2024

Day 121 / 366

There is a new chatbot going around, which has everyone speculating about it’s origins.

There is a website called LMSYS Chatbot Arena where users can compare various LLMs. A few days ago a mysterious model named ‘gpt2-chatbot’ was added there. It wasn’t a big deal at the time, because at first it just looked like a fine-tuned version of the old GPT-2 model.

But slowly this started gaining traction on social media, when users reported that it was outperforming GPT-4 in terms of logical reasoning. This is where the speculations began that it might be a hidden release of the new GPT 4.5 or GPT 5 model.

This model's coding, as well as mathematics skills, are way better than that of GPT-4. It also did well in things where LLMs generally fail, for example generating ASCII art.

To add fuel to the fire, OpenAI’s CEO Sam Altman tweeted this today —

I think this is definitely some sort of a marketing tactic. Given the hype that has been created around this model, I am sure we will hear from OpenAI or some other company soon claiming responsibility for creating it.

--

--

Pranav Tiwari

I write about life, happiness, work, mental health, and anything else that’s bothering me