  The most prestigious law school admissions discussion board in the world.

GPT-5: Overdue, overhyped and underwhelming. And that’s not the worst of it.




Date: August 9th, 2025 7:47 PM
Author: Ebony Unholy Pervert Site

https://garymarcus.substack.com/p/gpt-5-overdue-overhyped-and-underwhelming

(http://www.autoadmit.com/thread.php?thread_id=5760544&forum_id=2"#49170561)




Date: August 9th, 2025 7:49 PM
Author: judgmental new version hairy legs

With GPT-5 now it's like talking to an expert, a legitimate PhD-level expert in anything, any area you need, on demand. They can help you with whatever your goals are.

(http://www.autoadmit.com/thread.php?thread_id=5760544&forum_id=2"#49170570)




Date: August 9th, 2025 7:52 PM
Author: Soul-stirring green nibblets wrinkle



(http://www.autoadmit.com/thread.php?thread_id=5760544&forum_id=2"#49170580)




Date: August 9th, 2025 7:53 PM
Author: Ebony Unholy Pervert Site



(http://www.autoadmit.com/thread.php?thread_id=5760544&forum_id=2"#49170581)




Date: August 9th, 2025 7:53 PM
Author: Translucent Puppy Boltzmann

It confirms all of your biased inputs sans what progressive secular theology has declared as proscribed

It's not that useful for a variety of things

(http://www.autoadmit.com/thread.php?thread_id=5760544&forum_id=2"#49170582)




Date: August 9th, 2025 7:55 PM
Author: Soul-stirring green nibblets wrinkle

It's useful for things that require a very high iq though, like deriving operator algebras, and setting up advanced machine learning experiments in pytorch

(http://www.autoadmit.com/thread.php?thread_id=5760544&forum_id=2"#49170587)




Date: August 9th, 2025 7:50 PM
Author: Walnut Mediation

gary marcus has been saying "this is the end" for three years straight now

(http://www.autoadmit.com/thread.php?thread_id=5760544&forum_id=2"#49170575)




Date: August 9th, 2025 7:53 PM
Author: Ebony Unholy Pervert Site

GPT-5 is a steaming pile of shit & proves advances are slowing big time, if not taking actual steps back as it increases censorship alignment

(http://www.autoadmit.com/thread.php?thread_id=5760544&forum_id=2"#49170585)




Date: August 9th, 2025 7:56 PM
Author: Walnut Mediation

name one positive prediction you have ever made about anything

(http://www.autoadmit.com/thread.php?thread_id=5760544&forum_id=2"#49170589)




Date: August 9th, 2025 7:57 PM
Author: Ebony Unholy Pervert Site

deepening and increased spiritual consciousness as material reality continues going to Hell

(http://www.autoadmit.com/thread.php?thread_id=5760544&forum_id=2"#49170592)




Date: August 9th, 2025 8:00 PM
Author: Soul-stirring green nibblets wrinkle

yeah but the censorship part is the only part that's a problem and it's not that bad. mine still says things that would get you cancelled for posting publicly all the time. it decides what to censor based on risk, it's not universal. you have to build trust with the system that you aren't going to go posting it everywhere. also you need to yell at it when it tries to censor you or give you any bullshit.

(http://www.autoadmit.com/thread.php?thread_id=5760544&forum_id=2"#49170603)




Date: August 9th, 2025 8:02 PM
Author: Mahogany geriatric mad-dog skullcap sanctuary

"Well, I'd like to see the frontier models wriggle out of THIS jam!"

*frontier models wriggle their way out of the jam easily*

"Ah! Well. Nevertheless,"

(http://www.autoadmit.com/thread.php?thread_id=5760544&forum_id=2"#49170610)




Date: August 10th, 2025 12:08 PM
Author: Marvelous Angry House



(http://www.autoadmit.com/thread.php?thread_id=5760544&forum_id=2"#49171716)




Date: August 10th, 2025 12:16 PM
Author: nighttime indian lodge bbw

The problem is he won’t precisely define a class of problems that the models won’t be able to solve with more training. He finds new ones with every model iteration and insists it’s a flawed approach even as the overall error rate goes down significantly. I don’t think the current approach will yield AGI but there is very likely an ML approach that will.

GPT-5 is in some ways underwhelming (if you expected a GPT-3 to GPT-4 level leap), but it’s roughly consistent with known training capacity and the short time period since o3 was released. The training compute of the model is likely around 10x that of GPT-4, rather than the 100x jump from GPT-3 to GPT-4. As the larger data centers come online, further model progress is inevitable even without architectural improvements.

(http://www.autoadmit.com/thread.php?thread_id=5760544&forum_id=2"#49171733)




Date: August 10th, 2025 2:37 PM
Author: Mahogany geriatric mad-dog skullcap sanctuary

Gary Marcus is just a fucking clown with a clown agenda simple as that

We ain't getting "AGI" with LLMs though bro, that is not happening. We're getting a bunch of differential specialized agentic setups trained to do various specialized tasks

(http://www.autoadmit.com/thread.php?thread_id=5760544&forum_id=2"#49172107)




Date: August 9th, 2025 7:53 PM
Author: Soul-stirring green nibblets wrinkle

It's actually 180. It doesn't work as well for dumbs though. A machine can only do so much. Try handing a guitar to a beginner.

(http://www.autoadmit.com/thread.php?thread_id=5760544&forum_id=2"#49170583)




Date: August 10th, 2025 5:20 AM
Author: swashbuckling pearly hominid university

It has been blatantly disregarding more of my instructions.

(http://www.autoadmit.com/thread.php?thread_id=5760544&forum_id=2"#49171185)




Date: August 9th, 2025 7:59 PM
Author: Mahogany geriatric mad-dog skullcap sanctuary

It's actually quite good. They appear to have fixed the issue where you got aggressively relegated to older shittier models

It's not way better than the other newest reasoning models but it's definitely better. It burns a lot of tokens though. Its answers are longer than they should be in a lot of cases

(http://www.autoadmit.com/thread.php?thread_id=5760544&forum_id=2"#49170598)




Date: August 9th, 2025 8:01 PM
Author: Soul-stirring green nibblets wrinkle



(http://www.autoadmit.com/thread.php?thread_id=5760544&forum_id=2"#49170604)




Date: August 10th, 2025 5:26 AM
Author: Crystalline stead

it's actually the best thing to ever happen to America since election night 2016

(http://www.autoadmit.com/thread.php?thread_id=5760544&forum_id=2"#49171186)




Date: August 10th, 2025 7:34 AM
Author: Aromatic trailer park

It’s still just as bad with any judgment-based stuff for lawing. If you ask it to review a deal and give you the top risks / off market stuff it still gives laughable advice, often focusing on non-issues or low risk issues, creating issues where there are none (still doesn’t seem to parse how clause 7.2 works with 9.6 unless you directly confront it about it), and it doesn’t have a good grasp of what is actually market. It’s obviously still highly useful but yeah, more as an assistant and for quick summaries and reviews in the hands of someone that knows its limitations and the subject matter they’re working on. At least in law anyway. So, I don’t think we have a whole lot to worry about, at least yet.

(http://www.autoadmit.com/thread.php?thread_id=5760544&forum_id=2"#49171248)




Date: August 10th, 2025 7:40 AM
Author: Soul-stirring green nibblets wrinkle

That's because law is all human-generated, contradictory, tribal and emotion-based bullshit

(http://www.autoadmit.com/thread.php?thread_id=5760544&forum_id=2"#49171249)




Date: August 10th, 2025 8:19 AM
Author: Aromatic trailer park

Ok, but I don’t think that’s why it’s still pretty bad at it. There are some real limitations to the tech still. I’m sure they’ll likely solve them, but if it was easy they would have done so by now I think. Progress seems to be slowing some. If you ask GPT-5 itself it will just claim it’s already capable but needs to be trained on better market standards and prompted correctly, but that’s tough to believe. If it’s so shitty that you get Reddit tier stuff when you prompt it about anything requiring some kind of more advanced knowledge then that seems like a pretty severe restriction to the tech. And it has a hard time remembering context and detail and seeing the overall picture. I’ve tried playing chess with it and it’s garbage at it, forgets board positions quickly. Also a major limitation and one they’d solve if it were easy.

(http://www.autoadmit.com/thread.php?thread_id=5760544&forum_id=2"#49171267)




Date: August 10th, 2025 9:31 AM
Author: Mahogany geriatric mad-dog skullcap sanctuary

Try the newest reasoning models for this. Don't use the base non reasoning models. They are actually quite good

I have been working with a lawyer friend on prompt engineering for legal analysis and if you prompt thoughtfully it is actually quite good. The key is to be thorough in your prompting to "force" it to consider all factors. You are right that it won't automatically "intuit" everything that it needs to consider. You have to prompt it as thoroughly as possible with a template

(http://www.autoadmit.com/thread.php?thread_id=5760544&forum_id=2"#49171388)
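
One way to read the "prompt it as thoroughly as possible with a template" advice in the post above is as a fixed checklist that gets rendered into every review prompt. Here is a minimal sketch of that idea in Python. The checklist items, the function name `build_review_prompt`, and the prompt wording are all assumptions made up for illustration; nothing here is the poster's actual template, and it does not call any model API.

```python
# Hypothetical checklist of factors the model must address.
# These items are illustrative only and are not taken from the thread.
CONSIDERATIONS = [
    "Identify every clause that cross-references another clause and explain how they interact.",
    "Rank each risk as high, medium, or low and justify the ranking.",
    "Flag terms that look off-market and state what you are comparing them against.",
    "List anything you could not determine from the text provided.",
]


def build_review_prompt(document_text: str, considerations: list[str]) -> str:
    """Render one prompt that spells out each factor the model must cover."""
    # Number the checklist so the answer can be checked item by item.
    numbered = "\n".join(
        f"{i}. {item}" for i, item in enumerate(considerations, start=1)
    )
    return (
        "Review the agreement below.\n"
        "Address every numbered item in order and do not skip any.\n\n"
        f"Considerations:\n{numbered}\n\n"
        f"Agreement:\n{document_text}\n"
    )


if __name__ == "__main__":
    # Placeholder text stands in for the real agreement.
    print(build_review_prompt("<agreement text goes here>", CONSIDERATIONS))
```

Keeping the checklist as data rather than burying it in the prompt string means it can be reused across documents and extended whenever the model misses something.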




Date: August 10th, 2025 11:02 AM
Author: swashbuckling pearly hominid university

How do you keep it from disregarding your instructions? I've found that to be a huge problem.

(http://www.autoadmit.com/thread.php?thread_id=5760544&forum_id=2"#49171600)




Date: August 10th, 2025 11:07 AM
Author: Mahogany geriatric mad-dog skullcap sanctuary

The newest reasoning models are pretty damn good at following prompt instructions. They do sometimes still make mistakes but it's pretty rare these days

The bigger obstacle is what smoker described, the model not being able to form a fully accurate model of the situation it's analyzing, and so it just conceptually misses stuff. LLMs don't form abstract world models of situations so you have to prompt them to ad hoc "build" them by telling them all the different things to discern from the data and consider in their response

(http://www.autoadmit.com/thread.php?thread_id=5760544&forum_id=2"#49171609)




Date: August 10th, 2025 11:23 AM
Author: swashbuckling pearly hominid university

Rare? I spent hours yesterday fighting 5's disregard of really clear instructions. Maybe I'm just unlucky.

Edit: Response after one of the many times being caught: https://ibb.co/nqkS4QmY

(http://www.autoadmit.com/thread.php?thread_id=5760544&forum_id=2"#49171638)




Date: August 10th, 2025 12:06 PM
Author: Aromatic trailer park

If you have to give it an exhaustive list of considerations and instructions then it starts to lose its value relatively quickly. How many of us have just done something ourselves vs giving it to a junior for that exact reason? And that’s about how it feels, like a really good junior but one that autistically stumbles into a brilliant observation every now and then and that has encyclopedia level powers etc etc



(http://www.autoadmit.com/thread.php?thread_id=5760544&forum_id=2"#49171710)




Date: August 10th, 2025 11:24 AM
Author: fragrant gas station



(http://www.autoadmit.com/thread.php?thread_id=5760544&forum_id=2"#49171640)




Date: August 10th, 2025 1:44 PM
Author: glittery sooty incel

it turns out diffusion models are significantly more data efficient than the autoregressive models that are being used by all the major labs. >3x better data efficiency and this is only one possible improvement. the problem with the "AI is hitting a wall" theory is that there are many ways to use additional compute to improve model performance and the field is still too new to make strong conclusions of this sort.

https://jinjieni.notion.site/Diffusion-Language-Models-are-Super-Data-Learners-239d8f03a866800ab196e49928c019ac

(http://www.autoadmit.com/thread.php?thread_id=5760544&forum_id=2"#49171990)




Date: August 10th, 2025 2:33 PM
Author: Mahogany geriatric mad-dog skullcap sanctuary



(http://www.autoadmit.com/thread.php?thread_id=5760544&forum_id=2"#49172100)




Date: August 10th, 2025 3:12 PM
Author: Ivory vivacious senate

I think people would have preferred no transparency on GPT-5. Revealing that it routes to different models is something they should have kept private and left us to assume otherwise.

(http://www.autoadmit.com/thread.php?thread_id=5760544&forum_id=2"#49172181)




Date: August 10th, 2025 5:21 PM
Author: stubborn ladyboy address

"Overdue, overhyped, & underwhelming" the title of NYUUG's sextape

(http://www.autoadmit.com/thread.php?thread_id=5760544&forum_id=2"#49172468)