GPT-5: Overdue, overhyped and underwhelming. And that’s not the worst of it.
| bespoke up-to-no-good pisswyrm property | 08/09/25 | | bateful garnet theater | 08/09/25 | | chrome internal respiration garrison | 08/09/25 | | bespoke up-to-no-good pisswyrm property | 08/09/25 | | Red slippery heaven jap | 08/09/25 | | chrome internal respiration garrison | 08/09/25 | | peach provocative site codepig | 08/09/25 | | bespoke up-to-no-good pisswyrm property | 08/09/25 | | peach provocative site codepig | 08/09/25 | | bespoke up-to-no-good pisswyrm property | 08/09/25 | | chrome internal respiration garrison | 08/09/25 | | duck-like box office scourge upon the earth | 08/09/25 | | Swashbuckling rose juggernaut immigrant | 08/10/25 | | fuchsia seedy ticket booth | 08/10/25 | | duck-like box office scourge upon the earth | 08/10/25 | | chrome internal respiration garrison | 08/09/25 | | galvanic flesh jewess | 08/10/25 | | duck-like box office scourge upon the earth | 08/09/25 | | chrome internal respiration garrison | 08/09/25 | | bronze menage | 08/10/25 | | Charismatic Very Tactful New Version Indian Lodge | 08/10/25 | | chrome internal respiration garrison | 08/10/25 | | Charismatic Very Tactful New Version Indian Lodge | 08/10/25 | | duck-like box office scourge upon the earth | 08/10/25 | | galvanic flesh jewess | 08/10/25 | | duck-like box office scourge upon the earth | 08/10/25 | | galvanic flesh jewess | 08/10/25 | | Charismatic Very Tactful New Version Indian Lodge | 08/10/25 | | vermilion effete public bath | 08/10/25 | | Sapphire round eye psychic | 08/10/25 | | duck-like box office scourge upon the earth | 08/10/25 | | Racy National Filthpig | 08/10/25 | | drab frum boiling water university | 08/10/25 |
Poast new message in this thread
 |
Date: August 9th, 2025 8:02 PM Author: duck-like box office scourge upon the earth
"Well, I'd like to see the frontier models wriggle out of THIS jam!"
*frontier models wriggle their way out of the jam easily*
"Ah! Well. Nevertheless,"
(http://www.autoadmit.com/thread.php?thread_id=5760544&forum_id=2,#49170610) |
Date: August 9th, 2025 7:59 PM Author: duck-like box office scourge upon the earth
It's actually quite good. They appear to have fixed the issue where you got aggressively relegated to older shittier models
It's not way better than the other newest reasoning models but it's definitely better. It burns a lot of tokens though. Its answers are longer than they should be in a lot of cases
(http://www.autoadmit.com/thread.php?thread_id=5760544&forum_id=2,#49170598) |
 |
Date: August 10th, 2025 12:06 PM Author: Charismatic Very Tactful New Version Indian Lodge
If you have to give it an exhaustive list of considerations and instructions then it starts to lose its value relatively quickly. How many of us have just done something ourselves vs giving it to a junior for that exact reason? And that’s about how it feels, like a really good junior but one that autistically stumbles into a brilliant observation every now and then and that has encyclopedia level powers etc etc
(http://www.autoadmit.com/thread.php?thread_id=5760544&forum_id=2,#49171710)
|
Date: August 10th, 2025 1:44 PM Author: Sapphire round eye psychic
it turns out diffusion models are significantly more data efficient than the autoregressive models that are being used by all the major labs. >3x better data efficiency and this is only one possible improvement. the problem with the "AI is hitting a wall" theory is that there are many ways to use additional compute to improve model performance and the field is still too new to make strong conclusions of this sort.
https://jinjieni.notion.site/Diffusion-Language-Models-are-Super-Data-Learners-239d8f03a866800ab196e49928c019ac
(http://www.autoadmit.com/thread.php?thread_id=5760544&forum_id=2,#49171990) |
|
|