Date: January 12th, 2026 5:51 PM Author: slap-happy citrine corner
the apis are fine if you use those. if you're using the web app or desktop app there are a lot of things that can make it go slow, and having a ttt computer is one of the main ones. check your cpu usage while it's streaming; if it spikes to like 100% with a ton of threads, you just need to open a new session. the app keeps the entire conversation history in memory and on screen, and as the session grows the interface has to maintain and update every previous message. guarantee you it's all UI/render latency, not model inference latency. for instance claude streams fast as fuck on amazon bedrock, like almost instant.
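to make the render cost concrete, here's a minimal sketch of the naive pattern (this is NOT the actual claude client code, just an illustration of why per-token cost grows with session length):

```typescript
// Sketch of why streaming feels slower as a session grows. NOT the real
// claude client, just the naive pattern a lot of chat UIs start with.

interface Message {
  role: "user" | "assistant";
  text: string;
}

const history: Message[] = []; // grows for the life of the session

// Naive: rebuild the entire transcript on every streamed token.
// Per-token cost is O(history length), so a long session on a weak cpu
// shows up as laggy streaming even when tokens arrive instantly.
function renderAllNaive(container: HTMLElement): void {
  container.innerHTML = ""; // throw away and rebuild every DOM node
  for (const msg of history) {
    const div = document.createElement("div");
    div.className = `msg ${msg.role}`;
    div.textContent = msg.text;
    container.appendChild(div);
  }
}

// Incremental: mutate only the message that is actually streaming.
// Per-token cost is O(1) no matter how long the session gets.
// (Assumes history is non-empty and the last DOM child is the last message.)
function appendTokenIncremental(container: HTMLElement, token: string): void {
  const last = history[history.length - 1];
  last.text += token;
  (container.lastElementChild as HTMLElement).textContent = last.text;
}
```

opening a new session works because it resets the transcript back to the cheap case.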
Date: January 12th, 2026 5:54 PM Author: Beta Odious Lay
because there's a shortage of GPUs and RAM. Most OpenAI users are only interacting with quantized models, because why let proles use the full-precision model?
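rough back-of-envelope on why providers quantize (the 70B size and bit widths here are illustrative assumptions, not anything OpenAI has published):

```typescript
// Weight-memory math for serving an LLM at different precisions.
// Parameter count and bit widths are illustrative assumptions only.

function weightsGiB(params: number, bitsPerWeight: number): number {
  return (params * bitsPerWeight) / 8 / 1024 ** 3;
}

const params = 70e9; // hypothetical 70B-parameter model

console.log(`fp16: ${weightsGiB(params, 16).toFixed(0)} GiB`); // ~130 GiB just for weights
console.log(`int8: ${weightsGiB(params, 8).toFixed(0)} GiB`);  // ~65 GiB
console.log(`int4: ${weightsGiB(params, 4).toFixed(0)} GiB`);  // ~33 GiB
```

halving or quartering the weight footprint means serving way more users per GPU, which is the whole incentive when hardware is scarce.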
Date: January 12th, 2026 5:58 PM Author: slap-happy citrine corner
no. all of the "slow ai" complaints come down to slow streaming, which is a ui rendering issue caused by an underpowered local cpu and a poorly optimized client, not the model.
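easy way to check for yourself: timestamp the raw chunks as they come off the wire, before any rendering happens. the url and request options below are placeholders for whatever streaming endpoint you're hitting, not a real config:

```typescript
// Diagnostic sketch: log inter-chunk gaps straight off the network stream.
// If gaps here are small but on-screen text crawls, the client is the bottleneck.

async function timeStream(url: string, init: RequestInit): Promise<void> {
  const res = await fetch(url, init);
  if (!res.body) throw new Error("response has no streaming body");
  const reader = res.body.getReader();
  let prev = performance.now();
  for (;;) {
    const { done, value } = await reader.read();
    if (done) break;
    const now = performance.now();
    console.log(`chunk: ${value.length} bytes, +${(now - prev).toFixed(1)} ms since last`);
    prev = now;
  }
}
```

if the gaps are tens of milliseconds while the app visibly stutters, that's your render latency right there.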