\
  The most prestigious law school admissions discussion board in the world.
BackRefresh Options Favorite

OpenAI's "Whisper" is astonishingly good (TSINAH)

I spent a few hours building a real-time transcription servi...
boyish bawdyhouse
  03/30/23
You know Android phones have done this for years, right? ...
glittery casino
  03/30/23
Their voice transcription isn't nearly as good. The OpenAI o...
boyish bawdyhouse
  03/30/23
Look at pyannote for diarization
Soul-stirring Bat-shit-crazy Property
  03/31/23
thanks
boyish bawdyhouse
  03/31/23
Google's latest model is even better: https://arxiv.org/a...
Excitant navy plaza
  04/01/23
What a time to be alive.
boyish bawdyhouse
  04/01/23
Why don't you upgrade your card?
Cruel-hearted macaca
  03/30/23
He's not a gamer.
glittery casino
  03/30/23
If you can believe it, I bought this computer back in 2010 a...
boyish bawdyhouse
  03/31/23
I'm going to build a server soon, and I told ChatGPT my requ...
chrome turdskin
  03/31/23
why don't you do this instead of shitlaw
vivacious deranged church building
  03/30/23
Efficient C++ implementation with SIMD acceleration https...
Soul-stirring Bat-shit-crazy Property
  03/30/23
holy shit, that's great
boyish bawdyhouse
  03/31/23
what does it do and how do i run cmake? also what is SIMD i...
Green vibrant sound barrier
  03/31/23
SIMD is a way of processing multiple numbers at the same tim...
Soul-stirring Bat-shit-crazy Property
  03/31/23
cd into cmake folder, run cmake .. not that you need it f...
boyish bawdyhouse
  03/31/23
it took me a couple minutes to build because -mf16c was miss...
boyish bawdyhouse
  03/31/23
Holy fucking shit. No deps means I literally just cloned the...
chrome turdskin
  03/31/23
Apparently it’s fast enough for cell phones
Soul-stirring Bat-shit-crazy Property
  03/31/23
yeah that's awesome, i already downloaded the large v2 model...
chrome turdskin
  03/31/23
...
boyish bawdyhouse
  03/31/23
damn imagine using this to listen to opposing counsel and th...
chrome turdskin
  03/31/23
it's over
ultramarine giraffe
  03/31/23
Use ElevenLabs' voice cloning service and "attend"...
boyish bawdyhouse
  03/31/23
Mind. Blown.
Tripping range hominid
  04/01/23
You can clone your own voice with it. I cloned CSLG's for th...
boyish bawdyhouse
  04/01/23
...
boyish bawdyhouse
  04/01/23
...
boyish bawdyhouse
  04/10/23


Poast new message in this thread



Reply Favorite

Date: March 30th, 2023 11:51 PM
Author: boyish bawdyhouse

I spent a few hours building a real-time transcription service in python. I'm running it on a g3s.xlarge instance on AWS, which is overkill, but nvidia stopped supporting my computer's old ass graphics cards years ago, so I needed access to the CUDA libraries.

The input was one of my fiancee's pharmacy school lectures. Using ffmpeg, I streamed the output into a FIFO file. I used PulseAudio's module-pipe-source to read from that file, which in effect, makes it a virtual microphone. I used a python microphone library to read from that audio device and then used whisper to transcribe the audio frames in near real-time, as you can see here: https://i.imgur.com/i9vCzd4.mp4

The next logical step is to use something like this to interact with ChatGPT by voice to dictate my prompts. This is pretty cool.

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46121579)



Reply Favorite

Date: March 30th, 2023 11:53 PM
Author: glittery casino

You know Android phones have done this for years, right?

Still cool though.

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46121588)



Reply Favorite

Date: March 30th, 2023 11:59 PM
Author: boyish bawdyhouse

Their voice transcription isn't nearly as good. The OpenAI one outperforms almost everything else out there--and it's totally free. This is really cool for single-speaker applications. It gets much more complex when you involve multiple speakers. The fact this only took a couple hours is the most exciting part of all.

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46121613)



Reply Favorite

Date: March 31st, 2023 12:03 AM
Author: Soul-stirring Bat-shit-crazy Property

Look at pyannote for diarization

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46121623)



Reply Favorite

Date: March 31st, 2023 12:06 AM
Author: boyish bawdyhouse

thanks

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46121632)



Reply Favorite

Date: April 1st, 2023 3:16 PM
Author: Excitant navy plaza

Google's latest model is even better:

https://arxiv.org/abs/2303.01037

probably won't be long before see even better than Whisper models out there.

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46128652)



Reply Favorite

Date: April 1st, 2023 3:18 PM
Author: boyish bawdyhouse

What a time to be alive.

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46128659)



Reply Favorite

Date: March 30th, 2023 11:55 PM
Author: Cruel-hearted macaca

Why don't you upgrade your card?

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46121594)



Reply Favorite

Date: March 30th, 2023 11:55 PM
Author: glittery casino

He's not a gamer.

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46121596)



Reply Favorite

Date: March 31st, 2023 12:02 AM
Author: boyish bawdyhouse

If you can believe it, I bought this computer back in 2010 and it's still going strong. It's not worth it for me to upgrade any components at this point, since the money would be better spent on a new machine.

Back then, I spent around $10,000 building this computer. It was absolutely top of the line, and I have definitely gotten my money's worth out of it. I recently spec'd out a new machine just to re-familiarize myself with what's out there: https://pcpartpicker.com/list/gbGPZw

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46121621)



Reply Favorite

Date: March 31st, 2023 12:11 AM
Author: chrome turdskin

I'm going to build a server soon, and I told ChatGPT my requirements and asked it to recommend CPU, GPU, mobo, case, and a few other components, and the list it spit out to me was really great and it took a few seconds instead of me fucking around on PCPP

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46121654)



Reply Favorite

Date: March 30th, 2023 11:56 PM
Author: vivacious deranged church building

why don't you do this instead of shitlaw

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46121598)



Reply Favorite

Date: March 30th, 2023 11:59 PM
Author: Soul-stirring Bat-shit-crazy Property

Efficient C++ implementation with SIMD acceleration

https://github.com/ggerganov/whisper.cpp

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46121615)



Reply Favorite

Date: March 31st, 2023 12:04 AM
Author: boyish bawdyhouse

holy shit, that's great

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46121629)



Reply Favorite

Date: March 31st, 2023 12:06 AM
Author: Green vibrant sound barrier

what does it do and how do i run cmake?

also what is SIMD is it just x86 64?

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46121634)



Reply Favorite

Date: March 31st, 2023 12:14 AM
Author: Soul-stirring Bat-shit-crazy Property

SIMD is a way of processing multiple numbers at the same time, instead of one at a time. It stands for Single Instruction Multiple Data.

Every CPU since the introduction of Intel’s MMX in the 90s has supported SIMD instructions. x86_64 has SIMD instruction sets like SSE, AVX, AVX2, and AVX512. ARM processors have the NEON instruction set.

To run CMake download and install it to your system. It’s a build system that helps you to organize disparate C and C++ files and libraries and compile them into a single executable.

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46121674)



Reply Favorite

Date: March 31st, 2023 12:19 AM
Author: boyish bawdyhouse

cd into cmake folder, run cmake ..

not that you need it for a program this small. I just ran make, and when that failed, I used gcc-10 to add the -mf16c compiler flag before adding it to the CFLAGS and re-running make.

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46121701)



Reply Favorite

Date: March 31st, 2023 12:17 AM
Author: boyish bawdyhouse

it took me a couple minutes to build because -mf16c was missing from the CFLAGS in the Makefile.

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46121692)



Reply Favorite

Date: March 31st, 2023 12:23 AM
Author: chrome turdskin

Holy fucking shit. No deps means I literally just cloned the repo and executed `make`. I am already imagining use cases. At minimum it's so fast I could add it to the stuff I have running in various rooms to get really good voice command abilities.

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46121713)



Reply Favorite

Date: March 31st, 2023 12:25 AM
Author: Soul-stirring Bat-shit-crazy Property

Apparently it’s fast enough for cell phones

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46121722)



Reply Favorite

Date: March 31st, 2023 12:26 AM
Author: chrome turdskin

yeah that's awesome, i already downloaded the large v2 model, which i'm assuming has the broadest multilingual support

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46121728)



Reply Favorite

Date: March 31st, 2023 12:59 AM
Author: boyish bawdyhouse



(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46121854)



Reply Favorite

Date: March 31st, 2023 12:25 AM
Author: chrome turdskin

damn imagine using this to listen to opposing counsel and then asking chatgpt to generate objections or lines of questioning or something

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46121727)



Reply Favorite

Date: March 31st, 2023 12:27 AM
Author: ultramarine giraffe

it's over

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46121733)



Reply Favorite

Date: March 31st, 2023 1:00 AM
Author: boyish bawdyhouse

Use ElevenLabs' voice cloning service and "attend" meetings with your camera off.

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46121857)



Reply Favorite

Date: April 1st, 2023 3:30 PM
Author: Tripping range hominid

Mind. Blown.

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46128692)



Reply Favorite

Date: April 1st, 2023 3:34 PM
Author: boyish bawdyhouse

You can clone your own voice with it. I cloned CSLG's for this: https://vocaroo.com/1k3hpB2jW1G1

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46128698)



Reply Favorite

Date: April 1st, 2023 3:08 PM
Author: boyish bawdyhouse



(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46128635)



Reply Favorite

Date: April 10th, 2023 6:36 PM
Author: boyish bawdyhouse



(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46169038)