OpenAI's "Whisper" is astonishingly good (TSINAH) | AutoAdmit.com

The most prestigious law school admissions discussion board in the world.

Date: March 30th, 2023 11:51 PM
Author: boyish bawdyhouse

I spent a few hours building a real-time transcription service in Python. I'm running it on a g3s.xlarge instance on AWS, which is overkill, but NVIDIA stopped supporting my computer's old ass graphics cards years ago, so I needed access to the CUDA libraries.

The input was one of my fiancee's pharmacy school lectures. Using ffmpeg, I streamed the output into a FIFO file. I used PulseAudio's module-pipe-source to read from that file, which, in effect, makes it a virtual microphone. I used a Python microphone library to read from that audio device and then used Whisper to transcribe the audio frames in near real-time, as you can see here: https://i.imgur.com/i9vCzd4.mp4

The next logical step is to use something like this to interact with ChatGPT by voice to dictate my prompts. This is pretty cool.

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46121579)
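
A minimal sketch of the transcription loop described in the post above, not the poster's actual code. It assumes the `openai-whisper` and `numpy` packages are installed, and it simplifies the setup by reading raw 16 kHz mono 16-bit PCM straight from a pipe instead of going through the PulseAudio virtual microphone. The chunk length, the default model name, and the function names are illustrative choices.

```python
import io
import struct

SAMPLE_RATE = 16_000   # Whisper models work on 16 kHz mono audio
CHUNK_SECONDS = 5      # hypothetical window length; tune to taste


def pcm16_to_floats(raw: bytes) -> list[float]:
    """Convert signed 16-bit little-endian PCM bytes to floats in [-1.0, 1.0)."""
    n = len(raw) // 2                                 # ignore a dangling odd byte
    samples = struct.unpack(f"<{n}h", raw[: 2 * n])
    return [s / 32768.0 for s in samples]


def iter_chunks(stream: io.BufferedIOBase, chunk_bytes: int):
    """Yield chunks of up to chunk_bytes from a binary stream until EOF."""
    while True:
        chunk = stream.read(chunk_bytes)
        if not chunk:
            break
        yield chunk


def transcribe_stream(stream: io.BufferedIOBase, model_name: str = "base") -> None:
    """Transcribe raw PCM from `stream` one fixed-size chunk at a time."""
    import numpy as np   # third-party; imported lazily so the helpers work without it
    import whisper       # the openai-whisper package

    model = whisper.load_model(model_name)
    chunk_bytes = SAMPLE_RATE * CHUNK_SECONDS * 2     # 2 bytes per sample
    for chunk in iter_chunks(stream, chunk_bytes):
        audio = np.asarray(pcm16_to_floats(chunk), dtype=np.float32)
        print(model.transcribe(audio)["text"], flush=True)
```

One way to feed it, assuming the code is saved as a hypothetical `transcribe.py` that ends with `import sys` and `transcribe_stream(sys.stdin.buffer)`, and the lecture file is called `lecture.mp4`: `ffmpeg -i lecture.mp4 -f s16le -ac 1 -ar 16000 - | python transcribe.py`. Fixed-size chunks can split words at the boundaries, so a more careful version would overlap windows or cut on silence.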

Date: March 30th, 2023 11:53 PM
Author: glittery casino

You know Android phones have done this for years, right?

Still cool though.

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46121588)

Date: March 30th, 2023 11:59 PM
Author: boyish bawdyhouse

Their voice transcription isn't nearly as good. The OpenAI one outperforms almost everything else out there--and it's totally free. This is really cool for single-speaker applications. It gets much more complex when you involve multiple speakers. The fact this only took a couple hours is the most exciting part of all.

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46121613)

Date: March 31st, 2023 12:03 AM
Author: Soul-stirring Bat-shit-crazy Property

Look at pyannote for diarization

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46121623)
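
A rough sketch of how diarization output might be combined with a transcript for the multi-speaker case. It assumes you already have Whisper-style segments (dicts with `start`, `end`, and `text`, the shape of the `segments` list in Whisper's result) and speaker turns as `(start, end, speaker)` tuples pulled out of a diarization tool such as pyannote. The `label_segments` helper and the sample data are made up for illustration, and no pyannote calls are shown.

```python
def overlap(a_start: float, a_end: float, b_start: float, b_end: float) -> float:
    """Length in seconds of the intersection of two time intervals (0 if disjoint)."""
    return max(0.0, min(a_end, b_end) - max(a_start, b_start))


def label_segments(segments, turns):
    """Tag each transcript segment with the speaker whose turn overlaps it most."""
    labeled = []
    for seg in segments:
        best_speaker, best_overlap = "unknown", 0.0
        for t_start, t_end, speaker in turns:
            shared = overlap(seg["start"], seg["end"], t_start, t_end)
            if shared > best_overlap:
                best_speaker, best_overlap = speaker, shared
        labeled.append((best_speaker, seg["text"].strip()))
    return labeled


# Invented sample data, only to show the shapes involved.
segments = [
    {"start": 0.0, "end": 4.0, "text": " Good morning, class."},
    {"start": 4.0, "end": 7.5, "text": " Can you repeat that?"},
]
turns = [(0.0, 4.2, "SPEAKER_00"), (4.2, 8.0, "SPEAKER_01")]

for speaker, text in label_segments(segments, turns):
    print(f"{speaker}: {text}")
```

Picking the turn with the largest overlap is the simplest possible rule. It will mislabel segments where two people talk over each other, which is part of why the multi-speaker case is harder.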

Date: March 31st, 2023 12:06 AM
Author: boyish bawdyhouse

thanks

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46121632)

Date: April 1st, 2023 3:16 PM
Author: Excitant navy plaza

Google's latest model is even better:

https://arxiv.org/abs/2303.01037

Probably won't be long before we see models even better than Whisper out there.

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46128652)

Date: April 1st, 2023 3:18 PM
Author: boyish bawdyhouse

What a time to be alive.

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46128659)

Date: March 30th, 2023 11:55 PM
Author: Cruel-hearted macaca

Why don't you upgrade your card?

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46121594)

Date: March 30th, 2023 11:55 PM
Author: glittery casino

He's not a gamer.

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46121596)

Date: March 31st, 2023 12:02 AM
Author: boyish bawdyhouse

If you can believe it, I bought this computer back in 2010 and it's still going strong. It's not worth it for me to upgrade any components at this point, since the money would be better spent on a new machine.

Back then, I spent around $10,000 building this computer. It was absolutely top of the line, and I have definitely gotten my money's worth out of it. I recently spec'd out a new machine just to re-familiarize myself with what's out there: https://pcpartpicker.com/list/gbGPZw

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46121621)

Date: March 31st, 2023 12:11 AM
Author: chrome turdskin

I'm going to build a server soon. I told ChatGPT my requirements and asked it to recommend a CPU, GPU, mobo, case, and a few other components. The list it spit out was really great, and it took a few seconds instead of me fucking around on PCPP.

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46121654)

Date: March 30th, 2023 11:56 PM
Author: vivacious deranged church building

why don't you do this instead of shitlaw

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46121598)

Date: March 30th, 2023 11:59 PM
Author: Soul-stirring Bat-shit-crazy Property

Efficient C++ implementation with SIMD acceleration

https://github.com/ggerganov/whisper.cpp

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46121615)

Date: March 31st, 2023 12:04 AM
Author: boyish bawdyhouse

holy shit, that's great

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46121629)

Date: March 31st, 2023 12:06 AM
Author: Green vibrant sound barrier

what does it do and how do i run cmake?
also what is SIMD? is it just x86-64?

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46121634)

Date: March 31st, 2023 12:14 AM
Author: Soul-stirring Bat-shit-crazy Property

SIMD is a way of processing multiple numbers at the same time, instead of one at a time. It stands for Single Instruction Multiple Data.

Nearly every mainstream desktop and mobile CPU since Intel introduced MMX in the 90s has supported some form of SIMD instructions. x86_64 has SIMD instruction sets like SSE, AVX, AVX2, and AVX-512. ARM processors have the NEON instruction set.

To run CMake, download and install it on your system. It's a build-system generator that helps you organize disparate C and C++ files and libraries and produces the build files used to compile them into a single executable.

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46121674)
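
To see which of these instruction sets a given machine reports, one option on Linux is to read `/proc/cpuinfo`. The sketch below assumes the usual Linux layout (a `flags` line on x86, a `Features` line on ARM). The `simd_extensions` helper and the short list of names it looks for are illustrative, not exhaustive.

```python
# Flag names as they appear in /proc/cpuinfo; "asimd" is how 64-bit ARM reports NEON.
INTERESTING = ("mmx", "sse", "sse2", "avx", "avx2", "avx512f", "f16c", "neon", "asimd")


def simd_extensions(cpuinfo_text: str) -> list[str]:
    """Return the SIMD-related flags from INTERESTING found in cpuinfo-style text."""
    found = set()
    for line in cpuinfo_text.splitlines():
        key, _, value = line.partition(":")
        if key.strip().lower() in ("flags", "features"):
            found.update(value.split())
    return [name for name in INTERESTING if name in found]


# Invented sample input, trimmed to a handful of flags.
sample = "processor\t: 0\nflags\t\t: fpu mmx sse sse2 avx f16c\n"
print(simd_extensions(sample))   # ['mmx', 'sse', 'sse2', 'avx', 'f16c']
```

On a real Linux box you would pass in the file's contents, for example `simd_extensions(open("/proc/cpuinfo").read())`.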

Date: March 31st, 2023 12:19 AM
Author: boyish bawdyhouse

cd into cmake folder, run cmake ..

Not that you need it for a program this small. I just ran make, and when that failed, I used gcc-10 and added the -mf16c compiler flag to the CFLAGS before re-running make.

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46121701)

Date: March 31st, 2023 12:17 AM
Author: boyish bawdyhouse

it took me a couple minutes to build because -mf16c was missing from the CFLAGS in the Makefile.

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46121692)
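
For context on the flag: `-mf16c` tells GCC it may use the x86 F16C instructions, which convert between 16-bit and 32-bit floating-point values in hardware. The snippet below only illustrates what such a conversion does to a value, using the half-precision `e` format in Python's `struct` module. It is not how whisper.cpp performs the conversion, and the helper name is made up.

```python
import struct


def round_trip_fp16(x: float) -> float:
    """Squeeze a float into 16-bit half precision and expand it back."""
    return struct.unpack("<e", struct.pack("<e", x))[0]


for value in (1.0, 0.1, 3.14159):
    # Values that half precision cannot represent exactly come back slightly changed.
    print(value, "->", round_trip_fp16(value))
```

Half precision takes two bytes per number instead of four, at the cost of the small rounding error the loop prints.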

Date: March 31st, 2023 12:23 AM
Author: chrome turdskin

Holy fucking shit. No deps means I literally just cloned the repo and executed `make`. I am already imagining use cases. At minimum it's so fast I could add it to the stuff I have running in various rooms to get really good voice command abilities.

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46121713)

Date: March 31st, 2023 12:25 AM
Author: Soul-stirring Bat-shit-crazy Property

Apparently it’s fast enough for cell phones

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46121722)

Date: March 31st, 2023 12:26 AM
Author: chrome turdskin

yeah that's awesome, i already downloaded the large v2 model, which i'm assuming has the broadest multilingual support

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46121728)

Date: March 31st, 2023 12:59 AM
Author: boyish bawdyhouse

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46121854)

Date: March 31st, 2023 12:25 AM
Author: chrome turdskin

damn imagine using this to listen to opposing counsel and then asking chatgpt to generate objections or lines of questioning or something

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46121727)

Date: March 31st, 2023 12:27 AM
Author: ultramarine giraffe

it's over

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46121733)

Date: March 31st, 2023 1:00 AM
Author: boyish bawdyhouse

Use ElevenLabs' voice cloning service and "attend" meetings with your camera off.

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46121857)

Date: April 1st, 2023 3:30 PM
Author: Tripping range hominid

Mind. Blown.

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46128692)

Date: April 1st, 2023 3:34 PM
Author: boyish bawdyhouse

You can clone your own voice with it. I cloned CSLG's for this: https://vocaroo.com/1k3hpB2jW1G1

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46128698)

Date: April 1st, 2023 3:08 PM
Author: boyish bawdyhouse

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46128635)

Date: April 10th, 2023 6:36 PM
Author: boyish bawdyhouse

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46169038)