\
  The most prestigious law school admissions discussion board in the world.
BackRefresh Options Favorite

OpenAI's "Whisper" is astonishingly good (TSINAH)

I spent a few hours building a real-time transcription servi...
hideous cerise stage
  03/30/23
You know Android phones have done this for years, right? ...
Big fluffy hissy fit
  03/30/23
Their voice transcription isn't nearly as good. The OpenAI o...
hideous cerise stage
  03/30/23
Look at pyannote for diarization
soul-stirring forum main people
  03/31/23
thanks
hideous cerise stage
  03/31/23
Google's latest model is even better: https://arxiv.org/a...
Razzmatazz Son Of Senegal
  04/01/23
What a time to be alive.
hideous cerise stage
  04/01/23
Why don't you upgrade your card?
crusty goyim
  03/30/23
He's not a gamer.
Big fluffy hissy fit
  03/30/23
If you can believe it, I bought this computer back in 2010 a...
hideous cerise stage
  03/31/23
I'm going to build a server soon, and I told ChatGPT my requ...
canary swashbuckling famous landscape painting
  03/31/23
why don't you do this instead of shitlaw
Coral Alpha
  03/30/23
Efficient C++ implementation with SIMD acceleration https...
soul-stirring forum main people
  03/30/23
holy shit, that's great
hideous cerise stage
  03/31/23
what does it do and how do i run cmake? also what is SIMD i...
Submissive henna house bbw
  03/31/23
SIMD is a way of processing multiple numbers at the same tim...
soul-stirring forum main people
  03/31/23
cd into cmake folder, run cmake .. not that you need it f...
hideous cerise stage
  03/31/23
it took me a couple minutes to build because -mf16c was miss...
hideous cerise stage
  03/31/23
Holy fucking shit. No deps means I literally just cloned the...
canary swashbuckling famous landscape painting
  03/31/23
Apparently it’s fast enough for cell phones
soul-stirring forum main people
  03/31/23
yeah that's awesome, i already downloaded the large v2 model...
canary swashbuckling famous landscape painting
  03/31/23
...
hideous cerise stage
  03/31/23
damn imagine using this to listen to opposing counsel and th...
canary swashbuckling famous landscape painting
  03/31/23
it's over
Fantasy-prone anal shrine death wish
  03/31/23
Use ElevenLabs' voice cloning service and "attend"...
hideous cerise stage
  03/31/23
Mind. Blown.
scarlet fortuitous meteor
  04/01/23
You can clone your own voice with it. I cloned CSLG's for th...
hideous cerise stage
  04/01/23
...
hideous cerise stage
  04/01/23
...
hideous cerise stage
  04/10/23


Poast new message in this thread



Reply Favorite

Date: March 30th, 2023 11:51 PM
Author: hideous cerise stage

I spent a few hours building a real-time transcription service in python. I'm running it on a g3s.xlarge instance on AWS, which is overkill, but nvidia stopped supporting my computer's old ass graphics cards years ago, so I needed access to the CUDA libraries.

The input was one of my fiancee's pharmacy school lectures. Using ffmpeg, I streamed the output into a FIFO file. I used PulseAudio's module-pipe-source to read from that file, which in effect, makes it a virtual microphone. I used a python microphone library to read from that audio device and then used whisper to transcribe the audio frames in near real-time, as you can see here: https://i.imgur.com/i9vCzd4.mp4

The next logical step is to use something like this to interact with ChatGPT by voice to dictate my prompts. This is pretty cool.

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46121579)



Reply Favorite

Date: March 30th, 2023 11:53 PM
Author: Big fluffy hissy fit

You know Android phones have done this for years, right?

Still cool though.

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46121588)



Reply Favorite

Date: March 30th, 2023 11:59 PM
Author: hideous cerise stage

Their voice transcription isn't nearly as good. The OpenAI one outperforms almost everything else out there--and it's totally free. This is really cool for single-speaker applications. It gets much more complex when you involve multiple speakers. The fact this only took a couple hours is the most exciting part of all.

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46121613)



Reply Favorite

Date: March 31st, 2023 12:03 AM
Author: soul-stirring forum main people

Look at pyannote for diarization

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46121623)



Reply Favorite

Date: March 31st, 2023 12:06 AM
Author: hideous cerise stage

thanks

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46121632)



Reply Favorite

Date: April 1st, 2023 3:16 PM
Author: Razzmatazz Son Of Senegal

Google's latest model is even better:

https://arxiv.org/abs/2303.01037

probably won't be long before see even better than Whisper models out there.

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46128652)



Reply Favorite

Date: April 1st, 2023 3:18 PM
Author: hideous cerise stage

What a time to be alive.

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46128659)



Reply Favorite

Date: March 30th, 2023 11:55 PM
Author: crusty goyim

Why don't you upgrade your card?

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46121594)



Reply Favorite

Date: March 30th, 2023 11:55 PM
Author: Big fluffy hissy fit

He's not a gamer.

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46121596)



Reply Favorite

Date: March 31st, 2023 12:02 AM
Author: hideous cerise stage

If you can believe it, I bought this computer back in 2010 and it's still going strong. It's not worth it for me to upgrade any components at this point, since the money would be better spent on a new machine.

Back then, I spent around $10,000 building this computer. It was absolutely top of the line, and I have definitely gotten my money's worth out of it. I recently spec'd out a new machine just to re-familiarize myself with what's out there: https://pcpartpicker.com/list/gbGPZw

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46121621)



Reply Favorite

Date: March 31st, 2023 12:11 AM
Author: canary swashbuckling famous landscape painting

I'm going to build a server soon, and I told ChatGPT my requirements and asked it to recommend CPU, GPU, mobo, case, and a few other components, and the list it spit out to me was really great and it took a few seconds instead of me fucking around on PCPP

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46121654)



Reply Favorite

Date: March 30th, 2023 11:56 PM
Author: Coral Alpha

why don't you do this instead of shitlaw

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46121598)



Reply Favorite

Date: March 30th, 2023 11:59 PM
Author: soul-stirring forum main people

Efficient C++ implementation with SIMD acceleration

https://github.com/ggerganov/whisper.cpp

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46121615)



Reply Favorite

Date: March 31st, 2023 12:04 AM
Author: hideous cerise stage

holy shit, that's great

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46121629)



Reply Favorite

Date: March 31st, 2023 12:06 AM
Author: Submissive henna house bbw

what does it do and how do i run cmake?

also what is SIMD is it just x86 64?

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46121634)



Reply Favorite

Date: March 31st, 2023 12:14 AM
Author: soul-stirring forum main people

SIMD is a way of processing multiple numbers at the same time, instead of one at a time. It stands for Single Instruction Multiple Data.

Every CPU since the introduction of Intel’s MMX in the 90s has supported SIMD instructions. x86_64 has SIMD instruction sets like SSE, AVX, AVX2, and AVX512. ARM processors have the NEON instruction set.

To run CMake download and install it to your system. It’s a build system that helps you to organize disparate C and C++ files and libraries and compile them into a single executable.

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46121674)



Reply Favorite

Date: March 31st, 2023 12:19 AM
Author: hideous cerise stage

cd into cmake folder, run cmake ..

not that you need it for a program this small. I just ran make, and when that failed, I used gcc-10 to add the -mf16c compiler flag before adding it to the CFLAGS and re-running make.

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46121701)



Reply Favorite

Date: March 31st, 2023 12:17 AM
Author: hideous cerise stage

it took me a couple minutes to build because -mf16c was missing from the CFLAGS in the Makefile.

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46121692)



Reply Favorite

Date: March 31st, 2023 12:23 AM
Author: canary swashbuckling famous landscape painting

Holy fucking shit. No deps means I literally just cloned the repo and executed `make`. I am already imagining use cases. At minimum it's so fast I could add it to the stuff I have running in various rooms to get really good voice command abilities.

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46121713)



Reply Favorite

Date: March 31st, 2023 12:25 AM
Author: soul-stirring forum main people

Apparently it’s fast enough for cell phones

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46121722)



Reply Favorite

Date: March 31st, 2023 12:26 AM
Author: canary swashbuckling famous landscape painting

yeah that's awesome, i already downloaded the large v2 model, which i'm assuming has the broadest multilingual support

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46121728)



Reply Favorite

Date: March 31st, 2023 12:59 AM
Author: hideous cerise stage



(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46121854)



Reply Favorite

Date: March 31st, 2023 12:25 AM
Author: canary swashbuckling famous landscape painting

damn imagine using this to listen to opposing counsel and then asking chatgpt to generate objections or lines of questioning or something

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46121727)



Reply Favorite

Date: March 31st, 2023 12:27 AM
Author: Fantasy-prone anal shrine death wish

it's over

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46121733)



Reply Favorite

Date: March 31st, 2023 1:00 AM
Author: hideous cerise stage

Use ElevenLabs' voice cloning service and "attend" meetings with your camera off.

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46121857)



Reply Favorite

Date: April 1st, 2023 3:30 PM
Author: scarlet fortuitous meteor

Mind. Blown.

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46128692)



Reply Favorite

Date: April 1st, 2023 3:34 PM
Author: hideous cerise stage

You can clone your own voice with it. I cloned CSLG's for this: https://vocaroo.com/1k3hpB2jW1G1

(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46128698)



Reply Favorite

Date: April 1st, 2023 3:08 PM
Author: hideous cerise stage



(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46128635)



Reply Favorite

Date: April 10th, 2023 6:36 PM
Author: hideous cerise stage



(http://www.autoadmit.com/thread.php?thread_id=5316132&forum_id=2#46169038)