Meet Jambi - a blazing-fast voice transcription application built with Rust

guttermonk@lemmy.ml · edited 23 hours ago

black_flag@lemmy.dbzer0.com · 21 hours ago

Is there any reason you chose vosk over whisper.c++?

guttermonk@lemmy.ml · 20 hours ago

I’m not familiar with whisper.c++, but I did try faster-whisper. Unfortunately, transcriptions took upward of 40 seconds, and it didn’t offer live transcription, which is a nice feature of Vosk. The readme has a comparison that covers the other differences. That said, Jambi should be relatively modular, so it shouldn’t take much to swap it back to Whisper if that’s what you prefer to use. Whisper is in the Nix flake as optional, and the program allows you to change models, but I haven’t bothered trying to switch back to Whisper since Vosk has been more performant.

guttermonk@lemmy.ml · 20 hours ago

I should also add that if you don’t want to reconfigure Jambi to use Whisper, you can try WhisperNow, which is built in Python and uses Whisper. Whisper’s transcription performance was about the same in Jambi as in WhisperNow, so after testing it in both programs I decided to move forward with Vosk.

FailBetter@crust.piefed.social · 21 hours ago

Probably just playing it safe, since Vosk has been around longer, iirc. It’s kinda weird to use Rust and not choose the more performant Whisper, tho.