User:Ralsettem: Difference between revisions

Latest revision as of 19:58, 3 May 2024

Self-proclaimed nerd venturing out into the world of transcription supporting a podcast I love to listen to.

After some computer issues I'm back and I'm using whisper-diarization to create diarization transcriptions. It uses Whisper and nemo.

@@ Line 1: / Line 1: @@
 Self-proclaimed nerd venturing out into the world of transcription supporting a podcast I love to listen to.
-I am using [https://github.com/yinruiqing/pyannote-whisper pyannote-whisper] to create diarization transcriptions. It uses a local of installation of [https://openai.com/blog/whisper/ Whisper] by OpenAi on [https://github.com/openai/whisper Github] to using the large model to transcribe podcast episodes and [https://github.com/pyannote/pyannote-audio Pyannote on Github] to label the different speakers.
+After some computer issues I'm back and I'm using [https://github.com/MahmoudAshraf97/whisper-diarization whisper-diarization] to create diarization transcriptions. It uses [https://openai.com/blog/whisper/ Whisper] and [https://docs.nvidia.com/nemo-framework/user-guide/latest/overview.html nemo].
-It's about 80% correct most of the time, but there are errors.