User:Ralsettem: Difference between revisions
m (Creating user page for new user.) |
mNo edit summary |
Self-proclaimed nerd venturing out into the world of transcription, supporting a podcast I love to listen to.
I’m using a local installation of [https://openai.com/blog/whisper/ Whisper] by OpenAI, from [https://github.com/openai/whisper GitHub], with the large model to transcribe podcast episodes.
It’s theoretically possible to get [https://github.com/lablab-ai/Whisper-transcription_and_diarization-speaker-identification- diarization] of speakers using [https://github.com/pyannote/pyannote-audio pyannote on GitHub]; however, I don’t know Python, and there is a lack of video tutorials that show the process.
Hopefully some kind developers will create a web UI that can transcribe with diarization.
Revision as of 07:01, 1 November 2022
Self-proclaimed nerd venturing out into the world of transcription supporting a podcast I love to listen to.
I’m using a local installation of Whisper by OpenAI, from GitHub, with the large model to transcribe podcast episodes. It’s theoretically possible to get diarization of speakers using pyannote on GitHub; however, I don’t know Python, and there is a lack of video tutorials that show the process. Hopefully some kind developers will create a web UI that can transcribe with diarization.
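For anyone curious what "transcribe with diarization" could look like in code, here is a minimal sketch of just the merging step, not a full pipeline. It assumes the transcript is a list of segments with `start`, `end`, and `text` fields (the shape of the `segments` list in a Whisper transcription result) and that a diarization tool has already produced speaker turns. The `(start, end, label)` tuple format, the `label_segments` helper, and the sample data are all hypothetical, made up for illustration.

```python
from typing import Dict, List, Tuple

# Hypothetical format for diarization output: (start_sec, end_sec, speaker_label)
Turn = Tuple[float, float, str]


def overlap(a_start: float, a_end: float, b_start: float, b_end: float) -> float:
    """Return how many seconds two time intervals share (0.0 if they do not meet)."""
    return max(0.0, min(a_end, b_end) - max(a_start, b_start))


def label_segments(segments: List[Dict], turns: List[Turn],
                   unknown: str = "UNKNOWN") -> List[Dict]:
    """Attach to each transcript segment the speaker whose turn overlaps it most."""
    labeled = []
    for seg in segments:
        best_label, best_overlap = unknown, 0.0
        for t_start, t_end, speaker in turns:
            shared = overlap(seg["start"], seg["end"], t_start, t_end)
            if shared > best_overlap:
                best_label, best_overlap = speaker, shared
        # Copy the segment so the input list is left untouched.
        labeled.append({**seg, "speaker": best_label})
    return labeled


if __name__ == "__main__":
    # Made-up sample data standing in for real transcription and diarization output.
    segments = [
        {"start": 0.0, "end": 4.0, "text": "Welcome to the show."},
        {"start": 4.0, "end": 9.0, "text": "Thanks for having me."},
    ]
    turns = [(0.0, 4.5, "SPEAKER_00"), (4.5, 10.0, "SPEAKER_01")]
    for seg in label_segments(segments, turns):
        print(f'[{seg["speaker"]}] {seg["text"]}')
```

Picking the speaker with the largest time overlap is only one simple choice; a segment that spans two speakers will still get a single label, so a real tool might split such segments instead.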