python - Transcribe audio with Whisper and Google Colab raw verbatim with fillers

To transcribe an audio file with Whisper, I upload the file to my Google Drive and run these commands:

!pip install git+.git
!sudo apt update && sudo apt install ffmpeg

I mount Google Drive using:

from google.colab import drive
drive.mount('/content/gdrive')

To extract the text from the audio file I used the command:

!whisper "ENTER FILE NAME HERE" --model medium --language en

But it produces a clean verbatim transcript.
I have a particular client who wants the fillers transcribed.
What command do I use if I want to transcribe fillers, false starts and uhms?

I tried using --input_prompt, but Python does not seem to recognize it when I add it to my command.
I can live with the fact that it might lose punctuation.
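
For reference, here is a minimal sketch of the prompt approach, assuming the option intended above is Whisper's --initial_prompt (the whisper CLI defines --initial_prompt, not --input_prompt). The helper name build_whisper_command, the example prompt text and the file path are hypothetical, and a disfluent prompt only biases the model toward keeping fillers; it does not guarantee a raw verbatim transcript.

```python
import shlex

# Hypothetical example prompt: a deliberately disfluent sentence meant to
# nudge the model toward keeping fillers. Its effect is not guaranteed.
FILLER_PROMPT = "Umm, let me think, like, hmm... Okay, so, uh, here is what I am, like, thinking."


def build_whisper_command(audio_path, model="medium", language="en",
                          initial_prompt=FILLER_PROMPT):
    """Return the whisper CLI invocation as a list of arguments."""
    return [
        "whisper", audio_path,
        "--model", model,
        "--language", language,
        "--initial_prompt", initial_prompt,
    ]


if __name__ == "__main__":
    # Hypothetical path to a file on the mounted drive.
    cmd = build_whisper_command("/content/gdrive/MyDrive/interview.mp3")
    # Print a shell-quoted line that can be pasted into a Colab cell.
    print("!" + shlex.join(cmd))
```

Running this prints a single line starting with !whisper that can be pasted into a Colab cell. Building the command as a list and quoting it with shlex.join keeps the spaces and punctuation in the prompt from being split into separate arguments.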

Could I use CrisperWhisper with Google Colab and if so what is the code to download CrisperWhisper and the command to transcribe?

Many thanks in advance

I tried experimenting with --input_prompt.
I searched for tutorials on CrisperWhisper but was only able to find one where you need to install the software directly on your PC. My graphics card isn't great, so I would prefer to use Google Colab with Python.


