To transcribe an audio file with Whisper I upload the file into my Google drive run the command:
!pip install git+.git
!sudo apt update && sudo apt install ffmpeg
I mount the Google drive using:
from google.colab import drive
drive.mount('/content/gdrive')
To extract the text from the audio file I used the command:
!whisper "ENTER FILE NAME HERE" --model medium --language en
But it renders a clean verbatim transcript.
I have particular client that wants the fillers to be transcribed.
What command do I use if I want to transcribe fillers, false starts and uhms.
I tried using --input_prompt
but Python does not seem to recognize it if I add it in my command.
I can live with the fact that it might loose punctuation.
Could I use CrisperWhisper with Google Colab and if so what is the code to download CrisperWhisper and the command to transcribe?
Many thanks in advance
I tried experimenting with --input_prompt
.
I searched for tutorials on ChrisperWhisper but was only able to find one where you need to install the software directly onto your PC but my graphics card isn't so great and prefer to use Google Colab with Python.