python - Transcribe audio with Whisper and Google Colab raw verbatim with fillers - Stack Overflow

To transcribe an audio file with Whisper, I upload the file to my Google Drive and run these commands:

!pip install git+.git
!sudo apt update && sudo apt install ffmpeg

I mount the Google drive using:

from google.colab import drive
drive.mount('/content/gdrive')

To extract the text from the audio file I used the command:

!whisper "ENTER FILE NAME HERE" --model medium --language en 

But this produces a clean verbatim transcript.
I have a particular client who wants the fillers transcribed as well.
What command do I use if I want to transcribe fillers, false starts and "uhms"?

I tried using --input_prompt, but it is not recognized when I add it to my command.
I can live with the fact that it might lose punctuation.
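
For reference, the option the Whisper CLI exposes for seeding the decoder with text is `--initial_prompt`; there is no `--input_prompt`. Below is a minimal sketch of assembling the command with a filler-heavy prompt. The helper `build_whisper_command`, the prompt wording, and the file path are all hypothetical, and a prompt like this may only nudge the model toward keeping fillers; it does not guarantee a raw verbatim transcript.

```python
import shlex

# Hypothetical prompt containing fillers; the exact wording is an assumption,
# not a documented recipe, and it only biases the model's output style.
FILLER_PROMPT = "Umm, let me think like, hmm... Okay, here's what I'm, like, thinking."


def build_whisper_command(audio_path, model="medium", language="en", initial_prompt=None):
    """Return the argument list for the whisper CLI (hypothetical helper)."""
    cmd = ["whisper", audio_path, "--model", model, "--language", language]
    if initial_prompt:
        # --initial_prompt is the CLI option for text passed as a prompt
        # to the first decoding window.
        cmd += ["--initial_prompt", initial_prompt]
    return cmd


# The path below is a placeholder for a file on the mounted drive.
cmd = build_whisper_command(
    "/content/gdrive/MyDrive/interview.mp3",
    initial_prompt=FILLER_PROMPT,
)

# Print a shell-safe command line; in a Colab cell, prefix it with "!".
print(shlex.join(cmd))
```

The printed line can be pasted into a Colab cell after a leading `!`. Building the arguments as a list and quoting them with `shlex.join` keeps apostrophes and spaces in the prompt from breaking the shell command.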

Could I use CrisperWhisper with Google Colab, and if so, what is the code to download CrisperWhisper and the command to transcribe with it?

Many thanks in advance

I tried experimenting with --input_prompt.
I searched for tutorials on CrisperWhisper but could only find one that requires installing the software directly on a PC. My graphics card isn't great, so I would prefer to use Google Colab with Python.
