These commands work for Ubuntu. Should be simple to change for other Linux distros.
Install Nvidia and CUDA drivers
sudo apt install nvidia-driver-530 nvidia-cuda-toolkit
Reboot so the system uses the driver.
Install pip and ffmpeg
sudo apt install python3-pip
sudo apt install ffmpeg
Now we can install whisper with
pip install -U openai-whisper
Run Whisper
After it is installed, it should be able to run it like
whisper audio.mp3 --model medium
Change out medium to the model you would like to use. It will then download the model and then work get to work on transcribing it. The .en models i.e. medium.en, seem to perform better then the other ones. If you are using English that is.
If you receive a “Command ‘whisper’ not found” error, you may not have ~/.local/bin in your user PATH. Either add ~/.local/bin to your PATH, or run whisper with the full path
~/.local/bin/whisper audio.mp3 --model medium
OpenAI Whisper GitHub link.
https://github.com/openai/whisper