

EDIT: seems like Whisper might be your best bet, it does speech-to-text + translation via a CLI that can be installed with pip: https://github.com/openai/whisper
Huggingface has loads of models for local speech to text and translation, see https://huggingface.co/docs/transformers/model_doc/speech_to_text
There is probably a nicer UI to use them than python coding, but I don’t know one.
AFAIK, this is the Google translate model that you can deploy locally https://huggingface.co/google/mt5-large