

Dunno how I missed this, but I referenced this post as a reply to another and am just now seeing it.
The box I ssh into is headless, and AFAIK, use of /media/{user}/{mountpoint} is just a desktop environment convention. When I plug in any kind of removable media to this box, I manually mount it under /mnt. I mount my NAS’s media share to /media mostly for convenience since that’s the main purpose of this box in my workshop.









https://github.com/marytts/marytts
I’ve used MaryTTS semi-recently. It’s older but works well enough for my cases. I have it running on a server (locally) and my endpoints make a call to it and playback the returned audio file.
On Android, I use SherpaTTS which has good voices, but I’m not aware of a desktop/Linux option. It mentions using voices from Coqui which you linked, so I would guess that would be the way to go for desktop.