• 11 Posts
  • 57 Comments
Joined 3 years ago
Cake day: July 9th, 2023

  • I use the HA Voice Preview in two different rooms and got rid of my Alexa Dots. For STT I’ve been trying both speech-to-phrase and whisper with medium.en running on the GPU, and for the LLM I’ve tried llama3.2 and granite4 with local command handling.

    I’ve been trying to get it working better, but it’s been a struggle. The wake word triggers for my voice, but not for my girlfriend’s. When I try setting a timer, it says done, but the timer never actually starts.

    I’d love to improve how well my assistant works, but I want to know which options work well for others. I’ve been experimenting with an intermediary STT proxy that sends the audio to both whisper and speech-to-phrase to see which one has more confidence.
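
To make the "which one has more confidence" step concrete, here is a minimal sketch of the selection logic only. It assumes each backend's result has already been reduced to a transcript plus a confidence score normalized to [0, 1]; `Transcript`, `pick_transcript`, and `min_confidence` are hypothetical names for illustration, not part of whisper, speech-to-phrase, or Home Assistant, and the audio forwarding itself is left out.

```python
from dataclasses import dataclass
from typing import Iterable, Optional


@dataclass(frozen=True)
class Transcript:
    """One STT result. All fields are assumptions made for this sketch."""
    engine: str        # hypothetical label, e.g. "whisper" or "speech-to-phrase"
    text: str          # recognized text
    confidence: float  # assumed to be normalized to [0, 1] by the proxy


def pick_transcript(
    candidates: Iterable[Transcript],
    min_confidence: float = 0.0,
) -> Optional[Transcript]:
    """Return the non-empty candidate with the highest confidence.

    Returns None when every candidate is empty or below min_confidence,
    so the caller can decide to ask the user to repeat the command.
    """
    usable = [
        c for c in candidates
        if c.text.strip() and c.confidence >= min_confidence
    ]
    if not usable:
        return None
    # max() keeps the first candidate on a tie, so list order acts as priority.
    return max(usable, key=lambda c: c.confidence)


if __name__ == "__main__":
    results = [
        Transcript("whisper", "set a timer for five minutes", 0.62),
        Transcript("speech-to-phrase", "set a timer for 5 minutes", 0.91),
    ]
    best = pick_transcript(results, min_confidence=0.5)
    print(best.engine if best else "no usable transcript")
```

One caveat with this approach: scores produced by two different engines are not necessarily on the same scale, so comparing them directly may favor one engine consistently. A per-engine threshold or weighting might be needed before the comparison means much.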



  • I’d love for my HA Voice Preview to be sufficient to replace my Alexa/Google devices. I even unplugged my Alexa devices. However, it’s been rough going for me. It never responds to my girlfriend speaking the wake word, and it doesn’t set timers. There are a number of knobs that determine how well it works, including the physical hardware (there’s obviously the Voice Preview, but also some community-made versions with better mics), the wake word model, the conversation LLM, and the speech-to-text model (whisper vs speech-to-phrase). If it works well for you, can you share the configuration you’re using?






  • Interesting. I played around with X11 vs Wayland settings just to see what the different configurations give me:

    • MOZ_ENABLE_WAYLAND=1 /snap/bin/firefox - Exhibits low FPS issue
    • MOZ_ENABLE_WAYLAND=0 DISABLE_WAYLAND=1 /snap/bin/firefox - Actually feels fast like it should be. Most animations feel faster, some are still choppy though. It’s hard to tell.

    It seems like running with X11 sort of fixes the problem? That’s unexpected, and it concerns me since I know distros are starting to default to Wayland.