Wondering about services to test on either a 16 GB RAM “AI capable” arm64 board or on a laptop with a modern RTX GPU. Only looking for open-source options, but curious to hear what people say. Cheers!
I’ve got an old gaming PC with a decent GPU lying around, and I’ve thought of doing that (I currently use it for Linux gaming and GPU-heavy tasks like photo editing). However, I’m stuck on the idea of using LLMs on demand locally with Ollama: the energy cost of keeping the machine powered on all the time just for occasional queries seems a bit overkill to me…
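One way around the always-on cost is to leave the box suspended and wake it over the network right before you query it. Here's a minimal sketch of sending a Wake-on-LAN magic packet in Python, assuming the PC's NIC supports WoL and it's enabled in the BIOS; the MAC address is a placeholder:

```python
import socket

def send_magic_packet(mac: str, broadcast: str = "255.255.255.255", port: int = 9) -> None:
    """Send a Wake-on-LAN magic packet: 6 bytes of 0xFF followed by the
    target MAC address repeated 16 times, broadcast over UDP."""
    mac_bytes = bytes.fromhex(mac.replace(":", "").replace("-", ""))
    packet = b"\xff" * 6 + mac_bytes * 16
    with socket.socket(socket.AF_INET, socket.SOCK_DGRAM) as sock:
        sock.setsockopt(socket.SOL_SOCKET, socket.SO_BROADCAST, 1)
        sock.sendto(packet, (broadcast, port))

# hypothetical MAC of the gaming PC's network card
send_magic_packet("aa:bb:cc:dd:ee:ff")
```

Pair that with a suspend-on-idle timer on the PC and it only draws power while you're actually running queries.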
I put my Plex media server to work running Ollama: the GPU it has for transcoding isn’t awful for simple LLMs.
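Once Ollama is running on the server, other machines on the LAN can hit its HTTP API directly. A minimal sketch, assuming Ollama's default port (11434) and a model you've already pulled (the model name here is just an example):

```python
import json
import urllib.request

# Ollama's HTTP API listens on port 11434 by default.
req = urllib.request.Request(
    "http://localhost:11434/api/generate",
    data=json.dumps({
        "model": "llama3",              # assumption: any locally pulled model
        "prompt": "Why is the sky blue?",
        "stream": False,                # return one JSON object instead of a stream
    }).encode(),
    headers={"Content-Type": "application/json"},
)
with urllib.request.urlopen(req) as resp:
    print(json.loads(resp.read())["response"])
```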
That sounds like a great way of leveraging existing infrastructure! I host Plex together with other services on a server whose Intel CPU handles the transcoding. I’m quite sure I’d get much better performance from the GPU machine, so I might end up following this path!
Have to agree on that. It certainly only makes sense to have it up when you’re using it.