Qwen3.6-35B-A3B released

TheCornCollector@piefed.zip · 1 month ago

Qwen3.6-35B-A3B released

TheCornCollector@piefed.zip · 1 month ago

I’m running it with the UD_Q4_K_XL quant on 24GB VRAM 7900XTX at ~85 token/s. Since it’s an MOE model, CPU inference with 32 GB ram should be doable, but I won’t make any promises on speed.

venusaur@lemmy.world · 1 month ago

Thanks! That sounds expensive. Hopefully 24GB VRAM gets cheaper or models get more efficient soon.

Jakeroxs@sh.itjust.works · 1 month ago

You would want to wait till smaller models for 3.6 are released, I’d assume it’ll be soon

venusaur@lemmy.world · 1 month ago

Thanks! I’m hoping to run at least 20B. Idk if I can do that fast enough without 24GB. Seems to be the sweet spot.

fonix232@fedia.io · 1 month ago

Wonder what the wombo-combo of Ryzen AI APU can do with this.

Time to fire up the trusty 370.

Qwen3.6-35B-A3B released

Qwen3.6-35B-A3B released

Qwen/Qwen3.6-35B-A3B · Hugging Face