I have weird thoughts that people find peculiar, so I write them down and people seem to enjoy reading them.
- 1 Post
- 1 Comment
Joined 15 days ago
Cake day: August 27th, 2025
The Qwen3-30B-A3B-2507 family is an absolute beast. The reasoning models are seriously chatty in their chain of thought, but the results speak for themselves. I’m running a Q4 quant on a 5090, and with the KV cache quantized to Q8, I can fit a 60k-token context entirely in VRAM, which gets me up to 200 tokens per second.
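
For anyone wondering why the KV quant matters for fitting the context, here is a rough back-of-the-envelope sketch in Python. The architecture numbers (48 layers, 4 KV heads, head dimension 128) and the 34-bytes-per-32-values cost for an 8-bit block format are assumptions for illustration, not something stated above, so check them against the model's config and your runtime before trusting the result.

```python
def kv_cache_bytes(n_layers, n_kv_heads, head_dim, n_tokens, bytes_per_elem):
    """Estimate KV cache size: K and V each hold n_kv_heads * head_dim
    values per layer for every token in the context."""
    elems = 2 * n_layers * n_kv_heads * head_dim * n_tokens
    return elems * bytes_per_elem


# Assumed architecture values (hypothetical here; verify in the model config).
N_LAYERS = 48
N_KV_HEADS = 4
HEAD_DIM = 128
N_TOKENS = 60_000  # the 60k context from the post

# Assumed storage costs per cached value.
F16_BYTES = 2.0      # unquantized fp16 cache
Q8_BYTES = 34 / 32   # 8-bit blocks of 32 values plus one fp16 scale per block

GIB = 1024 ** 3

f16 = kv_cache_bytes(N_LAYERS, N_KV_HEADS, HEAD_DIM, N_TOKENS, F16_BYTES)
q8 = kv_cache_bytes(N_LAYERS, N_KV_HEADS, HEAD_DIM, N_TOKENS, Q8_BYTES)

print(f"fp16 KV cache: {f16 / GIB:.2f} GiB")
print(f"Q8 KV cache:   {q8 / GIB:.2f} GiB")
print(f"saved:         {(f16 - q8) / GIB:.2f} GiB")
```

Under those assumptions the cache drops from about 5.5 GiB to about 2.9 GiB at 60k tokens, which is VRAM you get back for the weights and compute buffers. This only covers the cache itself; the weights and runtime overhead come on top.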