🚀 Meet Qwen-Image — a 20B MMDiT model for next-gen text-to-image generation. Especially strong at creating stunning graphic posters with native text. Now open-source.
🔍 Key Highlights:
🔹 SOTA text rendering — rivals GPT-4o in English, best-in-class for Chinese
🔹 In-pixel text generation — no overlays, fully integrated
🔹 Bilingual support, diverse fonts, complex layouts
🎨 Also excels at general image generation — from photorealistic to anime, impressionist to minimalist. A true creative powerhouse.
Blog: https://qwenlm.github.io/blog/qwen-image/
Hugging Face: https://huggingface.co/Qwen/Qwen-Image
Model Scope: https://modelscope.cn/models/Qwen/Qwen-Image/summary
GitHub: https://github.com/QwenLM/Qwen-Image
Technical Report: https://qianwen-res.oss-cn-beijing.aliyuncs.com/Qwen-Image/Qwen_Image.pdf
WaveSpeed Demo: https://wavespeed.ai/models/wavespeed-ai/qwen-image/text-to-image
Demo: https://modelscope.cn/aigc/imageGeneration?tab=advanced
ah awesome, let me check! i used qwen for ages before flicking over to GLM and I’ve not been impacted but it’s not like i ask about chinese government things very often
i finally got to the workstation. after instaling ComfyUI you need to add the ComfyUI-GGUF Node https://github.com/city96/ComfyUI-GGUF if you’re using Apple Silicon - i didn’t manage to get it to work otherwise because of the data type conversion.
Finally this is the Workflow I use for image generation: https://voidbin.com/paste/6f15026e-d18d-4542-97aa-2a93acc97af6 just save it as a json.