view post Post 1104 QwQ can see 🔥Qwen team released QvQ, a large vision LM with reasoning 😱it outperforms proprietary VLMs on several benchmarks, comes with open weights and a demo! Check them out ⬇️Demo Qwen/QVQ-72B-previewModel Qwen/QVQ-72B-PreviewRead more https://qwenlm.github.io/blog/qvq-72b-preview/Congratulations @JustinLin610 and team! See translation 👍 6 6 👀 5 5 🔥 3 3 🚀 1 1 + Reply
Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis Paper • 2412.15322 • Published 6 days ago • 15
Resources for Tagging / Captioning / Prompting / LLM Collection 5271 items • Updated about 4 hours ago • 4
Resources for Tagging / Captioning / Prompting / LLM Collection 5271 items • Updated about 4 hours ago • 4
Resources for Tagging / Captioning / Prompting / LLM Collection 5271 items • Updated about 4 hours ago • 4
LoRAs / Models (SDXL1.0, Pony, SD1.5, Flux, ...) Collection 979 items • Updated about 4 hours ago • 8
Spaces for Text-to-images (SDXL, Pony, SD1.5, Flux,...) Collection 387 items • Updated about 4 hours ago • 22