Chat With Janus-Pro-7B: A unified multimodal understanding and generation model • Running on Zero • 263
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published 5 days ago • 225
Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps Paper • 2501.09732 • Published 11 days ago • 65
Omni-RGPT: Unifying Image and Video Region-level Understanding via Token Marks Paper • 2501.08326 • Published 13 days ago • 31
Diffusion Adversarial Post-Training for One-Step Video Generation Paper • 2501.08316 • Published 13 days ago • 32
MiniMax-01: Scaling Foundation Models with Lightning Attention Paper • 2501.08313 • Published 13 days ago • 268
The Lessons of Developing Process Reward Models in Mathematical Reasoning Paper • 2501.07301 • Published 15 days ago • 88
LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs Paper • 2501.06186 • Published 17 days ago • 59
The GAN is dead; long live the GAN! A Modern GAN Baseline Paper • 2501.05441 • Published 18 days ago • 85
Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models Paper • 2411.14432 • Published Nov 21, 2024 • 23
Search-o1: Agentic Search-Enhanced Large Reasoning Models Paper • 2501.05366 • Published 18 days ago • 80