InfiR : Crafting Effective Small Language Models and Multimodal Small Language Models in Reasoning Paper • 2502.11573 • Published 21 days ago • 9
Multi-modal Agent Tuning: Building a VLM-Driven Agent for Efficient Tool Usage Paper • 2412.15606 • Published Dec 20, 2024 • 2
Self-Consistency Improves Chain of Thought Reasoning in Language Models Paper • 2203.11171 • Published Mar 21, 2022 • 4
UltraEdit: Instruction-based Fine-Grained Image Editing at Scale Paper • 2407.05282 • Published Jul 7, 2024 • 15
FIRE: A Dataset for Feedback Integration and Refinement Evaluation of Multimodal Models Paper • 2407.11522 • Published Jul 16, 2024 • 9