Combining Flow Matching and Transformers for Efficient Solution of Bayesian Inverse Problems • Paper • 2503.01375 • Published 6 days ago • 5
Lost in Literalism: How Supervised Training Shapes Translationese in LLMs • Paper • 2503.04369 • Published 3 days ago • 4
The Best of Both Worlds: Integrating Language Models and Diffusion Models for Video Generation • Paper • 2503.04606 • Published 3 days ago • 7
Union of Experts: Adapting Hierarchical Routing to Equivalently Decomposed Transformer • Paper • 2503.02495 • Published 5 days ago • 7
L^2M: Mutual Information Scaling Law for Long-Context Language Modeling • Paper • 2503.04725 • Published 3 days ago • 16
Token-Efficient Long Video Understanding for Multimodal LLMs • Paper • 2503.04130 • Published 3 days ago • 60
Audio Flamingo 2: An Audio-Language Model with Long-Audio Understanding and Expert Reasoning Abilities • Paper • 2503.03983 • Published 4 days ago • 18
HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization • Paper • 2503.04598 • Published 3 days ago • 16
FuseChat-3.0: Preference Optimization Meets Heterogeneous Model Fusion • Paper • 2503.04222 • Published 3 days ago • 12
Reliable and Efficient Multi-Agent Coordination via Graph Neural Network Variational Autoencoders • Paper • 2503.02954 • Published 5 days ago • 3
HoT: Highlighted Chain of Thought for Referencing Supporting Facts from Inputs • Paper • 2503.02003 • Published 6 days ago • 37
Interact, Instruct to Improve: A LLM-Driven Parallel Actor-Reasoner Framework for Enhancing Autonomous Vehicle Interactions • Paper • 2503.00502 • Published 8 days ago • 2
CognitiveDrone: A VLA Model and Evaluation Benchmark for Real-Time Cognitive Task Solving and Reasoning in UAVs • Paper • 2503.01378 • Published 6 days ago • 3
KodCode: A Diverse, Challenging, and Verifiable Synthetic Dataset for Coding • Paper • 2503.02951 • Published 5 days ago • 25