Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2502.08606

Synthetic Data and Self-Improvement

Training Software Engineering Agents and Verifiers with SWE-Gym

Paper • 2412.21139 • Published Dec 30, 2024 • 22
Evaluating Language Models as Synthetic Data Generators

Paper • 2412.03679 • Published Dec 4, 2024 • 48
Self-Rewarding Language Models

Paper • 2401.10020 • Published Jan 18, 2024 • 147
Self-Discover: Large Language Models Self-Compose Reasoning Structures

Paper • 2402.03620 • Published Feb 6, 2024 • 115

Slamming: Training a Speech Language Model on One GPU in a Day

Paper • 2502.15814 • Published 18 days ago • 66
Small Models Struggle to Learn from Strong Reasoners

Paper • 2502.12143 • Published 20 days ago • 28
HeadInfer: Memory-Efficient LLM Inference by Head-wise Offloading

Paper • 2502.12574 • Published 20 days ago • 11
Large Language Diffusion Models

Paper • 2502.09992 • Published 24 days ago • 99

Logic-RL: Unleashing LLM Reasoning with Rule-Based Reinforcement Learning

Paper • 2502.14768 • Published 17 days ago • 44
S^2R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning

Paper • 2502.12853 • Published 20 days ago • 28
Diverse Inference and Verification for Advanced Reasoning

Paper • 2502.09955 • Published 24 days ago • 17
Distillation Scaling Laws

Paper • 2502.08606 • Published 25 days ago • 46

Large Language Diffusion Models

Paper • 2502.09992 • Published 24 days ago • 99
Distillation Scaling Laws

Paper • 2502.08606 • Published 25 days ago • 46
Autonomy-of-Experts Models

Paper • 2501.13074 • Published Jan 22 • 41
Redundancy Principles for MLLMs Benchmarks

Paper • 2501.13953 • Published Jan 20 • 28

Evolving Deeper LLM Thinking

Paper • 2501.09891 • Published Jan 17 • 106
PaSa: An LLM Agent for Comprehensive Academic Paper Search

Paper • 2501.10120 • Published Jan 17 • 43
Multiple Choice Questions: Reasoning Makes Large Language Models (LLMs) More Self-Confident Even When They Are Wrong

Paper • 2501.09775 • Published Jan 16 • 29
ComplexFuncBench: Exploring Multi-Step and Constrained Function Calling under Long-Context Scenario

Paper • 2501.10132 • Published Jan 17 • 19

Distillation Scaling Laws

Paper • 2502.08606 • Published 25 days ago • 46

SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models

Paper • 2502.09604 • Published 24 days ago • 32
Distillation Scaling Laws

Paper • 2502.08606 • Published 25 days ago • 46
InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU

Paper • 2502.08910 • Published 25 days ago • 143

Distillation Scaling Laws

Paper • 2502.08606 • Published 25 days ago • 46

paper maybe useful

Light-A-Video: Training-free Video Relighting via Progressive Light Fusion

Paper • 2502.08590 • Published 25 days ago • 40
Distillation Scaling Laws

Paper • 2502.08606 • Published 25 days ago • 46
Soundwave: Less is More for Speech-Text Alignment in LLMs

Paper • 2502.12900 • Published 20 days ago • 76

interesting papers

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Paper • 2502.05171 • Published about 1 month ago • 122
Agency Is Frame-Dependent

Paper • 2502.04403 • Published Feb 6 • 22
Distillation Scaling Laws

Paper • 2502.08606 • Published 25 days ago • 46
LLM Pretraining with Continuous Concepts

Paper • 2502.08524 • Published 25 days ago • 28

Previous
1
2
Next

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs