RedPajama: an Open Dataset for Training Large Language Models Paper • 2411.12372 • Published Nov 19, 2024 • 53
HoT: Highlighted Chain of Thought for Referencing Supporting Facts from Inputs Paper • 2503.02003 • Published 6 days ago • 37
When an LLM is apprehensive about its answers -- and when its uncertainty is justified Paper • 2503.01688 • Published 6 days ago • 19
From Hours to Minutes: Lossless Acceleration of Ultra Long Sequence Generation up to 100K Tokens Paper • 2502.18890 • Published 12 days ago • 23
ProX Dataset Collection • a collection of pre-training corpora refined by ProX • 6 items • Updated 23 days ago • 7
How to Get Your LLM to Generate Challenging Problems for Evaluation Paper • 2502.14678 • Published 17 days ago • 16
SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines Paper • 2502.14739 • Published 17 days ago • 94
Diverse Inference and Verification for Advanced Reasoning Paper • 2502.09955 • Published 24 days ago • 17
DarwinLM: Evolutionary Structured Pruning of Large Language Models Paper • 2502.07780 • Published 26 days ago • 17
Expect the Unexpected: FailSafe Long Context QA for Finance Paper • 2502.06329 • Published 27 days ago • 126
An Open Recipe: Adapting Language-Specific LLMs to a Reasoning Model in One Day via Model Merging Paper • 2502.09056 • Published 25 days ago • 30