RedPajama: an Open Dataset for Training Large Language Models Paper • 2411.12372 • Published Nov 19, 2024 • 53
HoT: Highlighted Chain of Thought for Referencing Supporting Facts from Inputs Paper • 2503.02003 • Published 6 days ago • 37
When an LLM is apprehensive about its answers -- and when its uncertainty is justified Paper • 2503.01688 • Published 6 days ago • 19
From Hours to Minutes: Lossless Acceleration of Ultra Long Sequence Generation up to 100K Tokens Paper • 2502.18890 • Published 12 days ago • 23
ProX Dataset Collection a collection of pre-training corpora refined by ProX • 6 items • Updated 23 days ago • 7
How to Get Your LLM to Generate Challenging Problems for Evaluation Paper • 2502.14678 • Published 17 days ago • 16
SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines Paper • 2502.14739 • Published 17 days ago • 94