RedPajama: an Open Dataset for Training Large Language Models Paper • 2411.12372 • Published Nov 19 • 47
Distributed Methods with Compressed Communication for Solving Variational Inequalities, with Theoretical Guarantees Paper • 2110.03313 • Published Oct 7, 2021 • 1
SpecExec: Massively Parallel Speculative Decoding for Interactive LLM Inference on Consumer Devices Paper • 2406.02532 • Published Jun 4 • 13
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model Paper • 2211.05100 • Published Nov 9, 2022 • 27
RuCoLA: Russian Corpus of Linguistic Acceptability Paper • 2210.12814 • Published Oct 23, 2022 • 1
Petals: Collaborative Inference and Fine-tuning of Large Models Paper • 2209.01188 • Published Sep 2, 2022 • 2
Sequoia: Scalable, Robust, and Hardware-aware Speculative Decoding Paper • 2402.12374 • Published Feb 19 • 3
The Hallucinations Leaderboard -- An Open Effort to Measure Hallucinations in Large Language Models Paper • 2404.05904 • Published Apr 8 • 8
Mind Your Format: Towards Consistent Evaluation of In-Context Learning Improvements Paper • 2401.06766 • Published Jan 12 • 2
Distributed Inference and Fine-tuning of Large Language Models Over The Internet Paper • 2312.08361 • Published Dec 13, 2023 • 25
Hypernymy Understanding Evaluation of Text-to-Image Models via WordNet Hierarchy Paper • 2310.09247 • Published Oct 13, 2023 • 3
Hypernymy Understanding Evaluation of Text-to-Image Models via WordNet Hierarchy Paper • 2310.09247 • Published Oct 13, 2023 • 3
FlexGen: High-Throughput Generative Inference of Large Language Models with a Single GPU Paper • 2303.06865 • Published Mar 13, 2023 • 1
SWARM Parallelism: Training Large Models Can Be Surprisingly Communication-Efficient Paper • 2301.11913 • Published Jan 27, 2023 • 1
Distributed Deep Learning in Open Collaborations Paper • 2106.10207 • Published Jun 18, 2021 • 2
It's All in the Heads: Using Attention Heads as a Baseline for Cross-Lingual Transfer in Commonsense Reasoning Paper • 2106.12066 • Published Jun 22, 2021 • 1