LLM For Smartphone Collection These are some of the best llm that can run on a smartphone. These models go toe-to-toe with much larger models, and are great for use on the go. • 12 items • Updated Oct 8, 2024 • 11
Model Stock: All we need is just a few fine-tuned models Paper • 2403.19522 • Published Mar 28, 2024 • 11
ColPali: Efficient Document Retrieval with Vision Language Models Paper • 2407.01449 • Published Jun 27, 2024 • 45
ColBERT: Efficient and Effective Passage Search via Contextualized Late Interaction over BERT Paper • 2004.12832 • Published Apr 27, 2020 • 3
LoRA: Low-Rank Adaptation of Large Language Models Paper • 2106.09685 • Published Jun 17, 2021 • 35
SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World Freelance Software Engineering? Paper • 2502.12115 • Published 20 days ago • 42
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models Paper • 2402.03300 • Published Feb 5, 2024 • 105
RLHF Workflow: From Reward Modeling to Online RLHF Paper • 2405.07863 • Published May 13, 2024 • 68
Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention Paper • 2502.11089 • Published 21 days ago • 141
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits Paper • 2402.17764 • Published Feb 27, 2024 • 610
Post-Training Statistical Calibration for Higher Activation Sparsity Paper • 2412.07174 • Published Dec 10, 2024 • 1
view article Article Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA May 24, 2023 • 124
view article Article Smaller is better: Q8-Chat, an efficient generative AI experience on Xeon May 16, 2023 • 2
view article Article Overview of natively supported quantization schemes in 🤗 Transformers Sep 12, 2023 • 12