Beyond Release: Access Considerations for Generative AI Systems Paper • 2502.16701 • Published 14 days ago • 12
L1 Collection L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning • 2 items • Updated 2 days ago • 2
IndicLLMSuite: A Blueprint for Creating Pre-training and Fine-Tuning Datasets for Indian Languages Paper • 2403.06350 • Published Mar 11, 2024 • 1
Article Hugging Face and JFrog partner to make AI Security more transparent 6 days ago • 18
Remote VAE Inference Endpoints Collection Models and handler code used in https://huggingface.co./blog/remote_vae • 4 items • Updated 12 days ago • 4
BLEU Meets COMET: Combining Lexical and Neural Metrics Towards Robust Machine Translation Evaluation Paper • 2305.19144 • Published May 30, 2023 • 1
CodeCriticBench: A Holistic Code Critique Benchmark for Large Language Models Paper • 2502.16614 • Published 14 days ago • 24
Ukrainian Text-to-Speech datasets Collection Five voices: Mykyta, Oleksa, Lada, Kateryna, and Tetiana • 6 items • Updated 11 days ago • 4
DeepSeek R1 (All Versions) Collection DeepSeek R1 - the most powerful reasoning open-source model - available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models. • 29 items • Updated 11 days ago • 206
Multilingual Machine Translation with Open Large Language Models at Practical Scale: An Empirical Study Paper • 2502.02481 • Published Feb 4 • 10
GemmaX2 Collection GemmaX2 language models: pretrained and instruction-tuned models in two sizes, 2B and 9B. • 7 items • Updated about 1 month ago • 20
MassiveDS Collection Data, embedding, and index of MassiveDS by "Scaling Retrieval-Based Language Models with a Trillion-Token Datastore" • 5 items • Updated 4 days ago • 2
Scaling Retrieval-Based Language Models with a Trillion-Token Datastore Paper • 2407.12854 • Published Jul 9, 2024 • 31