The Lessons of Developing Process Reward Models in Mathematical Reasoning Paper β’ 2501.07301 β’ Published 24 days ago β’ 89
MiniMax-01: Scaling Foundation Models with Lightning Attention Paper β’ 2501.08313 β’ Published 23 days ago β’ 272
view article Article Fine-tune ModernBERT for text classification using synthetic data By davidberenstein1957 β’ Dec 30, 2024 β’ 31
view article Article πΊπ¦ββ¬ LLM Comparison/Test: DeepSeek-V3, QVQ-72B-Preview, Falcon3 10B, Llama 3.3 70B, Nemotron 70B in my updated MMLU-Pro CS benchmark By wolfram β’ Jan 2 β’ 39
view article Article Fine-tune a SmolLM on domain-specific synthetic data from a LLM By davidberenstein1957 β’ Jan 3 β’ 32
view article Article Accelerating Language Model Inference with Mixture of Attentions By hba123 and 1 other β’ about 1 month ago β’ 24