mltrials

AI & ML interests

None yet

Organizations

None yet

mltrials's activity

liked a model 10 days ago

deepseek-ai/DeepSeek-R1

Text Generation • Updated 5 days ago • 1.54M • 7.25k

upvoted 3 papers 21 days ago

Tensor Product Attention Is All You Need

Paper • 2501.06425 • Published 27 days ago • 80

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published 24 days ago • 89

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published 23 days ago • 272

upvoted 5 articles 29 days ago

🌁#81: Key AI Concepts to Follow in 2025

Article • Dec 23, 2024 • 24

Fine-tune ModernBERT for text classification using synthetic data

Article • Dec 30, 2024 • 31

🐺🐦‍⬛ LLM Comparison/Test: DeepSeek-V3, QVQ-72B-Preview, Falcon3 10B, Llama 3.3 70B, Nemotron 70B in my updated MMLU-Pro CS benchmark

Article • Jan 2 • 39

Fine-tune a SmolLM on domain-specific synthetic data from a LLM

Article • Jan 3 • 32

Accelerating Language Model Inference with Mixture of Attentions

Article • about 1 month ago • 24

liked a Space 29 days ago

2024 AI Timeline

View and filter AI model releases in 2024