Chmielewski

Eryk-Chmielewski

AI & ML interests

None yet

Recent Activity

liked a model about 14 hours ago

EvaByte/EvaByte

upvoted an article 1 day ago

Open-R1: a fully open reproduction of DeepSeek-R1

liked a model 1 day ago

bytedance-research/UI-TARS-72B-DPO

Eryk-Chmielewski's activity

upvoted an article 1 day ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

2 days ago

• 376

upvoted 15 papers 3 days ago

Tensor Product Attention Is All You Need

Paper • 2501.06425 • Published 19 days ago • 78

Reasoning Language Models: A Blueprint

Paper • 2501.11223 • Published 10 days ago • 30

SWE-Fixer: Training Open-Source LLMs for Effective and Efficient GitHub Issue Resolution

Paper • 2501.05040 • Published 21 days ago • 15

Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models

Paper • 2501.13629 • Published 6 days ago • 40

SRMT: Shared Memory for Multi-agent Lifelong Pathfinding

Paper • 2501.13200 • Published 7 days ago • 60

O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning

Paper • 2501.12570 • Published 8 days ago • 20

Autonomy-of-Experts Models

Paper • 2501.13074 • Published 7 days ago • 38

Kimi k1.5: Scaling Reinforcement Learning with LLMs

Paper • 2501.12599 • Published 8 days ago • 73

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published 7 days ago • 261

Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training

Paper • 2501.11425 • Published 9 days ago • 84

MMVU: Measuring Expert-Level Multi-Discipline Video Understanding

Paper • 2501.12380 • Published 8 days ago • 79

upvoted a collection 7 days ago

DeepSeek R1 AWQ

Collection

7 items • Updated 7 days ago • 4

upvoted 2 collections 9 days ago

Unsloth 4-bit Dynamic Quants

Collection

Unsloth's Dynamic 4-bit Quants selectively skip quantizing certain parameters, greatly improving accuracy while using only <10% more VRAM than BnB 4-bit • 13 items • Updated 5 days ago • 25

DeepSeek R1 (All Versions)

Collection

DeepSeek R1 - the most powerful reasoning open-source model - available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models. • 27 items • Updated 3 days ago • 101

upvoted a paper 17 days ago

Search-o1: Agentic Search-Enhanced Large Reasoning Models

Paper • 2501.05366 • Published 20 days ago • 83