Chmielewski

Eryk-Chmielewski

AI & ML interests

None yet

Eryk-Chmielewski's activity

liked a model about 13 hours ago

EvaByte/EvaByte

Updated 8 days ago • 196 • 23

upvoted an article 1 day ago

Open-R1: a fully open reproduction of DeepSeek-R1

Article • 2 days ago • 371

liked a model 1 day ago

bytedance-research/UI-TARS-72B-DPO

Image-Text-to-Text • Updated 4 days ago • 5.4k • 73

liked 3 models 2 days ago

upvoted 14 papers 3 days ago

Tensor Product Attention Is All You Need

Paper • 2501.06425 • Published 19 days ago • 78

Reasoning Language Models: A Blueprint

Paper • 2501.11223 • Published 10 days ago • 30

SWE-Fixer: Training Open-Source LLMs for Effective and Efficient GitHub Issue Resolution

Paper • 2501.05040 • Published 20 days ago • 15

The GAN is dead; long live the GAN! A Modern GAN Baseline

Paper • 2501.05441 • Published 20 days ago • 87

Transformer^2: Self-adaptive LLMs

Paper • 2501.06252 • Published 21 days ago • 53

Evolving Deeper LLM Thinking

Paper • 2501.09891 • Published 13 days ago • 100

Humanity's Last Exam

Paper • 2501.14249 • Published 6 days ago • 44

Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models

Paper • 2501.13629 • Published 6 days ago • 40

SRMT: Shared Memory for Multi-agent Lifelong Pathfinding

Paper • 2501.13200 • Published 7 days ago • 60

O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning

Paper • 2501.12570 • Published 8 days ago • 20

Autonomy-of-Experts Models

Paper • 2501.13074 • Published 7 days ago • 38

Kimi k1.5: Scaling Reinforcement Learning with LLMs

Paper • 2501.12599 • Published 8 days ago • 73

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published 7 days ago • 260

Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training

Paper • 2501.11425 • Published 9 days ago • 84