7 158 9

Robin Williams PRO

bfuzzy1

AI & ML interests

None yet

Recent Activity

updated a collection 1 day ago

upvoted a paper 1 day ago

Towards General-Purpose Model-Free Reinforcement Learning

updated a collection 1 day ago

Agents

View all activity

Organizations

None yet

bfuzzy1's activity

upvoted 2 papers 1 day ago

Towards General-Purpose Model-Free Reinforcement Learning

Paper • 2501.16142 • Published 2 days ago • 18

RL + Transformer = A General-Purpose Problem Solver

Paper • 2501.14176 • Published 6 days ago • 14

upvoted a paper 2 days ago

SRMT: Shared Memory for Multi-agent Lifelong Pathfinding

Paper • 2501.13200 • Published 7 days ago • 60

upvoted a paper 3 days ago

Control LLM: Controlled Evolution for Intelligence Retention in LLM

Paper • 2501.10979 • Published 10 days ago • 4

upvoted 3 papers 5 days ago

O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning

Paper • 2501.12570 • Published 8 days ago • 20

Kimi k1.5: Scaling Reinforcement Learning with LLMs

Paper • 2501.12599 • Published 8 days ago • 73

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published 7 days ago • 260

upvoted a paper 7 days ago

Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training

Paper • 2501.11425 • Published 9 days ago • 84

upvoted a paper 9 days ago

Evolving Deeper LLM Thinking

Paper • 2501.09891 • Published 13 days ago • 100

upvoted 5 papers 13 days ago

MangaNinja: Line Art Colorization with Precise Reference Following

Paper • 2501.08332 • Published 15 days ago • 55

ChemAgent: Self-updating Library in Large Language Models Improves Chemical Reasoning

Paper • 2501.06590 • Published 18 days ago • 8

upvoted a paper 17 days ago

Agentless: Demystifying LLM-based Software Engineering Agents

Paper • 2407.01489 • Published Jul 1, 2024 • 59

upvoted 3 papers 18 days ago

Agent Laboratory: Using LLM Agents as Research Assistants

Paper • 2501.04227 • Published 22 days ago • 84

Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Though

Paper • 2501.04682 • Published 21 days ago • 90

Entropy-Guided Attention for Private LLMs

Paper • 2501.03489 • Published 23 days ago • 14

upvoted a collection 19 days ago

LLM Reasoning Papers

Collection

Papers to improve reasoning capabilities of LLMs • 20 items • Updated 14 days ago • 106

upvoted a paper 21 days ago

HumanEval Pro and MBPP Pro: Evaluating Large Language Models on Self-invoking Code Generation

Paper • 2412.21199 • Published about 1 month ago • 13