8 16 7

ytaewon PRO

hamzzi

AI & ML interests

None yet

Recent Activity

commented on a paper 11 days ago

GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning

upvoted a paper 11 days ago

GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning

upvoted a paper 11 days ago

Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems

View all activity

Organizations

hamzzi's activity

commented a paper 11 days ago

GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning

Paper • 2504.00891 • Published 18 days ago • 12 •

upvoted 2 papers 11 days ago

GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning

Paper • 2504.00891 • Published 18 days ago • 12

Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems

Paper • 2504.01990 • Published 18 days ago • 242

commented a paper 11 days ago

Inference-Time Scaling for Generalist Reward Modeling

Paper • 2504.02495 • Published 16 days ago • 52 •

upvoted a paper 11 days ago

Inference-Time Scaling for Generalist Reward Modeling

Paper • 2504.02495 • Published 16 days ago • 52

upvoted 2 papers 16 days ago

JudgeLRM: Large Reasoning Models as a Judge

Paper • 2504.00050 • Published 19 days ago • 59

Improved Visual-Spatial Reasoning via R1-Zero-Like Training

Paper • 2504.00883 • Published 18 days ago • 60

commented a paper 17 days ago

A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond

Paper • 2503.21614 • Published 23 days ago • 39 •

upvoted a paper 17 days ago

A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond

Paper • 2503.21614 • Published 23 days ago • 39

commented 2 papers 17 days ago

Effectively Controlling Reasoning Models through Thinking Intervention

Paper • 2503.24370 • Published 18 days ago • 18 •

OThink-MR1: Stimulating multimodal generalized reasoning capabilities via dynamic reinforcement learning

Paper • 2503.16081 • Published 30 days ago • 26 •

upvoted 3 papers 17 days ago

OThink-MR1: Stimulating multimodal generalized reasoning capabilities via dynamic reinforcement learning

Paper • 2503.16081 • Published 30 days ago • 26

Think Before Recommend: Unleashing the Latent Reasoning Power for Sequential Recommendation

Paper • 2503.22675 • Published 21 days ago • 34

Effectively Controlling Reasoning Models through Thinking Intervention

Paper • 2503.24370 • Published 18 days ago • 18

commented a paper 17 days ago

Efficient Inference for Large Reasoning Models: A Survey

Paper • 2503.23077 • Published 21 days ago • 45 •

upvoted a paper 17 days ago

Efficient Inference for Large Reasoning Models: A Survey

Paper • 2503.23077 • Published 21 days ago • 45

commented a paper 17 days ago

Open-Reasoner-Zero: An Open Source Approach to Scaling Up Reinforcement Learning on the Base Model

Paper • 2503.24290 • Published 19 days ago • 61 •

upvoted a paper 17 days ago

Open-Reasoner-Zero: An Open Source Approach to Scaling Up Reinforcement Learning on the Base Model

Paper • 2503.24290 • Published 19 days ago • 61

upvoted 2 papers 19 days ago

AdaptiVocab: Enhancing LLM Efficiency in Focused Domains through Lightweight Vocabulary Adaptation

Paper • 2503.19693 • Published 25 days ago • 75

Exploring Data Scaling Trends and Effects in Reinforcement Learning from Human Feedback

Paper • 2503.22230 • Published 22 days ago • 43