Yuhang Zang

yuhangzang

https://yuhangzang.github.io/

AI & ML interests

Open-source, CV, and NLP :-)

yuhangzang's activity

authored a paper 6 days ago

Visual-RFT: Visual Reinforcement Fine-Tuning

Paper • 2503.01785 • Published 6 days ago • 59

upvoted a paper 6 days ago

Visual-RFT: Visual Reinforcement Fine-Tuning

Paper • 2503.01785 • Published 6 days ago • 59

upvoted a paper 12 days ago

OmniAlign-V: Towards Enhanced Alignment of MLLMs with Human Preference

Paper • 2502.18411 • Published 12 days ago • 69

upvoted a paper 13 days ago

Thus Spake Long-Context Large Language Model

Paper • 2502.17129 • Published 13 days ago • 67

upvoted an article 17 days ago

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

Article • 26 days ago • 10

authored a paper 17 days ago

SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation

Paper • 2502.13128 • Published 19 days ago • 37

liked a dataset 18 days ago

bethgelab/CuratedThoughts

Viewer • Updated 11 days ago • 222k • 1.27k • 37

upvoted a paper 18 days ago

SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation

Paper • 2502.13128 • Published 19 days ago • 37

commented on a paper 21 days ago

MM-RLHF: The Next Step Forward in Multimodal LLM Alignment

Paper • 2502.10391 • Published 23 days ago • 31

authored a paper 25 days ago

Light-A-Video: Training-free Video Relighting via Progressive Light Fusion

Paper • 2502.08590 • Published 25 days ago • 40

upvoted a paper 25 days ago

Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning

Paper • 2502.06781 • Published 27 days ago • 60

liked a model 25 days ago

internlm/OREAL-7B

Text Generation • Updated 14 days ago • 570 • 19

upvoted a paper 25 days ago

Light-A-Video: Training-free Video Relighting via Progressive Light Fusion

Paper • 2502.08590 • Published 25 days ago • 40

authored 3 papers 28 days ago

upvoted a paper 28 days ago

VideoRoPE: What Makes for Good Video Rotary Position Embedding?

Paper • 2502.05173 • Published about 1 month ago • 64

authored a paper about 2 months ago

InternLM-XComposer2.5-Reward: A Simple Yet Effective Multi-Modal Reward Model

Paper • 2501.12368 • Published Jan 21 • 42

upvoted a paper about 2 months ago

InternLM-XComposer2.5-Reward: A Simple Yet Effective Multi-Modal Reward Model

Paper • 2501.12368 • Published Jan 21 • 42

liked a model about 2 months ago

internlm/internlm-xcomposer2d5-7b-chat

Visual Question Answering • Updated Jan 23 • 214 • 5