Barry Li

Brilliant-B

Brilliant-B

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

LINGOLY-TOO: Disentangling Memorisation from Reasoning with Linguistic Templatisation and Orthographic Obfuscation

upvoted a paper 4 days ago

UFO: A Unified Approach to Fine-grained Visual Perception via Open-ended Language Interface

upvoted a paper 6 days ago

Visual-RFT: Visual Reinforcement Fine-Tuning

View all activity

Organizations

None yet

Brilliant-B's activity

upvoted a paper 2 days ago

LINGOLY-TOO: Disentangling Memorisation from Reasoning with Linguistic Templatisation and Orthographic Obfuscation

Paper • 2503.02972 • Published 5 days ago • 23

upvoted a paper 4 days ago

UFO: A Unified Approach to Fine-grained Visual Perception via Open-ended Language Interface

Paper • 2503.01342 • Published 7 days ago • 7

upvoted a paper 6 days ago

Visual-RFT: Visual Reinforcement Fine-Tuning

Paper • 2503.01785 • Published 6 days ago • 59

upvoted 2 papers 17 days ago

AdaptiveStep: Automatically Dividing Reasoning Step through Model Confidence

Paper • 2502.13943 • Published 18 days ago • 7

Thinking Preference Optimization

Paper • 2502.13173 • Published 20 days ago • 17

upvoted a paper 18 days ago

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published 18 days ago • 157

liked a model 18 days ago

Qwen/Qwen2-VL-7B-Instruct

Image-Text-to-Text • Updated Feb 6 • 1.42M • • 1.15k

liked a dataset 18 days ago

Xiaodong/open-r1-video-4k

Viewer • Updated 20 days ago • 4.66k • 197 • 4

upvoted a paper 22 days ago

VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding

Paper • 2501.13106 • Published Jan 22 • 84

upvoted 2 papers 24 days ago

Next Block Prediction: Video Generation via Semi-Autoregressive Modeling

Paper • 2502.07737 • Published 26 days ago • 9

Goedel-Prover: A Frontier Model for Open-Source Automated Theorem Proving

Paper • 2502.07640 • Published 26 days ago • 8

upvoted a paper 25 days ago

Can We Generate Images with CoT? Let's Verify and Reinforce Image Generation Step by Step

Paper • 2501.13926 • Published Jan 23 • 37

upvoted a paper about 1 month ago

Temporal Preference Optimization for Long-Form Video Understanding

Paper • 2501.13919 • Published Jan 23 • 22

upvoted 4 papers about 2 months ago

Tarsier2: Advancing Large Vision-Language Models from Detailed Video Description to Comprehensive Video Understanding

Paper • 2501.07888 • Published Jan 14 • 15

upvoted 3 papers 2 months ago

VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation

Paper • 2412.21059 • Published Dec 30, 2024 • 18

Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey

Paper • 2412.18619 • Published Dec 16, 2024 • 55

Explanatory Instructions: Towards Unified Vision Tasks Understanding and Zero-shot Generalization

Paper • 2412.18525 • Published Dec 24, 2024 • 75