17 221 19

Orr Zohar PRO

orrzohar

https://orrzohar.github.io

AI & ML interests

Large Multi-Modal Models, Foundation Models, Video Understanding

Recent Activity

upvoted a paper about 3 hours ago

Redundancy Principles for MLLMs Benchmarks

upvoted a collection 2 days ago

Temporal Preference Optimization

upvoted a paper 3 days ago

Temporal Preference Optimization for Long-Form Video Understanding

View all activity

Organizations

orrzohar's activity

upvoted a paper about 3 hours ago

Redundancy Principles for MLLMs Benchmarks

Paper • 2501.13953 • Published 7 days ago • 19

upvoted a collection 2 days ago

Temporal Preference Optimization

Collection

Temporal Preference Optimization for Long-form Video Understanding • 3 items • Updated 8 days ago • 3

upvoted a paper 3 days ago

Temporal Preference Optimization for Long-Form Video Understanding

Paper • 2501.13919 • Published 4 days ago • 18

upvoted 2 papers 4 days ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published 5 days ago • 216

VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding

Paper • 2501.13106 • Published 5 days ago • 69

upvoted 3 papers 6 days ago

VideoWorld: Exploring Knowledge Learning from Unlabeled Videos

Paper • 2501.09781 • Published 11 days ago • 22

Evolving Deeper LLM Thinking

Paper • 2501.09891 • Published 11 days ago • 98

Multiple Choice Questions: Reasoning Makes Large Language Models (LLMs) More Self-Confident Even When They Are Wrong

Paper • 2501.09775 • Published 11 days ago • 26

upvoted 2 papers 11 days ago

Learnings from Scaling Visual Tokenizers for Reconstruction and Generation

Paper • 2501.09755 • Published 11 days ago • 33

Towards Best Practices for Open Datasets for LLM Training

Paper • 2501.08365 • Published 13 days ago • 49

upvoted 7 papers 12 days ago

OpenCSG Chinese Corpus: A Series of High-quality Chinese Datasets for LLM Training

Paper • 2501.08197 • Published 13 days ago • 7

Potential and Perils of Large Language Models as Judges of Unstructured Textual Data

Paper • 2501.08167 • Published 13 days ago • 6

HALoGEN: Fantastic LLM Hallucinations and Where to Find Them

Paper • 2501.08292 • Published 13 days ago • 16

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published 13 days ago • 268

upvoted 2 papers 13 days ago

BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific Literature

Paper • 2501.07171 • Published 14 days ago • 49

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published 14 days ago • 88

upvoted a paper 14 days ago

ReFocus: Visual Editing as a Chain of Thought for Structured Image Understanding

Paper • 2501.05452 • Published 18 days ago • 15