Ziwei Liu

liuziwei7

https://liuziwei7.github.io/

AI & ML interests

None yet

Recent Activity

liked a dataset 2 days ago

lmms-lab/VideoMMMU

authored a paper 3 days ago

Video-MMMU: Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos

liuziwei7's activity

upvoted a paper 3 days ago

Video-MMMU: Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos

Paper • 2501.13826 • Published 4 days ago • 19

upvoted 2 papers 12 days ago

CityDreamer4D: Compositional Generative Model of Unbounded 4D Cities

Paper • 2501.08983 • Published 12 days ago • 19

RepVideo: Rethinking Cross-Layer Representation for Video Generation

Paper • 2501.08994 • Published 12 days ago • 15

upvoted a paper 20 days ago

Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control

Paper • 2501.03847 • Published 20 days ago • 23

upvoted 2 papers about 1 month ago

Evaluation Agent: Efficient and Promptable Evaluation Framework for Visual Generative Models

Paper • 2412.09645 • Published Dec 10, 2024 • 35

FreeScale: Unleashing the Resolution of Diffusion Models via Tuning-Free Scale Fusion

Paper • 2412.09626 • Published Dec 12, 2024 • 20

upvoted 2 papers about 2 months ago

Neural LightRig: Unlocking Accurate Object Normal and Material Estimation with Multi-Light Diffusion

Paper • 2412.09593 • Published Dec 12, 2024 • 18

SOLAMI: Social Vision-Language-Action Modeling for Immersive Interaction with 3D Autonomous Characters

Paper • 2412.00174 • Published Nov 29, 2024 • 23

upvoted 2 papers 2 months ago

Material Anything: Generating Materials for Any 3D Object via Diffusion

Paper • 2411.15138 • Published Nov 22, 2024 • 42

Large Multi-modal Models Can Interpret Features in Large Multi-modal Models

Paper • 2411.14982 • Published Nov 22, 2024 • 16

upvoted 2 collections 2 months ago

Multimodal-SAE

A collection of SAEs hooked onto LLaVA • 4 items • Updated 22 days ago • 5

Insight-V

Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models • 5 items • Updated Nov 22, 2024 • 9

upvoted 2 papers 2 months ago

Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models

Paper • 2411.14432 • Published Nov 21, 2024 • 23

VBench++: Comprehensive and Versatile Benchmark Suite for Video Generative Models

Paper • 2411.13503 • Published Nov 20, 2024 • 30

upvoted 3 papers 3 months ago

MVPaint: Synchronized Multi-View Diffusion for Painting Anything 3D

Paper • 2411.02336 • Published Nov 4, 2024 • 23

FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality

Paper • 2410.19355 • Published Oct 25, 2024 • 23

Disco4D: Disentangled 4D Human Generation and Animation from a Single Image

Paper • 2409.17280 • Published Sep 25, 2024 • 10

upvoted 2 collections 4 months ago

LMMs-Eval-Lite

Making Lite version of the dataset to accelerate holistic evaluation during model development! • 20 items • Updated Oct 4, 2024 • 2

LLaVA-OneVision

a model good at arbitrary types of visual input • 15 items • Updated Oct 5, 2024 • 22

upvoted a paper 4 months ago

Video Instruction Tuning With Synthetic Data

Paper • 2410.02713 • Published Oct 3, 2024 • 38