Victor Mustar's picture

Victor Mustar PRO

victor

·

victormustar

AI & ML interests

Building the UX of this website

Recent Activity

liked a model about 1 hour ago

deepseek-ai/DeepSeek-V3-Base

liked a Space about 2 hours ago

Qwen/QVQ-72B-preview

liked a model 1 day ago

answerdotai/ModernBERT-base

View all activity

Articles

Inference for PROs

Organizations

victor's activity

upvoted a paper 1 day ago

Parallelized Autoregressive Visual Generation

Paper • 2412.15119 • Published 6 days ago • 44

upvoted a collection 3 days ago

Vision Language Models

Grounding, chat • 4 items • Updated 6 days ago • 10

upvoted a paper 4 days ago

AniDoc: Animation Creation Made Easier

Paper • 2412.14173 • Published 7 days ago • 48

upvoted 2 papers 5 days ago

Spectrum: Targeted Training on Signal to Noise Ratio

Paper • 2406.06623 • Published Jun 7 • 11

Qwen2.5 Technical Report

Paper • 2412.15115 • Published 6 days ago • 327

upvoted 3 papers 8 days ago

GenEx: Generating an Explorable World

Paper • 2412.09624 • Published 13 days ago • 84

RetroLLM: Empowering Large Language Models to Retrieve Fine-grained Evidence within Generation

Paper • 2412.11919 • Published 9 days ago • 33

Apollo: An Exploration of Video Understanding in Large Multimodal Models

Paper • 2412.10360 • Published 12 days ago • 131

upvoted 4 papers 12 days ago

Unraveling the Complexity of Memory in RL Agents: an Approach for Classification and Evaluation

Paper • 2412.06531 • Published 16 days ago • 71

STIV: Scalable Text and Image Conditioned Video Generation

Paper • 2412.07730 • Published 15 days ago • 69

SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints

Paper • 2412.07760 • Published 15 days ago • 49

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Paper • 2412.09596 • Published 13 days ago • 90

upvoted a collection 12 days ago

DeepSeek-VL2

4 items • Updated 7 days ago • 26

upvoted an article 13 days ago

Article

Building an AI-powered search engine from scratch

By

•

14 days ago

• 8

upvoted a collection 19 days ago

InternVL2.5

Better than InternVL 2.0 • 18 items • Updated 4 days ago • 77

upvoted a collection 20 days ago

PaliGemma 2 Release

Vision-Language Models available in multiple 3B, 10B and 28B variants. • 23 items • Updated 12 days ago • 119

upvoted an article 20 days ago

Article

🐺🐦‍⬛ LLM Comparison/Test: 25 SOTA LLMs (including QwQ) through 59 MMLU-Pro CS benchmark runs

By

•

21 days ago

• 70

upvoted a collection 22 days ago

Sana

⚡️Sana: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer • 17 items • Updated 5 days ago • 58

upvoted a collection 26 days ago

GLM-Edge

10 items • Updated 27 days ago • 8

upvoted an article 27 days ago

Article

Use Models from the Hugging Face Hub in LM Studio

By

•

27 days ago

• 127