Pavel Iakubovskii's picture

Pavel Iakubovskii

qubvel-hf

·

AI & ML interests

Computer Vision models

Recent Activity

liked a model 5 days ago

xingyang1/Distill-Any-Depth

upvoted a paper 6 days ago

Unified Video Action Model

upvoted an article 6 days ago

SmolVLM2: Bringing Video Understanding to Every Device

View all activity

Organizations

qubvel-hf's activity

upvoted a paper 6 days ago

Unified Video Action Model

Paper • 2503.00200 • Published 11 days ago • 12

upvoted an article 6 days ago

Article

SmolVLM2: Bringing Video Understanding to Every Device

20 days ago

• 202

upvoted an article 10 days ago

Article

Common AI Model Formats

By

•

12 days ago

• 27

upvoted a paper 11 days ago

MegaLoc: One Retrieval to Place Them All

Paper • 2502.17237 • Published 15 days ago • 1

upvoted an article 14 days ago

Article

FastRTC: The Real-Time Communication Library for Python

15 days ago

• 135

upvoted an article 19 days ago

Article

SigLIP 2: A better multilingual vision language encoder

19 days ago

• 130

upvoted a paper 19 days ago

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Paper • 2502.14786 • Published 19 days ago • 128

upvoted a collection 19 days ago

SigLIP2

36 items • Updated 19 days ago • 61

upvoted an article 26 days ago

Article

1 Billion Classifications

27 days ago

• 42

upvoted a paper 27 days ago

Scaling Pre-training to One Hundred Billion Data for Vision Language Models

Paper • 2502.07617 • Published 28 days ago • 29

upvoted an article 27 days ago

Article

From Chunks to Blocks: Accelerating Uploads and Downloads on the Hub

28 days ago

• 49

upvoted an article 29 days ago

Article

From Llasa to Llasagna 🍕: Finetuning LLaSA to generates Italian speech and other languages

By

and 1 other •

29 days ago

• 26

upvoted a collection about 1 month ago

DepthPro Models

Depth Pro: Sharp Monocular Metric Depth in Less Than a Second • 4 items • Updated Feb 7 • 7

upvoted 2 articles about 2 months ago

Article

Simplifying Alignment: From RLHF to Direct Preference Optimization (DPO)

By

•

Jan 19

• 14

Article

Timm ❤️ Transformers: Use any timm model with transformers

Jan 16

• 44

upvoted 2 collections 2 months ago

ViTPose

Collection for ViTPose models based on transformers implementation. • 10 items • Updated Jan 12 • 13

Segformer

Transformer-based semantic segmentation model by Nvidia • 15 items • Updated Jan 13 • 4

upvoted a paper 3 months ago

TRecViT: A Recurrent Video Transformer

Paper • 2412.14294 • Published Dec 18, 2024 • 13

upvoted a collection 3 months ago

timm tiny test models

A collection of very small (~300-500k parameter) models at 160x160 resolution, for testing purposes. Trained on ImageNet-1k. • 13 items • Updated Oct 2, 2024 • 5

upvoted an article 3 months ago

Article

ColPali: Efficient Document Retrieval with Vision Language Models 👀

By

•

Jul 5, 2024

• 214