martin PRO

martintomov

AI & ML interests

None yet

Recent Activity

reacted to merve's post with ๐Ÿš€ 8 days ago
Apollo is a new family of open-source video language models by Meta, where 3B model outperforms most 7B models and 7B outperforms most 30B models ๐Ÿงถ โœจ the models come in 1.5B https://huggingface.co./Apollo-LMMs/Apollo-1_5B-t32, 3B https://huggingface.co./Apollo-LMMs/Apollo-3B-t32 and 7B https://huggingface.co./Apollo-LMMs/Apollo-7B-t32 with A2.0 license, based on Qwen1.5 & Qwen2 โœจ the authors also release a benchmark dataset https://huggingface.co./spaces/Apollo-LMMs/ApolloBench The paper has a lot of experiments (they trained 84 models!) about what makes the video LMs work โฏ๏ธ Try the demo for best setup here https://huggingface.co./spaces/Apollo-LMMs/Apollo-3B they evaluate sampling strategies, scaling laws for models and datasets, video representation and more! > The authors find out that whatever design decision was applied to small models also scale properly when the model and dataset are scaled ๐Ÿ“ˆ scaling dataset has diminishing returns for smaller models > They evaluate frame sampling strategies, and find that FPS sampling is better than uniform sampling, and they find 8-32 tokens per frame optimal > They also compare image encoders, they try a variation of models from shape optimized SigLIP to DINOv2 they find https://huggingface.co./google/siglip-so400m-patch14-384 to be most powerful ๐Ÿ”ฅ > they also compare freezing different parts of models, training all stages with some frozen parts give the best yield They eventually release three models, where Apollo-3B outperforms most 7B models and Apollo 7B outperforms 30B models ๐Ÿ”ฅ
liked a model 8 days ago
FastVideo/FastMochi
View all activity

Organizations

MLX Community's profile picture RB-IBDM's profile picture Cape's profile picture synthgen's profile picture aiml4fun's profile picture AI Starter Pack's profile picture

martintomov's activity

New activity in martintomov/rayban-meta-glasses-v2 28 days ago

Dataset caption

3
#2 opened 3 months ago by
umairahmad1789
New activity in black-forest-labs/FLUX.1-Redux-dev about 1 month ago
New activity in aiml4fun/simkata7-lora about 1 month ago

Add generated example

#2 opened about 1 month ago by
martintomov

Add generated example

#1 opened about 1 month ago by
martintomov
New activity in martintomov/ecom-flux-v2 2 months ago

Add generated example

#6 opened 2 months ago by
martintomov
New activity in martintomov/ecom-flux-v1 2 months ago

Add generated example

#6 opened 2 months ago by
martintomov
New activity in martintomov/ecom-flux-v2 2 months ago

Add generated example

#5 opened 2 months ago by
martintomov
New activity in martintomov/ecom-flux-v1 2 months ago

Add generated example

#5 opened 2 months ago by
martintomov

Add generated example

#4 opened 2 months ago by
martintomov
New activity in martintomov/ecom-flux-v2 2 months ago

Add generated example

#4 opened 2 months ago by
martintomov

Add generated example

#3 opened 2 months ago by
martintomov
New activity in martintomov/ecom-flux-v1 2 months ago

Add generated example

#3 opened 2 months ago by
martintomov
New activity in martintomov/ecom-flux-v2 2 months ago

Add generated example

#2 opened 2 months ago by
martintomov

Add generated example

#1 opened 2 months ago by
martintomov
New activity in martintomov/ecom-flux-v1 2 months ago

Add generated example

#2 opened 2 months ago by
martintomov

Add generated example

#1 opened 2 months ago by
martintomov
New activity in martintomov/mrr-synthetic-data-v1.5 2 months ago

Add generated example

#6 opened 2 months ago by
martintomov

Add generated example

#5 opened 2 months ago by
martintomov

Add generated example

#4 opened 2 months ago by
martintomov