Hugging Face
Sairam Pillai
sairampillai
https://linkedin.com/in/sairampillai
SairamPillai95
sairampillai
AI & ML interests
NLP, MLOps
Recent Activity
replied to merve's post 8 days ago
Apollo is a new family of open-source video language models by Meta, in which the 3B model outperforms most 7B models and the 7B model outperforms most 30B models.
The models come in 1.5B ( https://huggingface.co./Apollo-LMMs/Apollo-1_5B-t32 ), 3B ( https://huggingface.co./Apollo-LMMs/Apollo-3B-t32 ) and 7B ( https://huggingface.co./Apollo-LMMs/Apollo-7B-t32 ) sizes under the Apache 2.0 license, based on Qwen1.5 & Qwen2.
The authors also release a benchmark dataset: https://huggingface.co./spaces/Apollo-LMMs/ApolloBench
The paper has a lot of experiments (they trained 84 models!) about what makes video LMs work. Try the demo for the best setup here: https://huggingface.co./spaces/Apollo-LMMs/Apollo-3B
They evaluate sampling strategies, scaling laws for models and datasets, video representation and more!
> The authors find that design decisions applied to small models also scale properly when the model and dataset are scaled; scaling the dataset has diminishing returns for smaller models
> They evaluate frame sampling strategies and find that FPS sampling is better than uniform sampling, and that 8-32 tokens per frame is optimal
> They also compare image encoders, trying a variety of models from shape-optimized SigLIP to DINOv2, and find https://huggingface.co./google/siglip-so400m-patch14-384 to be the most powerful
> They also compare freezing different parts of the models; training all stages with some parts frozen gives the best results
They eventually release three models, where Apollo-3B outperforms most 7B models and Apollo-7B outperforms 30B models.
Organizations
None yet
sairampillai's activity
liked a Space 3 months ago
Model Explorer • Running • 17
liked a model 8 months ago
lmstudio-community/Meta-Llama-3-8B-Instruct-GGUF • Text Generation • Updated May 3 • 10k • 174
liked 2 Spaces 8 months ago
Hallucinations Leaderboard • Running on CPU Upgrade • 126
Powered By Intel Leaderboard • Running • 37
liked a model 8 months ago
meta-llama/Meta-Llama-3-8B-Instruct • Text Generation • Updated Sep 27 • 1.75M • 3.72k