Andres Marafioti
andito
AI & ML interests
Multimodal models, VLM and TTS
Recent Activity
upvoted an article 3 days ago
Replicating DeepSeek R1 for Information Extraction
liked a dataset 4 days ago
fixie-ai/gigaspeech
posted an update 4 days ago
Extremely bullish on @CohereForAI's Aya Vision (8B & 32B) - new SOTA open-weight VLMs
- 8B wins up to 81% of the time in its class, better than Gemini Flash
- 32B beats Llama 3.2 90B!
- Covers 23 languages, excels in image captioning, VQA & more
- Integrated on transformers from Day 0!
Efficient multimodal models are here to stay!!🔥
Check out their blog! https://huggingface.co/blog/aya-vision
andito's activity
Add ONNX sample code
#8 opened about 1 month ago by Xenova

Upload photo_2025-01-25_13-45-22.jpg
#5 opened about 1 month ago by Moeu

There is an issue with AutoProcessor
3
#6 opened about 1 month ago by Tech-Meld

Upload ONNX weights
#1 opened about 2 months ago by Xenova

[WIP] Upload ONNX weights
#1 opened about 2 months ago by Xenova

update model max length
#8 opened 2 months ago by andito

update model max length
#7 opened 2 months ago by andito

Update model max length
#21 opened 2 months ago by andito

Remove PR message
1
#19 opened 5 months ago by usharma1

GGUF format?
2
#12 opened 3 months ago by hvgupta1

Upload ONNX weights + chat template fixes
#13 opened 3 months ago by Xenova

Update README.md
#1 opened 3 months ago by andito

Update README.md
#1 opened 3 months ago by andito

Update README.md
#1 opened 3 months ago by andito

Update README.md
#1 opened 3 months ago by andito

Update app.py
#3 opened 3 months ago by andito

Best option for DocQVA->JSON
1
#11 opened 3 months ago by Truc95

ValueError: `resolution_max_side` cannot be larger than `max_image_size` with N=5
1
#9 opened 3 months ago by rtbonet

loading images locally?
5
#8 opened 3 months ago by fusi0n

Will this work with vLLM?
4
#10 opened 3 months ago by nickandbro