Todor Arnaudov

twenkid

AI & ML interests

AGI & "everything"; Seed AI; multimodality; automatic programming (...) "The Sacred Computer" AGI institute: Thinking machines, creativity and human development

Recent Activity

Organizations

None yet

twenkid's activity

reacted to vikhyatk's post with ❤️ 6 months ago
🚀 Exciting news! We've just launched "Thundermoon" - the latest version of Moondream, our open-source vision language model! 🌙

Key improvements in this release:
1. Massive leap in OCR capabilities
2. Enhanced document understanding
3. Significant boosts across key metrics:
* DocVQA: 61.9 (↑103%)
* TextVQA: 60.2 (↑5.2%)
* GQA: 64.9 (↑2.9%)

What does this mean? Moondream can now tackle complex document analysis tasks with unprecedented accuracy for a model of its size. From deciphering handwritten notes to interpreting data tables, the applications are vast.

Check out the image for a glimpse of Moondream in action, effortlessly extracting insights from a 1944 sugar industry document!

Why it matters:
* Democratizing AI: As an open-source project, we're making advanced vision AI accessible to all developers.
* Efficiency: Proving that smaller models can deliver big results.
* Real-world impact: From historical document analysis to modern business intelligence, the potential use cases are exciting.

Curious to try it out? The live demo is here: https://moondream.ai/playground
New activity in dataautogpt3/ProteusV0.2 9 months ago
reacted to vikhyatk's post with ❤️ 10 months ago
Just released moondream2 - a small 1.8B parameter vision language model. Now fully open source (Apache 2.0) so you can use it without restrictions on commercial use!

vikhyatk/moondream2
New activity in kiri-ai/gpt2-large-quantized 10 months ago