Alvaro Bartolome's picture

Alvaro Bartolome PRO

alvarobartt

·

https://alvarobartt.me

AI & ML interests

machine learning @huggingface

Recent Activity

new activity about 6 hours ago

microsoft/OmniParser-v2.0:Fix `imgsz` value when not provided in payload

new activity 4 days ago

mims-harvard/ToolRAG-T1-GTE-Qwen2-1.5B:text-generation-inference error

upvoted an article 5 days ago

Training and Finetuning Reranker Models with Sentence Transformers v4

View all activity

Organizations

Posts 6

Post

2964

🔥 Agents can do anything! @microsoft Research just announced the release of Magma 8B!

Magma is a new Visual Language Model (VLM) with 8B parameters for multi-modal agents designed to handle complex interactions across virtual and real environments; and it's MIT licensed!

Magma comes with exciting new features such as:
- Introduces the Set-of-Mark and Trace-of-Mark techniques for fine-tuning
- Leverages a large amount of unlabeled video data to learn the spatial-temporal grounding and planning
- A strong generalization and ability to be fine-tuned for other agentic tasks
- SOTA in different multi-modal benchmarks spanning across UI navigation, robotics manipulation, image / video understanding and spatial understanding and reasoning
- Generates goal-driven visual plans and actions for agentic use cases

Model: microsoft/Magma-8B
Technical Report: Magma: A Foundation Model for Multimodal AI Agents (2502.13130)

Articles 9

Article

4

🤗 Serve any model with Inference Endpoints + Custom Handlers

View all Articles

Collections 8

spaces 1

Running on Zero

FLUX.1 Studio Ghibli LoRA

Generate Studio Ghibli-style images from text prompts

models 23

alvarobartt/safetensors

Updated 22 days ago

alvarobartt/paligemma-2-ft-vqa

Updated Jan 3 • 3

alvarobartt/SmolVLM-Instruct-Handler

Image-Text-to-Text • Updated Dec 4, 2024 • 15

alvarobartt/NVLM-D-72B-IE-compatible

Image-Text-to-Text • Updated Nov 19, 2024 • 11

alvarobartt/ghibli-characters-flux-lora

Text-to-Image • Updated Nov 19, 2024 • 1.07k • • 52

alvarobartt/ghibli-characters-sd3.5-lora

Text-to-Image • Updated Nov 19, 2024 • 50 • • 10

alvarobartt/bert-base-multilingual-cased-ner-spanish

Token Classification • Updated Sep 2, 2024 • 63 • 2

alvarobartt/mistral-7b-orpo-airoboros-pref-10k

Text Generation • Updated Mar 28, 2024 • 6

alvarobartt/mistral-7b-orpo-alignment-handbook

Text Generation • Updated Mar 27, 2024 • 6

alvarobartt/mistral-orpo-mix-b0.05-l1024-pl512-lr5e-7-cosine

Text Generation • Updated Mar 26, 2024 • 7

datasets 55

alvarobartt/Magicoder-Vicuna-1.0

Viewer • Updated Nov 20, 2024 • 75.2k • 40

alvarobartt/SQL-OAI

Viewer • Updated Sep 26, 2024 • 106k • 36

alvarobartt/Magicoder-OAI

Viewer • Updated Sep 25, 2024 • 75.2k • 41

alvarobartt/ghibli-characters

Viewer • Updated Sep 1, 2024 • 9 • 203 • 7

alvarobartt/Capybara-Preferences-Tiny

Viewer • Updated May 14, 2024 • 10 • 44

alvarobartt/replacing-judges-with-juries-distilabel

Viewer • Updated May 8, 2024 • 100 • 74 • 3

alvarobartt/prometheus-eval-distilabel-default

Viewer • Updated May 7, 2024 • 2 • 102

alvarobartt/prometheus-eval-distilabel-ratings

Viewer • Updated May 7, 2024 • 2 • 57

alvarobartt/prometheus-eval-distilabel-generation

Viewer • Updated May 7, 2024 • 2 • 66

alvarobartt/prometheus-eval-distilabel-index

Viewer • Updated May 7, 2024 • 2 • 112