Alvaro Bartolome

alvarobartt · https://alvarobartt.me

AI & ML interests

☁️ Cloud machine learning @huggingface and passionate about open source

Recent Activity

upvoted a paper 5 days ago

Spectrum: Targeted Training on Signal to Noise Ratio

liked a model 6 days ago

Qwen/Qwen-VL-Chat

updated a model 7 days ago

merve/paligemma_vqav2

Articles

🤗 Serve any model with Inference Endpoints + Custom Handlers

Introducing HUGS - Scale your AI with Open Models

Deploy Meta Llama 3.1 405B on Google Cloud Vertex AI

Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

🧑‍⚖️ "Replacing Judges with Juries" using distilabel

Deploying 🤗 Hub models in Vertex AI

🏷️ Build AI Feedback (AIF) datasets for LLM alignment with ⚗️ distilabel

💨 Introducing Notus: a DPO fine-tune of Zephyr with a focus on high-quality data

🤗 LLM suggestions in Argilla with HuggingFace Inference Endpoints

alvarobartt's activity

upvoted a paper 5 days ago

Spectrum: Targeted Training on Signal to Noise Ratio

Paper • 2406.06623 • Published Jun 7 • 11

upvoted a collection 11 days ago

2024 Interconnects Artifacts

Models & datasets mentioned in the bottom section of posts! • 278 items • Updated 6 days ago • 5

upvoted a paper 12 days ago

Evaluating Language Models as Synthetic Data Generators

Paper • 2412.03679 • Published 21 days ago • 43

upvoted 2 collections 13 days ago

PaliGemma 2 Release

Vision-Language Models available in multiple 3B, 10B and 28B variants. • 23 items • Updated 12 days ago • 119

LLM Reasoning Papers

Papers to improve reasoning capabilities of LLMs • 17 items • Updated 2 days ago • 90

upvoted a collection 19 days ago

Llama 3.3

This collection hosts the transformers and original repos of the Llama 3.3 • 1 item • Updated 19 days ago • 92

upvoted a collection 21 days ago

SmolVLM

State-of-the-art compact VLMs for on-device applications: Base, Synthetic, and Instruct • 5 items • Updated 3 days ago • 30

upvoted a paper 23 days ago

Unveiling Encoder-Free Vision-Language Models

Paper • 2406.11832 • Published Jun 17 • 50

upvoted an article 26 days ago

Use Models from the Hugging Face Hub in LM Studio

Article • 27 days ago • 127

upvoted a collection about 1 month ago

Qwen2.5-Coder

Code-specific model series based on Qwen2.5 • 40 items • Updated 28 days ago • 257

upvoted 2 collections about 2 months ago

AMD-OLMo

AMD-OLMo are a series of 1 billion parameter language models trained by AMD on AMD Instinct™ MI250 GPUs based on OLMo. • 4 items • Updated Oct 31 • 17

SmolLM2

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 15 items • Updated 3 days ago • 195

upvoted 2 articles 2 months ago

Releasing Outlines-core 0.1.0: structured generation in Rust and Python

Article • Oct 22 • 44

Fixing Gradient Accumulation

Article • Oct 16 • 43

upvoted an article 3 months ago

Inference Endpoints Changelog 🚀

Article • Oct 11 • 18

upvoted a paper 3 months ago

Pixtral 12B

Paper • 2410.07073 • Published Oct 9 • 62

upvoted 4 collections 3 months ago

Molmo

Artifacts for open multimodal language models. • 5 items • Updated 28 days ago • 289

Llama 3.2

This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated 19 days ago • 548

Qwen2.5

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 45 items • Updated 28 days ago • 444

Moshi v0.1 Release

MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 13 items • Updated Sep 18 • 224