geronimo PRO

g-ronimo

AI & ML interests

fafo

Articles

SemScore: Evaluating LLMs with Semantic Similarity

Mar 9

• 12

Phinetuning 2.0

Jan 31

• 2

g-ronimo's activity

liked a model 2 days ago

mit-han-lab/dc-ae-f64c128-in-1.0-uvit-h-in-512px-train2000k

Updated 20 days ago • 115 • 4

liked a model 3 days ago

SG161222/Verus_Vision_2.0b

Updated 34 minutes ago • 8

upvoted a paper 3 days ago

CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up

Paper • 2412.16112 • Published 6 days ago • 17

liked a model 3 days ago

Huage001/CLEAR

Updated 7 days ago • 3

liked a model 4 days ago

microsoft/BiomedParse

Updated 5 days ago • 1.99k • 57

liked a model 6 days ago

ostris/fluxdev2schnell-lora

Updated 7 days ago • 2

reacted to Xenova's post with 🚀🔥❤️ 6 days ago

Post

1943

Introducing Moonshine Web: real-time speech recognition running 100% locally in your browser!
🚀 Faster and more accurate than Whisper
🔒 Privacy-focused (no data leaves your device)
⚡️ WebGPU accelerated (w/ WASM fallback)
🔥 Powered by ONNX Runtime Web and Transformers.js

Demo: webml-community/moonshine-web
Source code: https://github.com/huggingface/transformers.js-examples/tree/main/moonshine-web

updated a model 7 days ago

g-ronimo/sam2-tiny

Mask Generation • Updated 7 days ago • 1

reacted to lewtun's post with ❤️ 9 days ago

Post

6442

We outperform Llama 70B with Llama 3B on hard math by scaling test-time compute 🔥

How? By combining step-wise reward models with tree search algorithms :)

We show that smol models can match or exceed the performance of their much larger siblings when given enough "time to think"

We're open sourcing the full recipe and sharing a detailed blog post.

In our blog post we cover:

📈 Compute-optimal scaling: How we implemented DeepMind's recipe to boost the mathematical capabilities of open models at test time.

🎄 Diverse Verifier Tree Search (DVTS): An unpublished extension of verifier-guided tree search that we developed. This simple yet effective method improves diversity and delivers better performance, particularly at large test-time compute budgets.

🧭 Search and Learn: A lightweight toolkit for implementing search strategies with LLMs, built for speed with vLLM.

Here are the links:

- Blog post: HuggingFaceH4/blogpost-scaling-test-time-compute

- Code: https://github.com/huggingface/search-and-learn

Enjoy!

2 replies

reacted to sayakpaul's post with ❤️ 16 days ago

Post

2096

The Control family of Flux from @black-forest-labs should be discussed more!

It enables structural controls like ControlNets while being significantly less expensive to run!

So, we're working on a Control LoRA training script 🤗

It's still WIP, so go easy:
https://github.com/huggingface/diffusers/pull/10130