geronimo PRO

g-ronimo

AI & ML interests

fafo

Recent Activity

Articles

Organizations

AblateIt, Blog-explorers

g-ronimo's activity

reacted to Xenova's post with 🚀🔥❤️ 6 days ago
Introducing Moonshine Web: real-time speech recognition running 100% locally in your browser!
🚀 Faster and more accurate than Whisper
🔒 Privacy-focused (no data leaves your device)
⚡️ WebGPU accelerated (w/ WASM fallback)
🔥 Powered by ONNX Runtime Web and Transformers.js

Demo: webml-community/moonshine-web
Source code: https://github.com/huggingface/transformers.js-examples/tree/main/moonshine-web
reacted to lewtun's post with ❤️ 9 days ago
We outperform Llama 70B with Llama 3B on hard math by scaling test-time compute 🔥

How? By combining step-wise reward models with tree search algorithms :)

We show that smol models can match or exceed the performance of their much larger siblings when given enough "time to think"

We're open sourcing the full recipe and sharing a detailed blog post.

In our blog post we cover:

📈 Compute-optimal scaling: How we implemented DeepMind's recipe to boost the mathematical capabilities of open models at test-time.

🎄 Diverse Verifier Tree Search (DVTS): An unpublished extension we developed to the verifier-guided tree search technique. This simple yet effective method improves diversity and delivers better performance, particularly at large test-time compute budgets.

🧭 Search and Learn: A lightweight toolkit for implementing search strategies with LLMs, built for speed with vLLM

Here are the links:

- Blog post: HuggingFaceH4/blogpost-scaling-test-time-compute

- Code: https://github.com/huggingface/search-and-learn

Enjoy!
reacted to sayakpaul's post with ❤️ 16 days ago
The Control family of Flux from @black-forest-labs should be discussed more!

It enables structural controls like ControlNets while being significantly less expensive to run!

So, we're working on a Control LoRA training script 🤗

It's still WIP, so go easy:
https://github.com/huggingface/diffusers/pull/10130
New activity in onnx-community/BackgroundMattingV2-4k 18 days ago

Model ready?

#1 opened 18 days ago by g-ronimo
New activity in PleIAs/Pleias-350m-Preview 20 days ago