Jeremie Tisby

Frobenius

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

MM-RLHF: The Next Step Forward in Multimodal LLM Alignment

updated a collection 7 days ago

LLMs

liked a model 10 days ago

hexgrad/Kokoro-82M

View all activity

Organizations

Frobenius's activity

upvoted a paper 4 days ago

MM-RLHF: The Next Step Forward in Multimodal LLM Alignment

Paper • 2502.10391 • Published 23 days ago • 31

updated a collection 7 days ago

LLMs

Collection

3 items • Updated 7 days ago

liked a model 10 days ago

hexgrad/Kokoro-82M

Text-to-Speech • Updated 6 days ago • 1.54M • 3.61k

liked a model 23 days ago

LGAI-EXAONE/EXAONE-3.5-2.4B-Instruct

Text Generation • Updated Dec 11, 2024 • 62.2k • 135

liked a model about 1 month ago

HKUSTAudio/Llasa-3B

Text-to-Speech • Updated about 14 hours ago • 3.66k • 469

updated a collection about 1 month ago

LLMs

Collection

3 items • Updated 7 days ago

liked 2 models about 2 months ago

tensorblock/Sky-T1-32B-Preview-GGUF

Updated Jan 12 • 201 • 2

omkarthawakar/LlamaV-o1

Question Answering • Updated Jan 13 • 1.4k • 91

liked 3 models 3 months ago

liked a Space 3 months ago

531

Scaling test-time compute

📈

Enhance math problem solving by scaling test-time compute

replied to lewtun's post 3 months ago

Wow people... This is CRACKED! THANK YOU HF!!!

reacted to lewtun's post with 🔥 3 months ago

Post

6928

We outperform Llama 70B with Llama 3B on hard math by scaling test-time compute 🔥

How? By combining step-wise reward models with tree search algorithms :)

We show that smol models can match or exceed the performance of their much larger siblings when given enough "time to think"

We're open sourcing the full recipe and sharing a detailed blog post.

In our blog post we cover:

📈 Compute-optimal scaling: How we implemented DeepMind's recipe to boost the mathematical capabilities of open models at test-time.

🎄 Diverse Verifier Tree Search (DVTS): An unpublished extension we developed to the verifier-guided tree search technique. This simple yet effective method improves diversity and delivers better performance, particularly at large test-time compute budgets.

🧭 Search and Learn: A lightweight toolkit for implementing search strategies with LLMs and built for speed with vLLM

Here's the links:

- Blog post: HuggingFaceH4/blogpost-scaling-test-time-compute

- Code: https://github.com/huggingface/search-and-learn

Enjoy!