Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
1
Swapan jain
hull11
Follow
21world's profile picture
1 follower
ยท
1 following
AI & ML interests
None yet
Recent Activity
liked
a Space
9 days ago
HuggingFaceH4/blogpost-scaling-test-time-compute
replied
to
lewtun
's
post
9 days ago
We outperform Llama 70B with Llama 3B on hard math by scaling test-time compute ๐ฅ How? By combining step-wise reward models with tree search algorithms :) We show that smol models can match or exceed the performance of their much larger siblings when given enough "time to think" We're open sourcing the full recipe and sharing a detailed blog post. In our blog post we cover: ๐ Compute-optimal scaling: How we implemented DeepMind's recipe to boost the mathematical capabilities of open models at test-time. ๐ Diverse Verifier Tree Search (DVTS): An unpublished extension we developed to the verifier-guided tree search technique. This simple yet effective method improves diversity and delivers better performance, particularly at large test-time compute budgets. ๐งญ Search and Learn: A lightweight toolkit for implementing search strategies with LLMs and built for speed with vLLM Here's the links: - Blog post: https://huggingface.co./spaces/HuggingFaceH4/blogpost-scaling-test-time-compute - Code: https://github.com/huggingface/search-and-learn Enjoy!
reacted
to
lewtun
's
post
with ๐
9 days ago
We outperform Llama 70B with Llama 3B on hard math by scaling test-time compute ๐ฅ How? By combining step-wise reward models with tree search algorithms :) We show that smol models can match or exceed the performance of their much larger siblings when given enough "time to think" We're open sourcing the full recipe and sharing a detailed blog post. In our blog post we cover: ๐ Compute-optimal scaling: How we implemented DeepMind's recipe to boost the mathematical capabilities of open models at test-time. ๐ Diverse Verifier Tree Search (DVTS): An unpublished extension we developed to the verifier-guided tree search technique. This simple yet effective method improves diversity and delivers better performance, particularly at large test-time compute budgets. ๐งญ Search and Learn: A lightweight toolkit for implementing search strategies with LLMs and built for speed with vLLM Here's the links: - Blog post: https://huggingface.co./spaces/HuggingFaceH4/blogpost-scaling-test-time-compute - Code: https://github.com/huggingface/search-and-learn Enjoy!
View all activity
Organizations
None yet
hull11
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
liked
a Space
9 days ago
Running
377
๐
Scaling test-time compute