Bityuno Zero Qwen2.5-3B Countdown
Bityuno Zero is an implementation inspired by TinyZero, designed to develop self-verification and search skills through reinforcement learning. The model is based on Qwen2.5-3B and has been trained specifically for the "Countdown" task. It is highly experimental; check the repo for more information.
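To make the task concrete, here is a minimal sketch of how a Countdown answer could be verified. It assumes the usual formulation of the task (combine the given integers with +, -, *, / so the result equals a target, using each number exactly once); the exact rules and reward used to train this model are not stated here, so treat this as an illustration only. The name `check_countdown` is hypothetical and does not come from the repo.

```python
import ast
import operator
from collections import Counter

# Only the four basic arithmetic operators are accepted (assumed task rules).
_OPS = {
    ast.Add: operator.add,
    ast.Sub: operator.sub,
    ast.Mult: operator.mul,
    ast.Div: operator.truediv,
}


def _evaluate(node, used):
    """Evaluate a parsed expression, recording every integer literal in `used`."""
    if (
        isinstance(node, ast.Constant)
        and isinstance(node.value, int)
        and not isinstance(node.value, bool)
    ):
        used.append(node.value)
        return node.value
    if isinstance(node, ast.BinOp) and type(node.op) in _OPS:
        left = _evaluate(node.left, used)
        right = _evaluate(node.right, used)
        return _OPS[type(node.op)](left, right)
    # Anything else (names, calls, unary operators, ...) is rejected.
    raise ValueError("unsupported expression")


def check_countdown(expression, numbers, target):
    """Return True if `expression` uses each number exactly once and hits `target`."""
    try:
        tree = ast.parse(expression, mode="eval")
        used = []
        value = _evaluate(tree.body, used)
    except (SyntaxError, ValueError, ZeroDivisionError):
        return False
    return Counter(used) == Counter(numbers) and abs(value - target) < 1e-6


print(check_countdown("(25 - 5) * 3", [3, 5, 25], 60))
print(check_countdown("25 * 3", [3, 5, 25], 60))
```

The expression is parsed with `ast` rather than passed to `eval`, so model output containing function calls or names is rejected instead of executed.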
Model: JackCloudman/bityuno-zero-qwen2.5-3B-countdown
Base model: Qwen/Qwen2.5-3B