Qwen2.5-72B-0.6x-Instruct
This is a linear merge of Qwen/Qwen2.5-72B-Instruct at weight 0.6
and Qwen/Qwen2.5-72B at weight 0.4
.
The resulting model is 60% Instruct and 40% base model, hence the name 0.6x-Instruct
.
The goal of the merge was to make the Instruct model more flexible and less rigid. After some initial testing, I think the resulting model meets this goal, and I find it useful and interesting enough to warrant publishing.
- Downloads last month
- 24
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social
visibility and check back later, or deploy to Inference Endpoints (dedicated)
instead.