Mergekit config
#2
by
ehartford
- opened
Can you please share the mergekit config?
My guess is simple task_arithmetic:
models:
- model: Qwen/Qwen2.5-14B
- model: Qwen/Qwen2.5-14B-Instruct-1M
- model: deepseek-ai/DeepSeek-R1-Distill-Qwen-14B
merge_method: task_arithmetic
base_model: Qwen/Qwen2.5-14B
parameters:
normalize: false
int8_mask: true
dtype: float16
(untested)
Sure, I just threw it in the Readme.md file