Mergekit config

by ehartford - opened about 16 hours ago

Discussion

ehartford

about 16 hours ago

Can you please share the mergekit config?

nlpguy

about 11 hours ago

My guess is a simple task_arithmetic merge:

models:
  - model: Qwen/Qwen2.5-14B
  - model: Qwen/Qwen2.5-14B-Instruct-1M
  - model: deepseek-ai/DeepSeek-R1-Distill-Qwen-14B

merge_method: task_arithmetic
base_model: Qwen/Qwen2.5-14B
parameters:
  normalize: false
  int8_mask: true
dtype: float16

(untested)
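For readers unfamiliar with the method, here is a minimal sketch of the arithmetic that a task_arithmetic merge is generally understood to perform: each fine-tuned model contributes its difference from the base model (its "task vector"), and the weighted sum of those differences is added back to the base. The function name `task_arithmetic`, the toy parameter values, and the weights of 1.0 are illustrative assumptions; the config above does not state any weights, and this is not mergekit's actual implementation.

```python
def task_arithmetic(base, finetuned, weights):
    """Merge flat parameter lists: base + sum_i w_i * (finetuned_i - base).

    Hypothetical helper for illustration only; real merges operate on
    full model tensors, not short Python lists.
    """
    merged = list(base)
    for model, w in zip(finetuned, weights):
        for k, (p, b) in enumerate(zip(model, base)):
            # Add this model's weighted task vector component.
            merged[k] += w * (p - b)
    return merged


# Toy stand-ins for the three models in the guessed config (made-up values).
base = [1.0, 2.0, 3.0]      # plays the role of Qwen/Qwen2.5-14B
instruct = [1.5, 2.0, 2.0]  # plays the role of the Instruct-1M model
distill = [1.0, 3.0, 3.5]   # plays the role of the R1 distill

# Assumed weights of 1.0 each, with no rescaling of the weights.
merged = task_arithmetic(base, [instruct, distill], [1.0, 1.0])
print(merged)  # [1.5, 3.0, 2.5]
```

With a weight of 0.0 a model contributes nothing and the result equals the base, which is a quick sanity check on any weights you try.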

mkurman

Owner about 11 hours ago

Sure, I just added it to the README.md file.
