Qwen2.5-14B
This is a merge of pre-trained language models created using mergekit.
This model was merged using the DARE TIES merge method, with CultriX/Qwen2.5-14B-MegaMerge-pt1 as the base. DARE TIES sparsifies each source model's parameter deltas (rescaling what remains) and resolves sign conflicts TIES-style before merging the deltas into the base model.
The following models were included in the merge:

- CultriX/Qwen2.5-14B-MergeStock
- CultriX/Qwen2.5-14B-Wernicke
The following YAML configuration was used to produce this model:
```yaml
# final_dare_ties_merge.yaml
models:
  - model: CultriX/Qwen2.5-14B-MergeStock
    parameters:
      density: 0.5  # Retain 50% of the most significant parameters
      weight: 0.6   # Emphasize MergeStock's contributions
  - model: CultriX/Qwen2.5-14B-Wernicke
    parameters:
      density: 0.5  # Retain 50% of the most significant parameters
      weight: 0.4   # Incorporate Wernicke's contributions
merge_method: dare_ties
base_model: CultriX/Qwen2.5-14B-MegaMerge-pt1
parameters:
  normalize: true
  int8_mask: true
dtype: bfloat16
tokenizer_source: Qwen/Qwen2.5-14B-Instruct
```
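Besides the `mergekit-yaml` CLI, mergekit exposes a Python entry point for running a config like the one above. The following is a minimal sketch based on mergekit's documented `run_merge` API; the config path and output directory are placeholders, not the exact paths used to build this model:

```python
import yaml
import torch

from mergekit.config import MergeConfiguration
from mergekit.merge import MergeOptions, run_merge

# Parse the DARE TIES config shown above (path is a placeholder).
with open("final_dare_ties_merge.yaml", "r", encoding="utf-8") as fp:
    merge_config = MergeConfiguration.model_validate(yaml.safe_load(fp))

# Execute the merge; the output directory is arbitrary.
run_merge(
    merge_config,
    out_path="./Qwen2.5-14B-merged",
    options=MergeOptions(
        cuda=torch.cuda.is_available(),  # run tensor ops on GPU if available
        copy_tokenizer=True,             # honors tokenizer_source in the config
        lazy_unpickle=False,
        low_cpu_memory=False,
    ),
)
```

The equivalent one-liner is `mergekit-yaml final_dare_ties_merge.yaml ./Qwen2.5-14B-merged`.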
Detailed Open LLM Leaderboard evaluation results can be found here.
| Metric              | Value |
|---------------------|------:|
| Avg.                | 36.69 |
| IFEval (0-shot)     | 56.83 |
| BBH (3-shot)        | 50.91 |
| MATH Lvl 5 (4-shot) | 27.34 |
| GPQA (0-shot)       | 17.23 |
| MuSR (0-shot)       | 18.74 |
| MMLU-PRO (5-shot)   | 49.12 |
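The scores above come from the leaderboard's evaluation harness. For a quick local smoke test of the merged checkpoint, a minimal sketch using the standard `transformers` API is shown below; the model path is a placeholder for the merge output directory from the sketch above, or the model's Hub repo id:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Placeholder: output directory of the merge, or the Hugging Face repo id.
model_path = "./Qwen2.5-14B-merged"

tokenizer = AutoTokenizer.from_pretrained(model_path)
model = AutoModelForCausalLM.from_pretrained(
    model_path,
    torch_dtype=torch.bfloat16,  # matches the dtype in the merge config
    device_map="auto",
)

# Qwen2.5 uses an instruct-style chat template (tokenizer_source above).
messages = [{"role": "user", "content": "Briefly explain model merging."}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

outputs = model.generate(inputs, max_new_tokens=128)
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```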