miscii-14b-0218

I think thereโ€™s a reason Iโ€™m a shadow, but she looks like an angel.

Image source: The Angelโ€™s Message

This is a merge of pre-trained language models created using mergekit.

Merge Details

Merge Method

This model was merged using the Model Stock merge method using /Users/sthenno/models/tempesthenno-ppo-enchanted as a base.

Models Merged

The following models were included in the merge:

  • /Users/sthenno/models/tempesthenno-sft-0218-stage2-ckpt40
  • /Users/sthenno/models/tempesthenno-sft-0218-stage2-ckpt50
  • /Users/sthenno/models/tempesthenno-sft-0218-stage2-ckpt60
  • /Users/sthenno/models/tempesthenno-sft-0218-ckpt60
  • /Users/sthenno/models/tempesthenno-sft-0218-ckpt80

Configuration

The following YAML configuration was used to produce this model:

name: tempesthenno-ms-0218
merge_method: model_stock
base_model: /Users/sthenno/models/tempesthenno-ppo-enchanted
tokenizer:
  source: base
dtype: float32
out_dtype: bfloat16
parameters:
  int8_mask: true
  normalize: true
  rescale: false
models:
  - model: /Users/sthenno/models/tempesthenno-sft-0218-ckpt60
  - model: /Users/sthenno/models/tempesthenno-sft-0218-ckpt80
  - model: /Users/sthenno/models/tempesthenno-sft-0218-stage2-ckpt40
  - model: /Users/sthenno/models/tempesthenno-sft-0218-stage2-ckpt50
  - model: /Users/sthenno/models/tempesthenno-sft-0218-stage2-ckpt60

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric Value
Avg. 42.90
IFEval (0-Shot) 76.56
BBH (3-Shot) 50.64
MATH Lvl 5 (4-Shot) 51.44
GPQA (0-shot) 17.79
MuSR (0-shot) 13.21
MMLU-PRO (5-shot) 47.75
Downloads last month
550
Safetensors
Model size
14.8B params
Tensor type
BF16
ยท
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.

Model tree for sthenno-com/miscii-14b-0218

Merge model
this model
Finetunes
2 models
Merges
6 models
Quantizations
2 models

Space using sthenno-com/miscii-14b-0218 1

Evaluation results