nbeerbower
/

Lyra4-Gutenberg-12B

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Lyra4-Gutenberg-12B

Sao10K/MN-12B-Lyra-v4 finetuned on jondurbin/gutenberg-dpo-v0.1.

Method

ORPO Finetuned using an RTX 3090 + 4060 Ti for 3 epochs.

Fine-tune Llama 3 with ORPO

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric	Value
Avg.	19.63
IFEval (0-Shot)	22.12
BBH (3-Shot)	34.24
MATH Lvl 5 (4-Shot)	11.71
GPQA (0-shot)	9.17
MuSR (0-shot)	11.97
MMLU-PRO (5-shot)	28.57

Downloads last month: 81

Safetensors

Model size

12.2B params

Tensor type

BF16

·

Inference Providers NEW

Text Generation

This model is not currently available via any of the supported Inference Providers.

Model tree for nbeerbower/Lyra4-Gutenberg-12B

Base model

Sao10K/MN-12B-Lyra-v4

Finetuned

(2)

this model

Merges

Quantizations

Dataset used to train nbeerbower/Lyra4-Gutenberg-12B

Spaces using nbeerbower/Lyra4-Gutenberg-12B 4

Evaluation results

strict accuracy on IFEval (0-Shot)
Open LLM Leaderboard

22.120
normalized accuracy on BBH (3-Shot)
Open LLM Leaderboard

34.240
exact match on MATH Lvl 5 (4-Shot)
Open LLM Leaderboard

11.710
acc_norm on GPQA (0-shot)
Open LLM Leaderboard

9.170
acc_norm on MuSR (0-shot)
Open LLM Leaderboard

11.970
accuracy on MMLU-PRO (5-shot)
test set Open LLM Leaderboard

28.570

View on Papers With Code