vicgalle/RoleBeagle-11B

RoleBeagle-11B

A DPO fine-tune of vicgalle/CarbonBeagle-11B-truthy on a subset of OpenHermesPreferences containing role-play (RP) conversations. It keeps most of the intelligence of CarbonBeagle-11B and, hopefully, role-plays better.
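
For readers who want to try the model, the following is a minimal sketch of loading it for text generation with the Hugging Face `transformers` library. It is not taken from the model card: it assumes `transformers`, `torch`, and `accelerate` are installed, that enough GPU or CPU memory is available, and that a plain-text prompt is acceptable. The card does not document a prompt template, so the `generate` helper and the example prompt are hypothetical.

```python
# Repository id as shown on the model page.
MODEL_ID = "vicgalle/RoleBeagle-11B"


def generate(prompt: str, max_new_tokens: int = 128) -> str:
    """Load the model and return the decoded output (prompt plus continuation).

    Heavy imports live inside the function so that importing this
    snippet does not require torch or transformers to be installed.
    """
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(
        MODEL_ID,
        torch_dtype=torch.float16,  # the page lists the weights as FP16
        device_map="auto",          # assumption: accelerate is installed
    )
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)


if __name__ == "__main__":
    # Hypothetical role-play prompt; adjust to whatever format works best.
    print(generate("You are a ship's captain. Greet your new crew."))
```

Reloading the model on every call is wasteful; a longer-lived script would load the tokenizer and model once and reuse them.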

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

| Metric | Value |
|---|---|
| Avg. | 76.06 |
| AI2 Reasoning Challenge (25-Shot) | 72.35 |
| HellaSwag (10-Shot) | 89.77 |
| MMLU (5-Shot) | 66.35 |
| TruthfulQA (0-shot) | 77.92 |
| Winogrande (5-shot) | 84.06 |
| GSM8k (5-shot) | 65.88 |
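
As a quick sanity check, the reported average can be recomputed from the six benchmark scores. The sketch below assumes that "Avg." is the unweighted arithmetic mean of the six rows, which the card does not state explicitly; the variable names are illustrative.

```python
# Benchmark scores copied from the table above.
scores = {
    "AI2 Reasoning Challenge (25-Shot)": 72.35,
    "HellaSwag (10-Shot)": 89.77,
    "MMLU (5-Shot)": 66.35,
    "TruthfulQA (0-shot)": 77.92,
    "Winogrande (5-shot)": 84.06,
    "GSM8k (5-shot)": 65.88,
}

# Assumption: the reported average is the unweighted mean of all six scores.
average = sum(scores.values()) / len(scores)

print(f"Recomputed average: {average:.3f}")
```

Under that assumption the recomputed mean agrees with the reported 76.06 up to rounding.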

Safetensors · Model size: 10.7B params · Tensor type: FP16
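
The parameter count and tensor type listed above give a rough lower bound on the memory needed just to hold the weights. The sketch below assumes 2 bytes per FP16 parameter and ignores activations, the KV cache, and framework overhead, so real usage will be higher; the helper name is illustrative.

```python
def weight_memory_gib(num_params: float, bytes_per_param: int) -> float:
    """Return the memory needed to store the weights, in GiB."""
    return num_params * bytes_per_param / 1024**3


NUM_PARAMS = 10.7e9  # parameter count from the model page
FP16_BYTES = 2       # one FP16 value occupies 2 bytes

fp16_gib = weight_memory_gib(NUM_PARAMS, FP16_BYTES)
print(f"FP16 weights: about {fp16_gib:.1f} GiB")
```

The same helper can be called with a different `bytes_per_param` to estimate the footprint of other precisions, for example 1 byte per parameter for an 8-bit quantization.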

Evaluation results

All values are from the Open LLM Leaderboard.

| Benchmark | Metric | Split | Value |
|---|---|---|---|
| AI2 Reasoning Challenge (25-Shot) | normalized accuracy | test | 72.350 |
| HellaSwag (10-Shot) | normalized accuracy | validation | 89.770 |
| MMLU (5-Shot) | accuracy | test | 66.350 |
| TruthfulQA (0-shot) | mc2 | validation | 77.920 |
| Winogrande (5-shot) | accuracy | validation | 84.060 |
| GSM8k (5-shot) | accuracy | test | 65.880 |
