llama-3-neural-chat-v1-8b

Model Details

Model Description

I fine-tuned llama-3 8B on an approach similar to Intel's neural chat language model. I have slightly modified the data sources so it is stronger in coding, math, and writing. I use both SFT and DPO.

Developed by: Locutusque
Model type: Built with Meta Llama 3
Language(s) (NLP): Many?
License: Llama 3 license https://huggingface.co./meta-llama/Meta-Llama-3-8B/blob/main/LICENSE

Quants

EXL2 @bartowski

https://huggingface.co./bartowski/llama-3-neural-chat-v1-8b-exl2

GGUF @bartowski

https://huggingface.co./bartowski/llama-3-neural-chat-v1-8b-GGUF

Uses

This model has great performance in writing and coding.

Training Data

Open-Orca/SlimOrca-Dedup
jondurbin/airoboros-3.2
microsoft/orca-math-word-problems-200k
m-a-p/Code-Feedback
MaziyarPanahi/WizardLM_evol_instruct_V2_196k
mlabonne/orpo-dpo-mix-40k

Direct Use

Conversational AI.

Evaluations

Tasks	Version	Filter	n-shot	Metric	Value		Stderr
truthfulqa_mc2	2	none	0	acc	0.5627	±	0.0154
gsm8k	3	strict-match	5	exact_match	0.5481	±	0.0137
		flexible-extract	5	exact_match	0.5557	±	0.0137
agieval_nous	N/A	none	0	acc	0.3763	±	0.0093
		none	0	acc_norm	0.3665	±	0.0093
- agieval_aqua_rat	1	none	0	acc	0.2087	±	0.0255
		none	0	acc_norm	0.2047	±	0.0254
- agieval_logiqa_en	1	none	0	acc	0.3456	±	0.0187
		none	0	acc_norm	0.3594	±	0.0188
- agieval_lsat_ar	1	none	0	acc	0.1826	±	0.0255
		none	0	acc_norm	0.1783	±	0.0253
- agieval_lsat_lr	1	none	0	acc	0.3549	±	0.0212
		none	0	acc_norm	0.3451	±	0.0211
- agieval_lsat_rc	1	none	0	acc	0.5242	±	0.0305
		none	0	acc_norm	0.5130	±	0.0305
- agieval_sat_en	1	none	0	acc	0.6650	±	0.0330
		none	0	acc_norm	0.6505	±	0.0333
- agieval_sat_en_without_passage	1	none	0	acc	0.4175	±	0.0344
		none	0	acc_norm	0.3738	±	0.0338
- agieval_sat_math	1	none	0	acc	0.4227	±	0.0334
		none	0	acc_norm	0.3682	±	0.0326

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric	Value
Avg.	66.50
AI2 Reasoning Challenge (25-Shot)	60.84
HellaSwag (10-Shot)	84.13
MMLU (5-Shot)	64.69
TruthfulQA (0-shot)	56.34
Winogrande (5-shot)	78.22
GSM8k (5-shot)	54.81

Downloads last month: 153

Safetensors

Model size

8.03B params

Tensor type

BF16

Inference Examples

Text Generation

This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Model tree for Locutusque/llama-3-neural-chat-v1-8b

Base model

meta-llama/Meta-Llama-3-8B

Finetuned

(388)

this model

Merges

10 models

Quantizations

2 models

Datasets used to train Locutusque/llama-3-neural-chat-v1-8b

Spaces using Locutusque/llama-3-neural-chat-v1-8b 6

Evaluation results

normalized accuracy on AI2 Reasoning Challenge (25-Shot)
test set Open LLM Leaderboard

60.840
normalized accuracy on HellaSwag (10-Shot)
validation set Open LLM Leaderboard

84.130
accuracy on MMLU (5-Shot)
test set Open LLM Leaderboard

64.690
mc2 on TruthfulQA (0-shot)
validation set Open LLM Leaderboard

56.340
accuracy on Winogrande (5-shot)
validation set Open LLM Leaderboard

78.220
accuracy on GSM8k (5-shot)
test set Open LLM Leaderboard

54.810

View on Papers With Code