Micro Mistral

A small version of mistral.

Similiar to some of the small llama variants, but uses GQA, tied embeddings, and sliding window attention.

Dataset Minipile Instruct Math OpenOrca Synthetic Data

TODO: Complete Dataset section

Downloads last month
284
Safetensors
Model size
449M params
Tensor type
BF16
·
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Datasets used to train Nbardy/micro-mistral