Text Generation
Transformers
GGUF
TensorBoard
Safetensors
mistral
quantized
2-bit
3-bit
4-bit precision
5-bit
6-bit
8-bit precision
GGUF
gemma
alignment-handbook
trl
dpo
Generated from Trainer
conversational
dataset:argilla/dpo-mix-7k
arxiv:2310.16944
Eval Results
Inference Endpoints
has_space
text-generation-inference