What are the training hyperparameters?

#4
by tongyx361 - opened

Are the training hyperparameters the same as those used for LLaMA-2? What about Llemma?
