MiniSymp2 is A retrain of my MiniSymposium model attempt except with some more data and better practices.

added EOS tokens where they belong
made the prompt formats more diverse in the data so you could experiment / play with prompt format in context
added some new examples
measured loss curve to make sure I wasn't overfitting
used 8-bit lora instead of 4-bit qlora

GGUF

Model size

7.24B params

Architecture

llama

6-bit

8-bit

Inference API

Unable to determine this model's library. Check the docs .