TinyLlama-1.1B-ckpt-2.5T-exl2

EXL2 quants of TinyLlama/TinyLlama-1.1B-intermediate-step-1195k-token-2.5T intended for use in speculative decoding.

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference API
Unable to determine this model's library. Check the docs .

Collection including royallab/TinyLlama-1.1B-ckpt-2.5T-exl2