Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up
ucalyptus 
posted an update May 20, 2024

Outstanding issues :

Fix Q4 demo
https://huggingface.co./spaces/ucalyptus/prem-1B-chat-webgpu/discussions/1#664b621d8742922b9e4f3de8
Also work on fp16 (see what onnxruntime-web has to say about this)

i realized that naively quantizing the prem-1b caused it to give gibberish outputs on the webgpu demo. lmao. stay tuned for better models.

In this post