BlueHeeler-10M is a nanoGPT (GPT-2 architecture) model with 6 layers, 6 attention heads, an embedding dimension of 192, and a context size of 64, trained on scripts from the children's show Bluey.
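As a rough sketch, the architecture described above could be written as nanoGPT-style configuration values. The key names (`n_layer`, `n_head`, `n_embd`, `block_size`) follow nanoGPT's config convention. Reading "192" as the embedding width is an assumption, and the helper functions are hypothetical, added only to illustrate the arithmetic.

```python
# Hypothetical nanoGPT-style config; values taken from the model description.
config = dict(
    n_layer=6,      # transformer blocks
    n_head=6,       # attention heads per block
    n_embd=192,     # assumption: "192" is the embedding width
    block_size=64,  # context size
)


def head_dim(cfg):
    """Per-head dimension; the embedding width must divide evenly across heads."""
    assert cfg["n_embd"] % cfg["n_head"] == 0
    return cfg["n_embd"] // cfg["n_head"]


def block_weight_count(cfg):
    """Approximate weight count of the transformer blocks only.

    Each block has about 12 * n_embd^2 weights: 4 * n_embd^2 in attention
    (qkv projection plus output projection) and 8 * n_embd^2 in the MLP
    (4x expansion and back). Biases, layer norms, and embeddings are ignored.
    """
    return 12 * cfg["n_embd"] ** 2 * cfg["n_layer"]


print(head_dim(config))            # 32
print(block_weight_count(config))  # 2654208
```

Under these assumptions the transformer blocks account for roughly 2.65M weights. The rest of the model's size depends on the vocabulary used for the token embeddings, which the description does not state.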
iter 2000: loss 1.2913, time 30647.72ms, mfu 0.05%