update ckpt with 6ish epochs of training with 1024 TOKENS as max output
9996867
13 Bytes
checkpoint-*/