teven commited on
Commit
f9084d8
1 Parent(s): f45ec5f

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -3,7 +3,7 @@ language:
3
  - en
4
  ---
5
 
6
- V1 of an English/code tokenizer. Equal mix between:
7
  On the NL side:
8
  - Books
9
  - C4
 
3
  - en
4
  ---
5
 
6
+ V1 of an English/code tokenizer. Byte-level BPE, 64k vocab. Equal mix between:
7
  On the NL side:
8
  - Books
9
  - C4