raptorkwok committed
Commit 9450e83 · verified · 1 Parent(s): 05a25bd

Update README.md

Files changed (1)
  1. README.md +8 -0
README.md CHANGED
@@ -2,8 +2,16 @@
 language:
 - zh
 library_name: transformers
+base_model: fnlp/bart-base-chinese
+tags:
+- BART
+- Chinese
+- Traditional Chinese
+- Cantonese
 ---
 ## BertTokenizer-based Tokenizer that can tokenize Chinese/Cantonese sentences into phrases
+Apart from the original 51,271 tokens from the base tokenizer, 194,020 additional Chinese vocabularies are added to this tokenizer.
+
 Usage:
 ```
 from transformers import BertTokenizer