raptorkwok
commited on
Update README.md
Browse files
README.md
CHANGED
@@ -2,8 +2,16 @@
|
|
2 |
language:
|
3 |
- zh
|
4 |
library_name: transformers
|
|
|
|
|
|
|
|
|
|
|
|
|
5 |
---
|
6 |
## BertTokenizer-based Tokenizer that can tokenize Chinese/Cantonese sentences into phrases
|
|
|
|
|
7 |
Usage:
|
8 |
```
|
9 |
from transformers import BertTokenizer
|
|
|
2 |
language:
|
3 |
- zh
|
4 |
library_name: transformers
|
5 |
+
base_model: fnlp/bart-base-chinese
|
6 |
+
tags:
|
7 |
+
- BART
|
8 |
+
- Chinese
|
9 |
+
- Traditional Chinese
|
10 |
+
- Cantonese
|
11 |
---
|
12 |
## BertTokenizer-based Tokenizer that can tokenize Chinese/Cantonese sentences into phrases
|
13 |
+
Apart from the original 51,271 tokens from the base tokenizer, 194,020 additional Chinese vocabularies are added to this tokenizer.
|
14 |
+
|
15 |
Usage:
|
16 |
```
|
17 |
from transformers import BertTokenizer
|