sho-takase committed
Commit 30348c3 • Parent(s): 00c38ae
revise readme
README.md CHANGED
@@ -41,7 +41,7 @@ We constructed this Sarashina2.1-1B model, which consists of 1 billion parameter
 First, we trained the model on 10 trillion tokens, including Japanese and English data extracted from web corpora.
 Then, we trained the model using 1 trillion tokens, predominantly consisting of Japanese data, to enhance its performance in Japanese.
 The following tables show the model's performance on Japanese and English tasks.
-We also show the performance of other public LLMs
+We also show the performance of other public LLMs for reference.
 
 #### Evaluation in Japanese tasks
 