Catherine Arnett

catherinearnett

AI & ML interests

multilingual NLP, tokenization

Articles

Organizations

catherinearnett's activity

upvoted an article about 19 hours ago
view article
Article

Releasing the largest multilingual open pretraining dataset

59