Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
76
21
David Dale
cointegrated
Follow
Tatbooy's profile picture
IvT-DS's profile picture
rokokzk's profile picture
76 followers
·
8 following
https://daviddale.ru/en
cointegrated
avidale
AI & ML interests
Research engineer at FAIR, Meta. Some pet projects on NLP for under-resourced languages. Interests: Machine translation, Chatbots, applied NLU, controllable text generation (in particular, text style transfer), miniature models.
Recent Activity
new
activity
4 days ago
openlanguagedata/flores_plus:
[DRAFT] Fix orthography in the Russian dev set
new
activity
4 days ago
openlanguagedata/flores_plus:
Fix encoding at chv devtest
liked
a dataset
17 days ago
google/wmt24pp
View all activity
Organizations
cointegrated
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
New activity in
openlanguagedata/flores_plus
4 days ago
[DRAFT] Fix orthography in the Russian dev set
4
#4 opened 3 months ago by
cointegrated
Fix encoding at chv devtest
4
#9 opened about 1 month ago by
alexantonov
liked
a dataset
17 days ago
google/wmt24pp
Viewer
•
Updated
18 days ago
•
54.9k
•
2.97k
•
28
New activity in
slone/nllb-rus-tyv-v1
17 days ago
Adding `safetensors` variant of this model
#1 opened 17 days ago by
SFconvertbot
New activity in
cointegrated/LaBSE-en-ru
20 days ago
Warn Some weights of the model checkpoint at cointegrated/LaBSE-en-ru were not used when initializing BertModel:
1
#4 opened 5 months ago by
alashkov83
New activity in
slone/LaBSE-shallow-distilled-bak
about 1 month ago
Adding `safetensors` variant of this model
#1 opened about 1 month ago by
SFconvertbot
New activity in
cointegrated/SONAR_200_text_encoder
about 1 month ago
can you please do the same for decoder
1
#2 opened 3 months ago by
damerajee
New activity in
slone/finugorbib
about 1 month ago
[bot] Conversion to Parquet
#1 opened about 1 month ago by
parquet-converter
liked
a dataset
about 1 month ago
udmurtNLP/udmurt-russian-parallel-corpora
Viewer
•
Updated
Feb 1
•
102k
•
89
•
3
New activity in
openlanguagedata/flores_plus
about 1 month ago
Added Dargwa dev set to flores_plus
2
#3 opened 3 months ago by
Murtazali
published
a dataset
about 1 month ago
slone/finugorbib
Viewer
•
Updated
Jan 27
•
849k
•
199
•
1
updated
a dataset
about 1 month ago
slone/finugorbib
Viewer
•
Updated
Jan 27
•
849k
•
199
•
1
liked
a dataset
about 2 months ago
alexantonov/chukot_russian_flores_sample
Viewer
•
Updated
Jan 31
•
100
•
135
•
4
liked
a model
about 2 months ago
Helsinki-NLP/opus-mt-tc-bible-big-mul-mul
Translation
•
Updated
Oct 12, 2024
•
955
•
•
4
New activity in
openlanguagedata/flores_plus
2 months ago
Add data integrity tests
1
#7 opened 2 months ago by
cointegrated
updated
a dataset
2 months ago
openlanguagedata/flores_plus
Viewer
•
Updated
13 days ago
•
434k
•
2k
•
23
New activity in
openlanguagedata/flores_plus
2 months ago
Two sentences in the dev set (one Lombard and one Tamasheq-Tifinagh) seem to be missing
#6 opened 2 months ago by
cointegrated
liked
2 datasets
3 months ago
aronlp/aromanian-romanian-MT-corpus
Viewer
•
Updated
Jan 15
•
105k
•
17
•
1
ontocord/fineweb-permissive-multilingual-2m
Viewer
•
Updated
Oct 9, 2024
•
2.23M
•
176
•
2
updated
a dataset
3 months ago
facebook/LCFO
Viewer
•
Updated
Dec 13, 2024
•
1.55k
•
75
•
3
Load more