Stefan Schweter PRO
stefan-it
AI & ML interests
Flair Library, NER & PoS Tagging, LM Pretraining (mostly encoder-only), Historical Language Models
Recent Activity
reacted
to
davanstrien's
post
with š„
about 4 hours ago
š Big step for multilingual AI data!
The Hugging Face community has rated educational content in languages spoken by 1.6 billion people! New additions:
ā¢ Japanese
ā¢ Italian
ā¢ Old High German
Learn more and contribute: https://huggingface.co./blog/davanstrien/fineweb2-community
These ratings can help enhance training data for major world languages.
updated
a model
about 5 hours ago
stefan-it/bert5urk
upvoted
a
paper
about 12 hours ago
Analyzing the Effect of Linguistic Similarity on Cross-Lingual Transfer:
Tasks and Experimental Setups Matter
Articles
Organizations
Posts
1
Post
1383
My latest project is the outcome of the last 2+ years working with TPUs from the amazing TPU Research Cloud (TRC) program and training Encoder-only LMs with the TensorFlow Model Garden library.
š Link: https://github.com/stefan-it/model-garden-lms
An overview of some features:
- Cheatsheet for setting-up a TPU VM Pod (with all necessary dependencies) to pretrain LMs with TF Model Garden
- Conversion scripts that convert TF Model Garden weights to Hugging Face Transformers-compatible models
- Supported architectures include BERT, BERT with Token Dropping and TEAMS
I also released BERT-based models pretrained on the great Hugging Face FineWeb and FineWeb-Edu datasets (10BT subset). With more to come!
š Model Hub Link: https://huggingface.co./model-garden-lms
If you find these resources useful, please give them a like!
Made from Bavarian Oberland with ā¤ļø and š„Ø.
š Link: https://github.com/stefan-it/model-garden-lms
An overview of some features:
- Cheatsheet for setting-up a TPU VM Pod (with all necessary dependencies) to pretrain LMs with TF Model Garden
- Conversion scripts that convert TF Model Garden weights to Hugging Face Transformers-compatible models
- Supported architectures include BERT, BERT with Token Dropping and TEAMS
I also released BERT-based models pretrained on the great Hugging Face FineWeb and FineWeb-Edu datasets (10BT subset). With more to come!
š Model Hub Link: https://huggingface.co./model-garden-lms
If you find these resources useful, please give them a like!
Made from Bavarian Oberland with ā¤ļø and š„Ø.
Collections
14
My pretrained LMs on FineWeb datasets - part of my TensorFlow Model Garden LMs project
A Collection of Historical Multilingual Language Models
-
dbmdz/bert-base-historic-multilingual-cased
Fill-Mask ā¢ Updated ā¢ 67 ā¢ 6 -
dbmdz/bert-base-historic-multilingual-64k-td-cased
Fill-Mask ā¢ Updated ā¢ 108 ā¢ 1 -
hmbyt5-preliminary/byt5-small-historic-multilingual-span20-flax
Text2Text Generation ā¢ Updated ā¢ 68 -
hmteams/teams-base-historic-multilingual-discriminator
Updated ā¢ 7
models
1334
stefan-it/bert5urk
Updated
ā¢
76
ā¢
3
stefan-it/bort-full
Fill-Mask
ā¢
Updated
ā¢
44
stefan-it/span-marker-gelectra-large-germeval14
Token Classification
ā¢
Updated
ā¢
18
ā¢
2
stefan-it/zeitungs-lm-v1
Updated
ā¢
5
ā¢
4
stefan-it/wav2vec2-large-xlsr-53-basque
Automatic Speech Recognition
ā¢
Updated
ā¢
3.29k
stefan-it/german-gpt2-larger
Text Generation
ā¢
Updated
ā¢
570
ā¢
8
stefan-it/xlstm-german-wikipedia
Text Generation
ā¢
Updated
ā¢
22
ā¢
7
stefan-it/flair-barner-wiki-coarse-gbert-large
Token Classification
ā¢
Updated
ā¢
21
ā¢
1
stefan-it/flair-clean-conll-5
Token Classification
ā¢
Updated
ā¢
6
stefan-it/flair-clean-conll-4
Token Classification
ā¢
Updated
ā¢
10
datasets
12
stefan-it/senti-anno
Viewer
ā¢
Updated
ā¢
929
ā¢
116
stefan-it/offenseval2020_tr
Viewer
ā¢
Updated
ā¢
35.3k
ā¢
85
stefan-it/dewiki-20230701-nltk-corpus
Viewer
ā¢
Updated
ā¢
39.4M
ā¢
71
ā¢
2
stefan-it/germeval14_no_wikipedia
Preview
ā¢
Updated
ā¢
97
stefan-it/histnero
Viewer
ā¢
Updated
ā¢
217k
ā¢
72
stefan-it/HisGermaNER
Preview
ā¢
Updated
ā¢
348
ā¢
2
stefan-it/co-funer
Preview
ā¢
Updated
ā¢
56
stefan-it/german-dbmdz-bert-corpus
Viewer
ā¢
Updated
ā¢
52.8M
ā¢
179
ā¢
2
stefan-it/span-marker-base-model-detection
Viewer
ā¢
Updated
ā¢
28
ā¢
61
stefan-it/flair-base-model-detection
Viewer
ā¢
Updated
ā¢
52
ā¢
49
ā¢
1