nyuuzyou PRO
Recent Activity
updated a dataset 2 days ago: nyuuzyou/fimfiction
published a dataset 2 days ago: nyuuzyou/fimfiction
posted an update 6 days ago
🌐 Public MediaWiki Collection Dataset - https://huggingface.co/datasets/nyuuzyou/wikis
Collection of 1.66M+ articles from 930 public MediaWiki instances featuring:
- Full article content from diverse public wikis across the internet
- Complete metadata including templates, categories, and section structure
- Rich structural information preserving wiki organization and links
- Multilingual content across 35+ languages including English, Chinese, Spanish, and more
- Regional language variants including US/UK English, Brazilian Portuguese, and Traditional/Simplified Chinese
Key contents:
- 1,662,448 wiki articles with full text
- Extensive metadata including templates, categories, sections
- Internal wikilinks and external reference information
- Cross-domain knowledge spanning multiple topics and fields
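As a rough illustration of how a collection like this might be explored, here is a minimal sketch that tallies records by one metadata field. Everything beyond the repository id `nyuuzyou/wikis` is an assumption: the post does not give column names, so the `field` argument is a placeholder, and the existence of a default configuration with a `train` split is a guess. Check the dataset card for the actual schema before relying on it.

```python
from collections import Counter
from itertools import islice
from typing import Any, Dict, Iterable


def count_by_field(
    records: Iterable[Dict[str, Any]], field: str, limit: int = 1000
) -> Counter:
    """Tally the values of `field` over the first `limit` records."""
    counts: Counter = Counter()
    for record in islice(records, limit):
        # Records lacking the field are tallied under "<missing>" instead of raising.
        counts[str(record.get(field, "<missing>"))] += 1
    return counts


def stream_wikis():
    """Open the dataset lazily. Needs `pip install datasets` and network access."""
    # Imported here so count_by_field stays usable without the library installed.
    from datasets import load_dataset

    # Assumption: a default configuration with a "train" split exists.
    return load_dataset("nyuuzyou/wikis", split="train", streaming=True)
```

A call such as `count_by_field(stream_wikis(), field="language", limit=1000).most_common(5)` would then show the most frequent values in a sample, where `"language"` is a hypothetical column name. Streaming avoids downloading all 1.66M+ articles just to inspect a few.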
nyuuzyou's activity
[bot] Conversion to Parquet
#1 opened 10 days ago by parquet-converter

Details on how the data was obtained?
1
#1 opened about 2 months ago by mossybwuny

dupes
1
#1 opened 2 months ago by huggingfaceshredding408