Vous trouverez ci-dessous une liste de 258 jeux de données en français mal référencés sur le Hub :
Below is a list of 258 French datasets that are badly referenced on the Hub:



adiren7/darija_to_french_speech_to_text
AdrienB134/ASN_Lettres_De_Suivi_filtered
AdrienB134/ASN_pairs
AdrienB134/easyfinetune_Instruct_test
AdrienB134/easyfinetune_QA_test
AdrienB134/Emilia-dataset-french-split
AdrienB134/french-tts-mul
AdrienB134/french-unique-speaker-tts
AdrienB134/Instruct_ASN_medium
AdrienB134/Instruct_ASN_small
AdrienB134/QA_ASN_small
AdrienB134/QA_ASN_test
AdrienB134/Small-markdown
Adjoumani/translations_french_baoule_V1
adlbh/rekrute-2005-2022
adwaitagashe/bordIRlines
ahmadSiddiqi/x-stance_fr
ahazeemi/iwslt14-en-fr
ai4bharat/intel
ai4bharat/recon
allenai/WildChat-1M
almanach/LADaS
body-parts
alvations/c4p0-v1-en-fr
alvations/c4p0-v1-fr-en
alvations/c4p0-v2-en-fr
alvations/c4p0-v2-fr-en
alvations/dslml24-jelly-submission-fr
alvations/food-and-beverage
alvations/units
alvations/xnli-15way
Alwaly/fr_voxpopuli/
Alwaly/french-Wolof-lang-classification
Alwaly/frenchToWolof
Alwaly/frenchToWolof_
Alwaly/multilingual-wolof-french-asr
Alwaly/multilingual-wolof-french-en
AmazonScience/mintaka
arbml/UFAL
astha/languagemodelsforRNNdecomposition
babs/unlabelled-french-voxpopuli
beethogedeon/fr_fon
BitTranslate/chatgpt-prompts-French
bio-datasets/e3c
bosbos/french_english_instruct
Brendan/nlp244_french_snli
chocobearz/BERSt
cjvt/janes_preklop
CohereForAI/m-ArenaHard
coref-data/corefud_raw
corto-ai/open-australian-legal-multi-lingual-qa
Databasesprojec/FinStmts_ConsUncons_French_Predict_part1
Databasesprojec/FinStmts_ConsUncons_French_Predict_part2
Databasesprojec/FinStmts_ConsUncons_French_SeqClass
Databasesprojec/FinStmts_ConsUncons_Reduced_UndersampleMajority_French_SeqClass
Databoost/TTS_Multilingual_Data
dataset-rewriter/SmallTalkDialogues-10-translated-to-proper-French-466b
dataset-rewriter/SmallTalkDialogues-translated-to-proper-French-466b
ekazuki/french_deputies_tweet
ekazuki/french_deputies_tweet_old
ekazuki/french_deputies_tweet_sentiment
ekazuki/text_to_french_parliament_group
ekazuki/text_to_french_parliament_group_beta
ekazuki/text_to_french_parliament_group_debates
ekazuki/text_to_french_parliament_group_written_questions
EssalhiSara/french.corpus
EssalhiSara/French_corpus
Farah21/frenchOrientation
fdaudens/aya_dataset_french_example
fdaudens/aya_french_dpo
ferrazzipietro/e3c
FreedomIntelligence/ApolloCorpus
FreedomIntelligence/alpaca-gpt4-french
FreedomIntelligence/MMLU_French
FreedomIntelligence/sharegpt-french
freds0/cml_tts_dataset_french
gasp/french_rap_songs
Geraldine/bso-publications-indexation-50k
gmnlp/tico19
GregoryD/explicit-function-calling-french
gustawdaniel/ngram-google-2012
Hazzzardous/synthetic-translations-6k-unvalidated
hcoxec/french_100k
hcoxec/french_danish
hcoxec/french_danish_mix
hcoxec/french_finnish
hcoxec/french_finnish_mix
hcoxec/french_german
hcoxec/french_german_mix
hcoxec/french_romanian
hcoxec/french_romanian_mix
hcoxec/french_spanish
hcoxec/french_spanish_mix
imvladikon/paranames
infinite-dataset-hub/NoGunFranceText
infinite-dataset-hub/Pedale-FrenchTextCorpus
Intuit-GenSRF/all_french_datasets
iot/eng_to_french
ismailiismail/French_English_2
ismailiismail/FrEn_handpicks
ismailiismail/ner
ismailiismail/multi_paraphrasing_french
ismailiismail/paragraphss_paraphrasing
ismailiismail/paraphrasing_french
ismailiismail/paraphrasing_french_5000
iix/Parquet_FIles
Jasgui11/French
JohnnyEudora/Translationh
juletxara/pawsx_mt
juletxara/mgsm_mt
juletxara/xnli_mt
jwang214/arc_french
jzhang86/fr_ifeval
jzhang86/frmmlu_no_train
kaitchup/opus-English-to-French
kaitchup/opus-French-to-English
kloodia/alpaca_french
lidiapierre/fr_sexism_labelled
lincoln/newsquadfr
lightblue/mitsu
llama-lang-adapt/wura
LsTam/CQUAE_documents
LsTam/generated_user_questions_samplemd
LsTam/opus_instruction_format
LsTam/raw_samples_md
lyon-nlp/mteb-fr-reranking-syntec-s2p
m-biriuchinskii/ICDAR2017-filtered-1800-1900-3
m-biriuchinskii/ICDAR2017-filtered-1800-1900-4
m-biriuchinskii/ICDAR2017-filtered-1800-1900-5
Makxxx/wikinews
malteos/french_CEFR
manu/croissant_french_dataset
manu/dataset_en_fr
manu/dataset_en_fr_short
manu/dila_legifrance
manu/europarl-en-fr
manu/fr_corpora_parliament_processed-lowercased
manu/french-30b
manu/french-30b_separate
manu/french-bench-grammar-vocab-reading
manu/french_5p
manu/french_5p_separate
manu/french_bench_arc_challenge
manu/french_bench_hellaswag
manu/french_boolq
manu/french_librispeech_text_only
manu/french_poetry
manu/old_french_30b_separate
manu/opus100-en-fr
manu/theses_fr_2013_2023
manu/tok-corpus-shuffled
manu/wikisource_fr
manu/wmt-en-fr
mattlc/french_multicorpus_tft_v040
MBZUAI/ALM-Bench
MBZUAI/MINT_BAK
MBZUAI/multilingual-llava-bench-in-the-wild
MBZUAI/palo_multilingual_dataset
MBZUAI-Paris/Darija-SFT-Mixture
md-nishat-008/Mojo_Corpus
Mediform/sharegpt-french
mgb-dx-meetup/product-reviews
Michielo/Merged-LID-20
MilaNLProc/honest
misclassified/meps_speeches
musts/french
nedjmaou/MLMA_hate_speech
nguyenthanhasia/MSD_multilingual
nielsr/datacomp_small_french_captions
nirantk/french-books
nickcpk/handcrafted_en_fr_data
odunola/french-audio-preprocessed
odunola/french-english-preprocessed
odunola/french-english-unprocessed
odunola/french-preprocessed-2/
odunola/french-preprocessed-test
opsci/Astree
paulml/chatml-OpenHermes2.5-dpo-binarized-alpha-french
Panoramax/fr_road_sign_subsign
PHBJT/cml-tts
PHBJT/cml-tts-20percent-subset
PHBJT/cml-tts-20percent-subset-description
PITTI/MicRou
PITTI/MicRou_chunked
PleIAs/AMF-PDF
PleIAs/AMF-Text
PleIAs/common_corpus
PleIAs/FrenchCompariaCategorised
PleIAs/GATT_library
PleIAs/KaribuAI
PleIAs/Multilingual-PD
PleIAs/Pleias-1.0-eval
PleIAs/RAG-Evals
PleIAs/TEDEUTenders
PleIAs/WTO-PDF
PleIAs/WTO-Text
Poulpidot/FrenchHateSpeechSuperset
ProfessorBob/E5-finetune-dataset
ProfessorBob/instruct-MultiQ3
ProfessorBob/keyword_extraction
ProfessorBob/Long_context_chunking
ProfessorBob/text-embedding-dataset
Punchwe/ted_talk_multi_parallel
pvisnrt/french-snli
qanastek/ECDC
RaiBP/openwebtext2-first-30-chunks-lang-detect-raw-output
rasaboun/french
rcds/MultiLegalNeg
rcds/slds
rish16/MLe-SNLI
Sabrina1763/wikipedia_french
sakthivinash/Language_Detection
sagot/lefff_morpho
SEACrowd/paracotta_id
SergeiZu/french-film-reviews
shuyuej/French-MedExpQA-Benchmark
shuyuej/French-MMLU-Anatomy-Benchmark
shuyuej/French-MMLU-Clinical-Knowledge-Benchmark
shuyuej/French-MMLU-College-Biology-Benchmark
shuyuej/French-MMLU-College-Medicine-Benchmark
shuyuej/French-MMLU-Medical-Genetics-Benchmark
shuyuej/French-MMLU-Professional-Medicine-Benchmark
startlingadama/bambara-french
StephanAkkerman/frequency-words-2018
stefan-it/autotrain-flair-hipe2022-de-hmbert
sugam11/french-snli
tamedai/oscar_eu_6x3M
tbboukhari/Alpaca-in-french
the-french-artist/hatvp_declarations_text_index_embeds
Tngarg/french_eng
Tngarg/french_english
Tngarg/French_of
Tngarg/french_train
TrainingDataPro/amazon-reviews-dataset
UdyanSachdev/Multi_Language_Audio2Text
unicamp-dl/mmarco
unicamp-dl/mrobust
uvci/koumankan4dyula
vekkt/french_CEFR
Vivian12300/mathqa_test_French_by_llama-8B-instruct
WhissleAI/multilingual-libri-test-french
wraps/everyday-conversations-llama3.1-2k-french
yzhuang/arc_challenge_test_French_by_Meta-Llama-3-8B-Instruct
yzhuang/mathqa_test_French_by_Meta-Llama-3-8B-Instruct
yzhuang/mmlu_test_French_by_Meta-Llama-3-8B-Instruct
yezhengli9/wmt20-de-fr
yezhengli9/wmt20-fr-de
wasertech/TrainingSpeech
zelros/insurance-fr
zelros/pj
zelros/pj-axa
zelros/pj-ca
zelros/pj-ce
zelros/pj-da
zelros/pj-groupama
zelros/pjmaif
zelros/pj-lbp
zelros/pj-sg