LLM as a Broken Telephone: Iterative Generation Distorts Information Paper • 2502.20258 • Published 10 days ago • 17
view article Article Atlaset Dataset for Moroccan Darija: From Data Collection, Analysis, to Model Trainings By atlasia and 1 other • 3 days ago • 18
view article Article TerjamaBench: A Cultural Benchmark for English-Darija Machine Translation By imomayiz and 4 others • Jan 10 • 30
view article Article Finding Moroccan Arabic (Darija) in Fineweb 2 By omarkamali and 3 others • Dec 8, 2024 • 22
Lucie LLM Collection Open source LLM for French, English, German, Spanish and Italian • 7 items • Updated 21 days ago • 20
Claire LLM Collection Modeling of spoken dialogs (in French & English) • 9 items • Updated Dec 12, 2024 • 3
Atlas-Chat: Adapting Large Language Models for Low-Resource Moroccan Arabic Dialect Paper • 2409.17912 • Published Sep 26, 2024 • 29
The Curious Decline of Linguistic Diversity: Training Language Models on Synthetic Text Paper • 2311.09807 • Published Nov 16, 2023 • 1