GlotCC: An Open Broad-Coverage CommonCrawl Corpus and Pipeline for Minority Languages Paper • 2410.23825 • Published Oct 31, 2024 • 3
LangSAMP: Language-Script Aware Multilingual Pretraining Paper • 2409.18199 • Published Sep 26, 2024 • 1
MEXA: Multilingual Evaluation of English-Centric LLMs via Cross-Lingual Alignment Paper • 2410.05873 • Published Oct 8, 2024 • 3
MemLLM: Finetuning LLMs to Use An Explicit Read-Write Memory Paper • 2404.11672 • Published Apr 17, 2024
Consistent Document-Level Relation Extraction via Counterfactuals Paper • 2407.06699 • Published Jul 9, 2024
MEXA: Multilingual Evaluation of English-Centric LLMs via Cross-Lingual Alignment Paper • 2410.05873 • Published Oct 8, 2024 • 3
GlobEnc: Quantifying Global Token Attribution by Incorporating the Whole Encoder Layer in Transformers Paper • 2205.03286 • Published May 6, 2022
Not All Models Localize Linguistic Knowledge in the Same Place: A Layer-wise Probing on BERToids' Representations Paper • 2109.05958 • Published Sep 13, 2021
DecompX: Explaining Transformers Decisions by Propagating Token Decomposition Paper • 2306.02873 • Published Jun 5, 2023 • 1