LLMs Lost in Translation: M-ALERT uncovers Cross-Linguistic Safety Gaps Paper • 2412.15035 • Published 6 days ago • 4
What's the Meaning of Superhuman Performance in Today's NLU? Paper • 2305.08414 • Published May 15, 2023 • 1
Truth or Mirage? Towards End-to-End Factuality Evaluation with LLM-OASIS Paper • 2411.19655 • Published 27 days ago • 20
Word Sense Linking: Disambiguating Outside the Sandbox Paper • 2412.09370 • Published 14 days ago • 8
Word Sense Linking Collection Word Sense Linking is the task designed to identify and disambiguate spans of text to their most suitable senses from a reference inventory. • 6 items • Updated 13 days ago • 6
FENICE Collection FENICE is a metric for summarization factuality, with a focus on interpretability. FENICE leverages NLI and claim extraction to assess factuality. • 4 items • Updated 21 days ago • 4
Babelscape/t5-base-summarization-claim-extractor Text2Text Generation • Updated 21 days ago • 664 • 5
Semantic Role Labeling Meets Definition Modeling: Using Natural Language to Describe Predicate-Argument Structures Paper • 2212.01094 • Published Dec 2, 2022 • 2
Echoes from Alexandria: A Large Resource for Multilingual Book Summarization Paper • 2306.04334 • Published Jun 7, 2023 • 2
Guardians of the Machine Translation Meta-Evaluation: Sentinel Metrics Fall In! Paper • 2408.13831 • Published Aug 25 • 5
FENICE: Factuality Evaluation of summarization based on Natural language Inference and Claim Extraction Paper • 2403.02270 • Published Mar 4 • 2