Golden Touchstone: A Comprehensive Bilingual Benchmark for Evaluating Financial Large Language Models Paper • 2411.06272 • Published 4 days ago • 2