GuardReasoner: Towards Reasoning-based LLM Safeguards Paper • 2501.18492 • Published 11 days ago • 80
Self-Play Preference Optimization for Language Model Alignment Paper • 2405.00675 • Published May 1, 2024 • 27
Mercury: An Efficiency Benchmark for LLM Code Synthesis Paper • 2402.07844 • Published Feb 12, 2024 • 1