ASTRAL: Automated Safety Testing of Large Language Models Paper • 2501.17132 • Published 8 days ago • 2
Early External Safety Testing of OpenAI's o3-mini: Insights from the Pre-Deployment Evaluation Paper • 2501.17749 • Published 7 days ago • 12
Early External Safety Testing of OpenAI's o3-mini: Insights from the Pre-Deployment Evaluation Paper • 2501.17749 • Published 7 days ago • 12