Configurable Safety Tuning ⚙️ Collection CST allows for configurable inference-time control of LLM safety levels, so users can dictate model behavior based on the system prompt • 11 items • Updated Oct 27 • 2
steiner-preview Collection Reasoning models trained on synthetic data using reinforcement learning. • 3 items • Updated Oct 20 • 25