Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
thu-ml
's Collections
STAIR
STAIR
updated
12 days ago
Datasets and Models for STAIR (Improving Safety Alignment with Introspective Reasoning)
Upvote
-
thu-ml/STAIR-Llama-3.1-8B-SFT
Text Generation
•
Updated
13 days ago
•
30
thu-ml/STAIR-Qwen2-7B-SFT
Text Generation
•
Updated
13 days ago
•
37
•
1
thu-ml/STAIR-SFT
Viewer
•
Updated
13 days ago
•
20k
•
54
thu-ml/STAIR-Prompts
Viewer
•
Updated
13 days ago
•
63k
•
54
STAIR: Improving Safety Alignment with Introspective Reasoning
Paper
•
2502.02384
•
Published
Feb 4
thu-ml/STAIR-Qwen2-7B-DPO-3
Text Generation
•
Updated
12 days ago
•
23
•
1
thu-ml/STAIR-Llama-3.1-8B-DPO-3
Text Generation
•
Updated
12 days ago
•
22
Upvote
-
Share collection
View history
Collection guide
Browse collections