TaiGary/Qwen2.5-1.5B-Instruct_rejection_sampling_MATH_training_short_CoT • Text Generation • Updated 3 days ago • 4
chloeli/qwen-2.5-1.5B-instruct-sft-lora-countdown-search-seq8k-5k • Text Generation • Updated 2 days ago • 1
chloeli/qwen-2.5-1.5B-instruct-sft-lora-countdown-search-react-seq8k-5k • Text Generation • Updated 2 days ago • 1
chloeli/qwen-2.5-1.5B-instruct-sft-lora-countdown-optimal-seq8k-5k • Text Generation • Updated 2 days ago • 2
jahyungu/Qwen2.5-1.5B-Instruct_Sky-T1-7B-step2-distill-5k • Text Generation • Updated about 15 hours ago
chloeli/qwen-2.5-1.5B-instruct-sft-lora-countdown-search-react-correct-seq10k-5k • Text Generation • Updated 30 minutes ago