c260e6c1-1557-4db9-a429-c8fc211838ca

This model is a fine-tuned version of unsloth/Qwen2.5-Math-1.5B on the None dataset. It achieves the following results on the evaluation set:

Model description

More information needed

More information needed

More information needed

The following hyperparameters were used during training:

learning_rate: 0.000204
train_batch_size: 32
eval_batch_size: 32
seed: 42
gradient_accumulation_steps: 4
total_train_batch_size: 128
optimizer: Use OptimizerNames.ADAMW_BNB with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
lr_scheduler_type: constant
lr_scheduler_warmup_steps: 50
training_steps: 400

Training Loss	Epoch	Step	Validation Loss
No log	0.0024	1	5.4215
4.7238	0.1197	50	4.6440
4.4158	0.2394	100	6.4881
4.7159	0.3591	150	4.5210
4.27	0.4788	200	5.9893
4.5483	0.5984	250	4.6019
4.186	0.7181	300	5.4003