1bcf7455-8983-47fe-8c86-61e96468a77e

This model is a fine-tuned version of unsloth/Qwen2.5-Math-1.5B on the None dataset. It achieves the following results on the evaluation set:

Model description

More information needed

More information needed

More information needed

The following hyperparameters were used during training:

learning_rate: 0.000213
train_batch_size: 32
eval_batch_size: 32
seed: 42
gradient_accumulation_steps: 4
total_train_batch_size: 128
optimizer: Use OptimizerNames.ADAMW_BNB with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
lr_scheduler_type: constant
lr_scheduler_warmup_steps: 50
training_steps: 400

Training Loss	Epoch	Step	Validation Loss
No log	0.0024	1	5.4121
4.7215	0.1197	50	4.6466
4.4132	0.2394	100	6.4424
4.6992	0.3591	150	4.5251
4.2661	0.4788	200	5.9203
4.5458	0.5984	250	4.6366
4.1857	0.7181	300	5.5895