898604c7-14ac-4493-a7f3-977190e6f5c3

This model is a fine-tuned version of unsloth/Qwen2.5-Math-1.5B. The training dataset is not specified. Evaluation-set results are not listed separately; the validation loss recorded during training appears in the table below.

Model description

More information needed

The following hyperparameters were used during training:

learning_rate: 0.000218
train_batch_size: 32
eval_batch_size: 32
seed: 42
gradient_accumulation_steps: 4
total_train_batch_size: 128
optimizer: OptimizerNames.ADAMW_BNB with betas=(0.9, 0.999), epsilon=1e-08, and no additional optimizer arguments
lr_scheduler_type: constant
lr_scheduler_warmup_steps: 50
training_steps: 400
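
The relationship between the batch-size values above can be checked with a short sketch. It assumes a single training device, which this card does not state; `num_devices` and `effective_batch_size` are hypothetical names used only for illustration. Under that assumption the per-device batch size times the accumulation steps reproduces the reported total of 128.

```python
# Values copied from the hyperparameter list above.
train_batch_size = 32              # per-device batch size
gradient_accumulation_steps = 4
training_steps = 400

# Assumption: one training device. The card does not state the device count.
num_devices = 1


def effective_batch_size(per_device: int, accumulation: int, devices: int) -> int:
    """Number of examples that contribute to each optimizer step."""
    return per_device * accumulation * devices


total_train_batch_size = effective_batch_size(
    train_batch_size, gradient_accumulation_steps, num_devices
)

# Upper bound on examples processed if all 400 optimizer steps are run.
examples_processed = total_train_batch_size * training_steps

print(total_train_batch_size)  # 128
print(examples_processed)      # 51200
```

If training ran on more than one device, the per-device batch size or the accumulation count would have to differ from the listed values to reach the same total, so the single-device assumption is the one consistent with the numbers as reported.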

Training Loss	Epoch	Step	Validation Loss
No log	0.0024	1	5.4071
4.7199	0.1197	50	4.6484
4.411	0.2394	100	6.6507
4.7112	0.3591	150	4.5234
4.2649	0.4788	200	5.7118
4.5373	0.5984	250	4.5769
4.1832	0.7181	300	5.3070