a70d0e3e-e006-4729-a8b6-8c2fcdc885a0

This model is a fine-tuned version of unsloth/Qwen2.5-14B on the None dataset. It achieves the following results on the evaluation set:

Model description

More information needed

More information needed

More information needed

The following hyperparameters were used during training:

learning_rate: 0.000203
train_batch_size: 4
eval_batch_size: 4
seed: 42
gradient_accumulation_steps: 2
total_train_batch_size: 8
optimizer: Use OptimizerNames.ADAMW_BNB with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
lr_scheduler_type: cosine
lr_scheduler_warmup_steps: 50
training_steps: 407

Training Loss	Epoch	Step	Validation Loss
No log	0.0025	1	1.7940
1.6343	0.1230	50	1.5928
1.622	0.2460	100	1.5606
1.6077	0.3690	150	1.5299
1.5909	0.4920	200	1.5081
1.5842	0.6150	250	1.4716
1.5487	0.7380	300	1.4472
1.5147	0.8610	350	1.4311
1.5145	0.9840	400	1.4256