End of training

Browse files

Files changed (5) hide show

README.md +34 -34
adapter_config.json +1 -1
adapter_model.bin +2 -2
model.safetensors +2 -2
training_args.bin +1 -1

README.md CHANGED Viewed

@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [microsoft/Phi-3-mini-4k-instruct](https://huggingface.co/microsoft/Phi-3-mini-4k-instruct) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.0782
 ## Model description
@@ -50,39 +50,39 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 3.8941        | 0.09  | 10   | 0.5252          |
-| 0.304         | 0.18  | 20   | 0.2303          |
-| 0.286         | 0.27  | 30   | 0.2215          |
-| 0.2077        | 0.36  | 40   | 0.1718          |
-| 0.153         | 0.45  | 50   | 0.1491          |
-| 0.1408        | 0.54  | 60   | 0.1229          |
-| 0.1063        | 0.63  | 70   | 0.1014          |
-| 0.1079        | 0.73  | 80   | 0.0884          |
-| 0.0837        | 0.82  | 90   | 0.0777          |
-| 0.081         | 0.91  | 100  | 0.0710          |
-| 0.0749        | 1.0   | 110  | 0.0747          |
-| 0.0557        | 1.09  | 120  | 0.0717          |
-| 0.0566        | 1.18  | 130  | 0.0735          |
-| 0.0588        | 1.27  | 140  | 0.0713          |
-| 0.0535        | 1.36  | 150  | 0.0778          |
-| 0.0672        | 1.45  | 160  | 0.0677          |
-| 0.0548        | 1.54  | 170  | 0.0702          |
-| 0.0536        | 1.63  | 180  | 0.0633          |
-| 0.0474        | 1.72  | 190  | 0.0653          |
-| 0.0537        | 1.81  | 200  | 0.0633          |
-| 0.0438        | 1.9   | 210  | 0.0669          |
-| 0.0483        | 1.99  | 220  | 0.0669          |
-| 0.0233        | 2.08  | 230  | 0.0749          |
-| 0.0245        | 2.18  | 240  | 0.0840          |
-| 0.017         | 2.27  | 250  | 0.0868          |
-| 0.0166        | 2.36  | 260  | 0.0866          |
-| 0.0258        | 2.45  | 270  | 0.0823          |
-| 0.015         | 2.54  | 280  | 0.0805          |
-| 0.018         | 2.63  | 290  | 0.0811          |
-| 0.0227        | 2.72  | 300  | 0.0795          |
-| 0.0217        | 2.81  | 310  | 0.0788          |
-| 0.0196        | 2.9   | 320  | 0.0784          |
-| 0.0198        | 2.99  | 330  | 0.0782          |
 ### Framework versions

 This model is a fine-tuned version of [microsoft/Phi-3-mini-4k-instruct](https://huggingface.co/microsoft/Phi-3-mini-4k-instruct) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.0771
 ## Model description
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 4.0653        | 0.09  | 10   | 0.4649          |
+| 0.2835        | 0.18  | 20   | 0.2213          |
+| 0.2868        | 0.27  | 30   | 0.2329          |
+| 0.2284        | 0.36  | 40   | 0.2334          |
+| 0.2317        | 0.45  | 50   | 0.2268          |
+| 0.219         | 0.54  | 60   | 0.2073          |
+| 0.211         | 0.63  | 70   | 0.1848          |
+| 0.1692        | 0.73  | 80   | 0.1167          |
+| 0.1311        | 0.82  | 90   | 0.1255          |
+| 0.1138        | 0.91  | 100  | 0.0954          |
+| 0.0918        | 1.0   | 110  | 0.0852          |
+| 0.0756        | 1.09  | 120  | 0.1067          |
+| 0.0746        | 1.18  | 130  | 0.0875          |
+| 0.0826        | 1.27  | 140  | 0.0751          |
+| 0.0723        | 1.36  | 150  | 0.0737          |
+| 0.0739        | 1.45  | 160  | 0.0685          |
+| 0.0674        | 1.54  | 170  | 0.0687          |
+| 0.0667        | 1.63  | 180  | 0.0673          |
+| 0.0599        | 1.72  | 190  | 0.0692          |
+| 0.0675        | 1.81  | 200  | 0.0677          |
+| 0.0565        | 1.9   | 210  | 0.0731          |
+| 0.0551        | 1.99  | 220  | 0.0714          |
+| 0.0316        | 2.08  | 230  | 0.0762          |
+| 0.0331        | 2.18  | 240  | 0.0865          |
+| 0.0233        | 2.27  | 250  | 0.0880          |
+| 0.0205        | 2.36  | 260  | 0.0863          |
+| 0.0316        | 2.45  | 270  | 0.0802          |
+| 0.0201        | 2.54  | 280  | 0.0790          |
+| 0.0221        | 2.63  | 290  | 0.0803          |
+| 0.0303        | 2.72  | 300  | 0.0800          |
+| 0.0302        | 2.81  | 310  | 0.0781          |
+| 0.027         | 2.9   | 320  | 0.0772          |
+| 0.0268        | 2.99  | 330  | 0.0771          |
 ### Framework versions

adapter_config.json CHANGED Viewed

@@ -2,7 +2,7 @@
   "adaptive_ratio": 0.01,
   "adaptive_ratio_decay": 1.0,
   "additive_modeling": false,
-  "allow_empty_lora": true,
   "auto_mapping": null,
   "base_model_name_or_path": "microsoft/Phi-3-mini-4k-instruct",
   "bias": "none",

   "adaptive_ratio": 0.01,
   "adaptive_ratio_decay": 1.0,
   "additive_modeling": false,
+  "allow_empty_lora": false,
   "auto_mapping": null,
   "base_model_name_or_path": "microsoft/Phi-3-mini-4k-instruct",
   "bias": "none",

adapter_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e88f93fb0c59645b9a00533277541668a1c4289052011aa73664e7a50a027cad
-size 431155958

 version https://git-lfs.github.com/spec/v1
+oid sha256:49f2071e803885d35b72f605d46fe40c34fd3eacbef905c2542cb324f8a01c21
+size 430750193

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:bcedff0ce95b454a5b844927ecad58d54f80716cec39f9be6d9b2df58c5b5618
-size 7921726216

 version https://git-lfs.github.com/spec/v1
+oid sha256:ce7a769653476c98e072fca20dcffcb654c4d95ddc052c5ac001902b792cfaea
+size 7921324520

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:6b909c63d8c491b4ce79d251fb97e112e33d7269e97d461365ed514b4b0b8ce8
 size 5176

 version https://git-lfs.github.com/spec/v1
+oid sha256:21827319a0e0c123cb673bf41f635c36c7887f1c75370eb95edc59f0bd884d24
 size 5176