lucienbaumgartner/mtg-spike-multilabel-distilbert

Files changed (4)

README.md +17 -17
adapter_config.json +2 -2
adapter_model.safetensors +1 -1
training_args.bin +1 -1

README.md CHANGED

@@ -20,13 +20,13 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [distilbert-base-uncased](https://huggingface.co/distilbert-base-uncased) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.3549
-- F1 Micro: 0.8864
-- F1 Macro: 0.8088
-- F1 Weighted: 0.8819
-- Precision: 0.8811
-- Recall: 0.8864
-- Accuracy: 0.8864
 ## Model description
@@ -57,16 +57,16 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss | F1 Micro | F1 Macro | F1 Weighted | Precision | Recall | Accuracy |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|:--------:|:-----------:|:---------:|:------:|:--------:|
-| No log        | 1.0   | 406  | 0.3510          | 0.8886   | 0.8121   | 0.8841      | 0.8834    | 0.8886 | 0.8886   |
-| 0.082         | 2.0   | 812  | 0.3618          | 0.8909   | 0.8193   | 0.8875      | 0.8864    | 0.8909 | 0.8909   |
-| 0.0848        | 3.0   | 1218 | 0.3768          | 0.8925   | 0.8206   | 0.8887      | 0.8879    | 0.8925 | 0.8925   |
-| 0.0703        | 4.0   | 1624 | 0.3706          | 0.8953   | 0.8242   | 0.8913      | 0.8908    | 0.8953 | 0.8953   |
-| 0.0806        | 5.0   | 2030 | 0.3868          | 0.8903   | 0.8169   | 0.8864      | 0.8855    | 0.8903 | 0.8903   |
-| 0.0806        | 6.0   | 2436 | 0.3988          | 0.8920   | 0.8211   | 0.8887      | 0.8876    | 0.8920 | 0.8920   |
-| 0.0721        | 7.0   | 2842 | 0.4085          | 0.8898   | 0.8174   | 0.8864      | 0.8853    | 0.8898 | 0.8898   |
-| 0.0701        | 8.0   | 3248 | 0.4035          | 0.8881   | 0.8149   | 0.8847      | 0.8835    | 0.8881 | 0.8881   |
-| 0.0704        | 9.0   | 3654 | 0.4072          | 0.8886   | 0.8152   | 0.8851      | 0.8840    | 0.8886 | 0.8886   |
-| 0.0665        | 10.0  | 4060 | 0.4010          | 0.8909   | 0.8184   | 0.8872      | 0.8863    | 0.8909 | 0.8909   |
 ### Framework versions

 This model is a fine-tuned version of [distilbert-base-uncased](https://huggingface.co/distilbert-base-uncased) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.2810
+- F1 Micro: 0.8770
+- F1 Macro: 0.7787
+- F1 Weighted: 0.8672
+- Precision: 0.8702
+- Recall: 0.8770
+- Accuracy: 0.8770
 ## Model description
 | Training Loss | Epoch | Step | Validation Loss | F1 Micro | F1 Macro | F1 Weighted | Precision | Recall | Accuracy |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|:--------:|:-----------:|:---------:|:------:|:--------:|
+| No log        | 1.0   | 406  | 0.2865          | 0.8643   | 0.7287   | 0.8438      | 0.8620    | 0.8643 | 0.8643   |
+| 0.2729        | 2.0   | 812  | 0.2924          | 0.8737   | 0.7671   | 0.8616      | 0.8671    | 0.8737 | 0.8737   |
+| 0.216         | 3.0   | 1218 | 0.2810          | 0.8770   | 0.7787   | 0.8672      | 0.8702    | 0.8770 | 0.8770   |
+| 0.1868        | 4.0   | 1624 | 0.2813          | 0.8787   | 0.7802   | 0.8685      | 0.8725    | 0.8787 | 0.8787   |
+| 0.1728        | 5.0   | 2030 | 0.2944          | 0.8748   | 0.7794   | 0.8664      | 0.8673    | 0.8748 | 0.8748   |
+| 0.1728        | 6.0   | 2436 | 0.2937          | 0.8825   | 0.7967   | 0.8760      | 0.8762    | 0.8825 | 0.8825   |
+| 0.155         | 7.0   | 2842 | 0.3007          | 0.8848   | 0.8039   | 0.8795      | 0.8789    | 0.8848 | 0.8848   |
+| 0.151         | 8.0   | 3248 | 0.3007          | 0.8875   | 0.8070   | 0.8818      | 0.8819    | 0.8875 | 0.8875   |
+| 0.1359        | 9.0   | 3654 | 0.3031          | 0.8870   | 0.8077   | 0.8818      | 0.8814    | 0.8870 | 0.8870   |
+| 0.1359        | 10.0  | 4060 | 0.3035          | 0.8881   | 0.8086   | 0.8826      | 0.8826    | 0.8881 | 0.8881   |
 ### Framework versions
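
In both versions of the results table the F1 Micro, Recall and Accuracy columns are identical in every row. One plausible explanation, stated here as an assumption rather than something the model card confirms, is that each example receives exactly one predicted label and that Recall is a support-weighted average. Under that assumption micro-averaged F1 reduces to plain accuracy. The sketch below uses made-up toy labels (not the model's evaluation data) and a hypothetical helper `f1_scores` to illustrate the identity with the standard library only.

```python
from collections import Counter


def f1_scores(y_true, y_pred):
    """Return accuracy, micro-, macro- and weighted-F1 for single-label data."""
    labels = sorted(set(y_true) | set(y_pred))
    tp, fp, fn = Counter(), Counter(), Counter()
    for t, p in zip(y_true, y_pred):
        if t == p:
            tp[t] += 1
        else:
            fp[p] += 1  # the predicted class gains a false positive
            fn[t] += 1  # the true class gains a false negative

    def f1(tp_, fp_, fn_):
        denom = 2 * tp_ + fp_ + fn_
        return 2 * tp_ / denom if denom else 0.0

    per_class = {c: f1(tp[c], fp[c], fn[c]) for c in labels}
    support = Counter(y_true)
    n = len(y_true)
    return {
        "accuracy": sum(tp.values()) / n,
        "f1_micro": f1(sum(tp.values()), sum(fp.values()), sum(fn.values())),
        "f1_macro": sum(per_class.values()) / len(labels),
        "f1_weighted": sum(per_class[c] * support[c] for c in labels) / n,
    }


# Toy, imbalanced labels invented for illustration only.
y_true = ["a", "a", "a", "a", "a", "b", "b", "c"]
y_pred = ["a", "a", "a", "a", "b", "b", "a", "a"]
scores = f1_scores(y_true, y_pred)
print(scores)
```

Every misclassification adds exactly one false positive and one false negative, so the pooled counts give micro-F1 equal to accuracy. With imbalanced classes the macro average is pulled down by the rare classes, which matches the ordering seen in the table (F1 Macro below F1 Weighted below F1 Micro).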

adapter_config.json CHANGED

@@ -23,9 +23,9 @@
   "rank_pattern": {},
   "revision": null,
   "target_modules": [
-    "k_lin",
     "q_lin",
-    "v_lin"
   ],
   "task_type": "SEQ_CLS",
   "use_dora": false,

   "rank_pattern": {},
   "revision": null,
   "target_modules": [
+    "v_lin",
     "q_lin",
+    "k_lin"
   ],
   "task_type": "SEQ_CLS",
   "use_dora": false,
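
The `adapter_config.json` hunk above changes only the order of the entries in `target_modules`. Assuming the list is consumed as an unordered collection of module names (this is an assumption about how the config is read, not something the diff states), the change should have no effect on which layers receive the adapter. A minimal way to check that two configs differ only in ordering, using hypothetical variable names and just the fragments visible in the diff:

```python
import json

# Fragments copied from the two sides of the diff; the rest of the file is omitted.
old_cfg = json.loads('{"target_modules": ["k_lin", "q_lin", "v_lin"]}')
new_cfg = json.loads('{"target_modules": ["v_lin", "q_lin", "k_lin"]}')


def same_target_modules(a, b):
    """Compare target_modules as sets, ignoring list order."""
    return set(a["target_modules"]) == set(b["target_modules"])


order_only_change = same_target_modules(old_cfg, new_cfg)
print(order_only_change)
```

If the comparison holds, the differences in the metrics are more likely to come from the retrained weights and training arguments than from this config change.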

adapter_model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:83428c0c4847c3d7b9fd3e39b946436ee665e4ceeddeaac0f78a0da561412275
 size 3268052

 version https://git-lfs.github.com/spec/v1
+oid sha256:d2809a6cfea6ccf854bea549f0dce48f3cd79a4d2bee8c88c39129803b63a659
 size 3268052

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:ea4ef1805460e6423c4cf8a3939aa4616be6e262a52a74d96056b70c2375b8cc
 size 4728

 version https://git-lfs.github.com/spec/v1
+oid sha256:c7e9faa9b323f582e5fbc0aae386cb40cb3e645de7133b573cb73640dbb8ee59
 size 4728
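
The two binary files are tracked with Git LFS, so the diff shows only pointer files: the `oid` is the SHA-256 digest of the real file content and `size` is its length in bytes. Both sizes are unchanged while the digests differ, which means the contents were replaced by files of the same length. The sketch below shows one way to check a downloaded file against such a pointer. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical, and the example parses only the pointer text shown above without fetching the actual file.

```python
import hashlib


def parse_lfs_pointer(text):
    """Parse a Git LFS pointer file into its version, hash algorithm, digest and size."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(data, pointer):
    """Check that raw bytes match the pointer's size and SHA-256 digest."""
    return (
        pointer["algo"] == "sha256"
        and len(data) == pointer["size"]
        and hashlib.sha256(data).hexdigest() == pointer["digest"]
    )


# Pointer text for the new adapter_model.safetensors, as shown in the diff.
pointer = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:d2809a6cfea6ccf854bea549f0dce48f3cd79a4d2bee8c88c39129803b63a659\n"
    "size 3268052\n"
)
print(pointer["algo"], pointer["size"])
```

To verify a local copy, read the file in binary mode and pass its bytes to `matches_pointer` together with the parsed pointer.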