Ngit committed on
Commit 1ec392d · verified · 1 Parent(s): ba5180e

Update README.md

  1. README.md +52 -95
README.md CHANGED
@@ -13,98 +13,55 @@ model-index:
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
  should probably proofread and complete it, then remove this comment. -->
 
- # MiniLMv2-L6-H384-distilled-from-RoBERTa-Large-agentflow-distil
-
- This model is a fine-tuned version of [nreimers/MiniLMv2-L6-H384-distilled-from-RoBERTa-Large](https://huggingface.co/nreimers/MiniLMv2-L6-H384-distilled-from-RoBERTa-Large) on the None dataset.
- It achieves the following results on the evaluation set:
- - Loss: 0.1540
- - Accuracy: 0.9616
- - F1: 0.9618
-
- ## Model description
-
- More information needed
-
- ## Intended uses & limitations
-
- More information needed
-
- ## Training and evaluation data
-
- More information needed
-
- ## Training procedure
-
- ### Training hyperparameters
-
- The following hyperparameters were used during training:
- - learning_rate: 7e-05
- - train_batch_size: 10
- - eval_batch_size: 10
- - seed: 42
- - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- - lr_scheduler_type: linear
- - lr_scheduler_warmup_ratio: 0.1
- - num_epochs: 8
-
- ### Training results
-
- | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1 |
- |:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|
- | No log | 0.07 | 30 | 3.4249 | 0.1510 | 0.0404 |
- | No log | 0.13 | 60 | 3.3994 | 0.2779 | 0.1759 |
- | No log | 0.2 | 90 | 3.3313 | 0.3423 | 0.2154 |
- | No log | 0.27 | 120 | 3.1475 | 0.3977 | 0.3024 |
- | No log | 0.33 | 150 | 2.8961 | 0.3494 | 0.2370 |
- | No log | 0.4 | 180 | 2.6867 | 0.5147 | 0.4325 |
- | No log | 0.47 | 210 | 2.4676 | 0.5728 | 0.4955 |
- | No log | 0.54 | 240 | 2.2129 | 0.5657 | 0.4588 |
- | No log | 0.6 | 270 | 1.9712 | 0.6917 | 0.6331 |
- | No log | 0.67 | 300 | 1.8016 | 0.6533 | 0.5799 |
- | No log | 0.74 | 330 | 1.5721 | 0.7185 | 0.6524 |
- | No log | 0.8 | 360 | 1.3381 | 0.8061 | 0.7760 |
- | No log | 0.87 | 390 | 1.1876 | 0.8543 | 0.8319 |
- | No log | 0.94 | 420 | 0.9877 | 0.8722 | 0.8577 |
- | No log | 1.0 | 450 | 0.8819 | 0.8892 | 0.8850 |
- | No log | 1.07 | 480 | 0.7511 | 0.8972 | 0.8955 |
- | 2.2047 | 1.14 | 510 | 0.5262 | 0.9410 | 0.9408 |
- | 2.2047 | 1.21 | 540 | 0.5107 | 0.9294 | 0.9297 |
- | 2.2047 | 1.27 | 570 | 0.4612 | 0.9285 | 0.9292 |
- | 2.2047 | 1.34 | 600 | 0.3487 | 0.9410 | 0.9407 |
- | 2.2047 | 1.41 | 630 | 0.3137 | 0.9374 | 0.9369 |
- | 2.2047 | 1.47 | 660 | 0.2951 | 0.9223 | 0.9190 |
- | 2.2047 | 1.54 | 690 | 0.2738 | 0.9374 | 0.9377 |
- | 2.2047 | 1.61 | 720 | 0.2472 | 0.9446 | 0.9439 |
- | 2.2047 | 1.67 | 750 | 0.1988 | 0.9535 | 0.9530 |
- | 2.2047 | 1.74 | 780 | 0.2016 | 0.9517 | 0.9519 |
- | 2.2047 | 1.81 | 810 | 0.2158 | 0.9428 | 0.9427 |
- | 2.2047 | 1.88 | 840 | 0.2519 | 0.9330 | 0.9324 |
- | 2.2047 | 1.94 | 870 | 0.2224 | 0.9437 | 0.9436 |
- | 2.2047 | 2.01 | 900 | 0.3032 | 0.9285 | 0.9276 |
- | 2.2047 | 2.08 | 930 | 0.1815 | 0.9544 | 0.9546 |
- | 2.2047 | 2.14 | 960 | 0.2125 | 0.9455 | 0.9455 |
- | 2.2047 | 2.21 | 990 | 0.2198 | 0.9455 | 0.9446 |
- | 0.2888 | 2.28 | 1020 | 0.1869 | 0.9571 | 0.9568 |
- | 0.2888 | 2.34 | 1050 | 0.1705 | 0.9571 | 0.9568 |
- | 0.2888 | 2.41 | 1080 | 0.1927 | 0.9526 | 0.9523 |
- | 0.2888 | 2.48 | 1110 | 0.1700 | 0.9562 | 0.9561 |
- | 0.2888 | 2.54 | 1140 | 0.2162 | 0.9464 | 0.9460 |
- | 0.2888 | 2.61 | 1170 | 0.1540 | 0.9616 | 0.9618 |
- | 0.2888 | 2.68 | 1200 | 0.1752 | 0.9562 | 0.9561 |
- | 0.2888 | 2.75 | 1230 | 0.1476 | 0.9607 | 0.9605 |
- | 0.2888 | 2.81 | 1260 | 0.2575 | 0.9410 | 0.9414 |
- | 0.2888 | 2.88 | 1290 | 0.1574 | 0.9616 | 0.9614 |
- | 0.2888 | 2.95 | 1320 | 0.1574 | 0.9598 | 0.9596 |
- | 0.2888 | 3.01 | 1350 | 0.1640 | 0.9580 | 0.9578 |
- | 0.2888 | 3.08 | 1380 | 0.1627 | 0.9598 | 0.9594 |
- | 0.2888 | 3.15 | 1410 | 0.1866 | 0.9544 | 0.9550 |
- | 0.2888 | 3.21 | 1440 | 0.1610 | 0.9526 | 0.9526 |
- | 0.2888 | 3.28 | 1470 | 0.2134 | 0.9419 | 0.9412 |
-
-
- ### Framework versions
-
- - Transformers 4.37.0
- - Pytorch 2.1.2
- - Datasets 2.1.0
- - Tokenizers 0.15.1
 
+ # User Flow Text Classification
+
+ This model is a fine-tuned version of [nreimers/MiniLMv2-L6-H384-distilled-from-RoBERTa-Large](https://huggingface.co/nreimers/MiniLMv2-L6-H384-distilled-from-RoBERTa-Large).
+ The quantized version in ONNX format can be found [here](https://huggingface.co/minuva/MiniLMv2-agentflow-v2-onnx).
+
+ The model identifies common events and patterns within the conversation flow. Such events include an apology, where the agent acknowledges a mistake, and a complaint, where a user expresses dissatisfaction.
+
+ This model should be used only for agent dialogs.
+ # Load the Model
+
+ ```py
+ from transformers import pipeline
+
+ pipe = pipeline(model='minuva/MiniLMv2-agentflow-v2', task='text-classification')
+ pipe("thats my mistake")
+ # [{'label': 'agent_apology_error_mistake', 'score': 0.9965628981590271}]
+ ```
+ # Categories Explanation
+
+ <details>
+ <summary>Click to expand!</summary>
+
+ - OTHER: Responses or actions by the agent that do not fit into the predefined categories or are outside the scope of the specific interactions listed.
+
+ - agent_apology_error_mistake: The agent acknowledges an error or mistake in the information provided or in the handling of the request.
+
+ - agent_apology_unsatisfactory: The agent apologizes for providing an unsatisfactory response or for any dissatisfaction experienced by the user.
+
+ - agent_didnt_understand: The agent indicates that it did not understand the user's request or question.
+
+ - agent_limited_capabilities: The agent communicates its limitations in addressing certain requests or providing certain types of information.
+
+ - agent_refuses_answer: The agent explicitly refuses to answer a question or fulfill a request, due to policy restrictions or ethical considerations.
+
+ - image_limitations: The agent points out limitations related to handling or interpreting images.
+
+ - no_information_doesnt_know: The agent indicates that it has no information available or does not know the answer to the user's question.
+
+ - success_and_followup_assistance: The agent successfully provides the requested information or service and offers further assistance or follow-up actions if needed.
+ </details>
+
+ <br>
+
+
+ # Metrics on our private test dataset
+ | Model (params) | Loss | Accuracy | F1 |
+ |--------------------|-------------|----------|--------|
+ | minuva/MiniLMv2-agentflow-v2 (33M) | 0.1540 | 0.9616 | 0.9618 |
+
+ # Deployment
+
+ See [our repository](https://github.com/minuva/flow-cloudrun) for how to deploy the quantized version of this model in a serverless environment with fast CPU inference and low resource usage.
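
The new README shows the pipeline returning a list such as `[{'label': ..., 'score': ...}]`. As a minimal sketch of how that output might be turned into something readable, the snippet below maps the top label to a short description and falls back to `OTHER` when the score is low. The names `CATEGORY_DESCRIPTIONS` and `describe_top_prediction`, the paraphrased descriptions, the `0.5` threshold, and the fallback behaviour are all assumptions made for illustration; none of them come from the model card or from `transformers`. It also assumes the model's label strings match the category names listed in the README.

```python
from typing import Dict, List

# Short descriptions paraphrased from the "Categories Explanation" list.
# Hypothetical helper data: the dictionary and its wording are illustrative only.
CATEGORY_DESCRIPTIONS: Dict[str, str] = {
    "OTHER": "Does not fit any predefined category.",
    "agent_apology_error_mistake": "Agent acknowledges an error or mistake.",
    "agent_apology_unsatisfactory": "Agent apologizes for an unsatisfactory response.",
    "agent_didnt_understand": "Agent did not understand the request.",
    "agent_limited_capabilities": "Agent communicates its limitations.",
    "agent_refuses_answer": "Agent explicitly refuses to answer.",
    "image_limitations": "Agent points out limits in handling images.",
    "no_information_doesnt_know": "Agent has no information or does not know.",
    "success_and_followup_assistance": "Agent succeeds and offers further help.",
}


def describe_top_prediction(predictions: List[dict], min_score: float = 0.5) -> dict:
    """Pick the highest-scoring prediction and attach a description.

    `predictions` has the shape shown in the README example:
    a list of {'label': str, 'score': float} dictionaries.
    Falling back to "OTHER" below `min_score` is an assumed policy.
    """
    if not predictions:
        raise ValueError("predictions must contain at least one entry")
    top = max(predictions, key=lambda p: p["score"])
    label = top["label"] if top["score"] >= min_score else "OTHER"
    return {
        "label": label,
        "score": top["score"],
        "description": CATEGORY_DESCRIPTIONS.get(label, "Unknown label."),
    }


if __name__ == "__main__":
    # Sample taken from the README's example output; no model download needed.
    sample = [{"label": "agent_apology_error_mistake", "score": 0.9965628981590271}]
    print(describe_top_prediction(sample))
```

In practice the `predictions` argument would be the value returned by `pipe(...)` in the README example; the threshold is worth tuning against real agent dialogs rather than taken as given.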