File size: 10,230 Bytes
2285042
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
Visible device: cuda
Seed used: 1
Batch size: 64
Epochs: 40
Learning rate: 1e-05
Entropy weight: 0.01
Regularization weight: 0.0
Only use multiwoz like domains: False
We use: 100.0% of the data
Dialogue order used: 0
Vectorizer: Data set used is multiwoz21
We filter state by active domains: True
Vectorizer: Data set used is multiwoz21
Embedding semantic descriptions: True
Embedded descriptions successfully. Size: torch.Size([338, 768])
Data set used for descriptions: multiwoz21
We use Roberta to embed actions.
Didnt load a model
Start training
Epoch: 0
Average actions: 1.957058072090149
Average target actions: 2.669339895248413
Precision: 0.13822525597269625
Recall: 0.10146667362597213
F1: 0.11702736056346508
<<dialog policy>> epoch 0: saved network to mdl
Best Precision: 0.13822525597269625
Best Recall: 0.10146667362597213
Best F1: 0.11702736056346508
Epoch: 1
Precision: 0.13822525597269625
Recall: 0.10146667362597213
F1: 0.11702736056346508
Best Precision: 0.13822525597269625
Best Recall: 0.10146667362597213
Best F1: 0.11702736056346508
Epoch: 2
Average actions: 2.0794308185577393
Average target actions: 2.6675729751586914
Precision: 0.22303363258743134
Recall: 0.1737564591053813
F1: 0.19533519143318176
<<dialog policy>> epoch 2: saved network to mdl
Best Precision: 0.22303363258743134
Best Recall: 0.1737564591053813
Best F1: 0.19533519143318176
Epoch: 3
Precision: 0.22303363258743134
Recall: 0.1737564591053813
F1: 0.19533519143318176
Best Precision: 0.22303363258743134
Best Recall: 0.1737564591053813
Best F1: 0.19533519143318176
Epoch: 4
Average actions: 2.0110926628112793
Average target actions: 2.665806293487549
Precision: 0.26409084614319345
Recall: 0.19907093272091445
F1: 0.22701705306389688
<<dialog policy>> epoch 4: saved network to mdl
Best Precision: 0.26409084614319345
Best Recall: 0.19907093272091445
Best F1: 0.22701705306389688
Epoch: 5
Precision: 0.26409084614319345
Recall: 0.19907093272091445
F1: 0.22701705306389688
Best Precision: 0.26409084614319345
Best Recall: 0.19907093272091445
Best F1: 0.22701705306389688
Epoch: 6
Average actions: 1.9673057794570923
Average target actions: 2.667219877243042
Precision: 0.2910210146465719
Recall: 0.21467717521791324
F1: 0.2470863871200288
<<dialog policy>> epoch 6: saved network to mdl
Best Precision: 0.2910210146465719
Best Recall: 0.21467717521791324
Best F1: 0.2470863871200288
Epoch: 7
Precision: 0.2910210146465719
Recall: 0.21467717521791324
F1: 0.2470863871200288
Best Precision: 0.2910210146465719
Best Recall: 0.21467717521791324
Best F1: 0.2470863871200288
Epoch: 8
Average actions: 1.8258512020111084
Average target actions: 2.667926549911499
Precision: 0.30450038138825325
Recall: 0.20836160551176994
F1: 0.24742012457776819
<<dialog policy>> epoch 8: saved network to mdl
Best Precision: 0.30450038138825325
Best Recall: 0.21467717521791324
Best F1: 0.24742012457776819
Epoch: 9
Precision: 0.30450038138825325
Recall: 0.20836160551176994
F1: 0.24742012457776819
Best Precision: 0.30450038138825325
Best Recall: 0.21467717521791324
Best F1: 0.24742012457776819
Epoch: 10
Average actions: 1.7796674966812134
Average target actions: 2.66333270072937
Precision: 0.3297132588483475
Recall: 0.2202620178506185
F1: 0.2640966268227048
<<dialog policy>> epoch 10: saved network to mdl
Best Precision: 0.3297132588483475
Best Recall: 0.2202620178506185
Best F1: 0.2640966268227048
Epoch: 11
Precision: 0.3297132588483475
Recall: 0.2202620178506185
F1: 0.2640966268227048
Best Precision: 0.3297132588483475
Best Recall: 0.2202620178506185
Best F1: 0.2640966268227048
Epoch: 12
Average actions: 1.8398014307022095
Average target actions: 2.67004656791687
Precision: 0.34064769975786924
Recall: 0.23498094890129964
F1: 0.27811583011583013
<<dialog policy>> epoch 12: saved network to mdl
Best Precision: 0.34064769975786924
Best Recall: 0.23498094890129964
Best F1: 0.27811583011583013
Epoch: 13
Precision: 0.34064769975786924
Recall: 0.23498094890129964
F1: 0.27811583011583013
Best Precision: 0.34064769975786924
Best Recall: 0.23498094890129964
Best F1: 0.27811583011583013
Epoch: 14
Average actions: 1.7070426940917969
Average target actions: 2.667219877243042
Precision: 0.35462034091835903
Recall: 0.22694295109348087
F1: 0.2767663908338638
Best Precision: 0.35462034091835903
Best Recall: 0.23498094890129964
Best F1: 0.27811583011583013
Epoch: 15
Precision: 0.35462034091835903
Recall: 0.22694295109348087
F1: 0.2767663908338638
Best Precision: 0.35462034091835903
Best Recall: 0.23498094890129964
Best F1: 0.27811583011583013
Epoch: 16
Average actions: 1.6812468767166138
Average target actions: 2.6643927097320557
Precision: 0.34859650575474044
Recall: 0.21974006994101988
F1: 0.2695607632219234
Best Precision: 0.35462034091835903
Best Recall: 0.23498094890129964
Best F1: 0.27811583011583013
Epoch: 17
Precision: 0.34859650575474044
Recall: 0.21974006994101988
F1: 0.2695607632219234
Best Precision: 0.35462034091835903
Best Recall: 0.23498094890129964
Best F1: 0.27811583011583013
Epoch: 18
Average actions: 1.675270438194275
Average target actions: 2.6640396118164062
Precision: 0.35976419794088343
Recall: 0.22616002922908293
F1: 0.27772970547703746
Best Precision: 0.35976419794088343
Best Recall: 0.23498094890129964
Best F1: 0.27811583011583013
Epoch: 19
Precision: 0.35976419794088343
Recall: 0.22616002922908293
F1: 0.27772970547703746
Best Precision: 0.35976419794088343
Best Recall: 0.23498094890129964
Best F1: 0.27811583011583013
Epoch: 20
Average actions: 1.5666790008544922
Average target actions: 2.6647462844848633
Precision: 0.3769442716203004
Recall: 0.2213581084607756
F1: 0.27892140743176586
<<dialog policy>> epoch 20: saved network to mdl
Best Precision: 0.3769442716203004
Best Recall: 0.23498094890129964
Best F1: 0.27892140743176586
Epoch: 21
Precision: 0.3769442716203004
Recall: 0.2213581084607756
F1: 0.27892140743176586
Best Precision: 0.3769442716203004
Best Recall: 0.23498094890129964
Best F1: 0.27892140743176586
Epoch: 22
Average actions: 1.6693706512451172
Average target actions: 2.6661596298217773
Precision: 0.3716379382130069
Recall: 0.23294535205386502
F1: 0.2863834702258727
<<dialog policy>> epoch 22: saved network to mdl
Best Precision: 0.3769442716203004
Best Recall: 0.23498094890129964
Best F1: 0.2863834702258727
Epoch: 23
Precision: 0.3716379382130069
Recall: 0.23294535205386502
F1: 0.2863834702258727
Best Precision: 0.3769442716203004
Best Recall: 0.23498094890129964
Best F1: 0.2863834702258727
Epoch: 24
Average actions: 1.6701388359069824
Average target actions: 2.6643927097320557
Precision: 0.3714618714618715
Recall: 0.23289315726290516
F1: 0.2862917455327067
Best Precision: 0.3769442716203004
Best Recall: 0.23498094890129964
Best F1: 0.2863834702258727
Epoch: 25
Precision: 0.3714618714618715
Recall: 0.23289315726290516
F1: 0.2862917455327067
Best Precision: 0.3769442716203004
Best Recall: 0.23498094890129964
Best F1: 0.2863834702258727
Epoch: 26
Average actions: 1.6909722089767456
Average target actions: 2.665099620819092
Precision: 0.3781160016454134
Recall: 0.2398872592515267
F1: 0.2935428242958421
<<dialog policy>> epoch 26: saved network to mdl
Best Precision: 0.3781160016454134
Best Recall: 0.2398872592515267
Best F1: 0.2935428242958421
Epoch: 27
Precision: 0.3781160016454134
Recall: 0.2398872592515267
F1: 0.2935428242958421
Best Precision: 0.3781160016454134
Best Recall: 0.2398872592515267
Best F1: 0.2935428242958421
Epoch: 28
Average actions: 1.8047566413879395
Average target actions: 2.6643927097320557
Precision: 0.3654779326811985
Recall: 0.24766428310454616
F1: 0.29525231783958683
<<dialog policy>> epoch 28: saved network to mdl
Best Precision: 0.3781160016454134
Best Recall: 0.24766428310454616
Best F1: 0.29525231783958683
Epoch: 29
Precision: 0.3654779326811985
Recall: 0.24766428310454616
F1: 0.29525231783958683
Best Precision: 0.3781160016454134
Best Recall: 0.24766428310454616
Best F1: 0.29525231783958683
Epoch: 30
Average actions: 1.680601716041565
Average target actions: 2.6640396118164062
Precision: 0.37665562913907286
Recall: 0.23748629886737305
F1: 0.2913025384935497
Best Precision: 0.3781160016454134
Best Recall: 0.24766428310454616
Best F1: 0.29525231783958683
Epoch: 31
Precision: 0.37665562913907286
Recall: 0.23748629886737305
F1: 0.2913025384935497
Best Precision: 0.3781160016454134
Best Recall: 0.24766428310454616
Best F1: 0.29525231783958683
Epoch: 32
Average actions: 1.7778853178024292
Average target actions: 2.667219877243042
Precision: 0.3660120491354354
Recall: 0.2441672321102354
F1: 0.2929242329367564
Best Precision: 0.3781160016454134
Best Recall: 0.24766428310454616
Best F1: 0.29525231783958683
Epoch: 33
Precision: 0.3660120491354354
Recall: 0.2441672321102354
F1: 0.2929242329367564
Best Precision: 0.3781160016454134
Best Recall: 0.24766428310454616
Best F1: 0.29525231783958683
Epoch: 34
Average actions: 1.726846694946289
Average target actions: 2.66333270072937
Precision: 0.3723121526938874
Recall: 0.24129651860744297
F1: 0.29281732961743095
Best Precision: 0.3781160016454134
Best Recall: 0.24766428310454616
Best F1: 0.29525231783958683
Epoch: 35
Precision: 0.3723121526938874
Recall: 0.24129651860744297
F1: 0.29281732961743095
Best Precision: 0.3781160016454134
Best Recall: 0.24766428310454616
Best F1: 0.29525231783958683
Epoch: 36
Average actions: 1.8067078590393066
Average target actions: 2.6675729751586914
Precision: 0.37099753694581283
Recall: 0.2515788924265358
F1: 0.29983515287238344
<<dialog policy>> epoch 36: saved network to mdl
Best Precision: 0.3781160016454134
Best Recall: 0.2515788924265358
Best F1: 0.29983515287238344
Epoch: 37
Precision: 0.37099753694581283
Recall: 0.2515788924265358
F1: 0.29983515287238344
Best Precision: 0.3781160016454134
Best Recall: 0.2515788924265358
Best F1: 0.29983515287238344
Epoch: 38
Average actions: 1.7964909076690674
Average target actions: 2.6647462844848633
Precision: 0.36536823356307596
Recall: 0.2462550237486299
F1: 0.2942130207034173
Best Precision: 0.3781160016454134
Best Recall: 0.2515788924265358
Best F1: 0.29983515287238344
Epoch: 39
Precision: 0.36536823356307596
Recall: 0.2462550237486299
F1: 0.2942130207034173
Best Precision: 0.3781160016454134
Best Recall: 0.2515788924265358
Best F1: 0.29983515287238344