| Visible device: cuda | |
| Seed used: 0 | |
| Batch size: 64 | |
| Epochs: 40 | |
| Learning rate: 1e-05 | |
| Entropy weight: 0.01 | |
| Regularization weight: 0.0 | |
| Only use multiwoz like domains: False | |
| We use: 1.0% of the data | |
| Dialogue order used: 0 | |
| Vectorizer: Data set used is multiwoz21 | |
| We filter state by active domains: True | |
| Vectorizer: Data set used is multiwoz21 | |
| Embedding semantic descriptions: True | |
| Embedded descriptions successfully. Size: torch.Size([338, 768]) | |
| Data set used for descriptions: multiwoz21 | |
| We use Roberta to embed actions. | |
| Loaded model from experiments/seed0/save/supervised.pol.mdl | |
| Start training | |
| Epoch: 0 | |
| Average actions: 1.9973957538604736 | |
| Average target actions: 2.5520834922790527 | |
| Precision: 0.09615384615384616 | |
| Recall: 0.07462686567164178 | |
| F1: 0.08403361344537816 | |
| <<dialog policy>> epoch 0: saved network to mdl | |
| Best Precision: 0.09615384615384616 | |
| Best Recall: 0.07462686567164178 | |
| Best F1: 0.08403361344537816 | |
| Epoch: 1 | |
| Precision: 0.09615384615384616 | |
| Recall: 0.07462686567164178 | |
| F1: 0.08403361344537816 | |
| Best Precision: 0.09615384615384616 | |
| Best Recall: 0.07462686567164178 | |
| Best F1: 0.08403361344537816 | |
| Epoch: 2 | |
| Average actions: 2.3515625 | |
| Average target actions: 2.6197917461395264 | |
| Precision: 0.10526315789473684 | |
| Recall: 0.08955223880597014 | |
| F1: 0.0967741935483871 | |
| <<dialog policy>> epoch 2: saved network to mdl | |
| Best Precision: 0.10526315789473684 | |
| Best Recall: 0.08955223880597014 | |
| Best F1: 0.0967741935483871 | |
| Epoch: 3 | |
| Precision: 0.10526315789473684 | |
| Recall: 0.08955223880597014 | |
| F1: 0.0967741935483871 | |
| Best Precision: 0.10526315789473684 | |
| Best Recall: 0.08955223880597014 | |
| Best F1: 0.0967741935483871 | |
| Epoch: 4 | |
| Average actions: 1.6770832538604736 | |
| Average target actions: 2.8567709922790527 | |
| Precision: 0.1347517730496454 | |
| Recall: 0.0945273631840796 | |
| F1: 0.11111111111111112 | |
| <<dialog policy>> epoch 4: saved network to mdl | |
| Best Precision: 0.1347517730496454 | |
| Best Recall: 0.0945273631840796 | |
| Best F1: 0.11111111111111112 | |
| Epoch: 5 | |
| Precision: 0.1347517730496454 | |
| Recall: 0.0945273631840796 | |
| F1: 0.11111111111111112 | |
| Best Precision: 0.1347517730496454 | |
| Best Recall: 0.0945273631840796 | |
| Best F1: 0.11111111111111112 | |
| Epoch: 6 | |
| Average actions: 1.9088542461395264 | |
| Average target actions: 2.7213542461395264 | |
| Precision: 0.12080536912751678 | |
| Recall: 0.08955223880597014 | |
| F1: 0.10285714285714286 | |
| Best Precision: 0.1347517730496454 | |
| Best Recall: 0.0945273631840796 | |
| Best F1: 0.11111111111111112 | |
| Epoch: 7 | |
| Precision: 0.12080536912751678 | |
| Recall: 0.08955223880597014 | |
| F1: 0.10285714285714286 | |
| Best Precision: 0.1347517730496454 | |
| Best Recall: 0.0945273631840796 | |
| Best F1: 0.11111111111111112 | |
| Epoch: 8 | |
| Average actions: 2.0572915077209473 | |
| Average target actions: 2.8229167461395264 | |
| Precision: 0.12903225806451613 | |
| Recall: 0.09950248756218906 | |
| F1: 0.11235955056179776 | |
| <<dialog policy>> epoch 8: saved network to mdl | |
| Best Precision: 0.1347517730496454 | |
| Best Recall: 0.09950248756218906 | |
| Best F1: 0.11235955056179776 | |
| Epoch: 9 | |
| Precision: 0.12903225806451613 | |
| Recall: 0.09950248756218906 | |
| F1: 0.11235955056179776 | |
| Best Precision: 0.1347517730496454 | |
| Best Recall: 0.09950248756218906 | |
| Best F1: 0.11235955056179776 | |
| Epoch: 10 | |
| Average actions: 2.0911459922790527 | |
| Average target actions: 2.6875 | |
| Precision: 0.11612903225806452 | |
| Recall: 0.08955223880597014 | |
| F1: 0.10112359550561797 | |
| Best Precision: 0.1347517730496454 | |
| Best Recall: 0.09950248756218906 | |
| Best F1: 0.11235955056179776 | |
| Epoch: 11 | |
| Precision: 0.11612903225806452 | |
| Recall: 0.08955223880597014 | |
| F1: 0.10112359550561797 | |
| Best Precision: 0.1347517730496454 | |
| Best Recall: 0.09950248756218906 | |
| Best F1: 0.11235955056179776 | |
| Epoch: 12 | |
| Average actions: 2.0833332538604736 | |
| Average target actions: 2.5859375 | |
| Precision: 0.11976047904191617 | |
| Recall: 0.09950248756218906 | |
| F1: 0.10869565217391305 | |
| Best Precision: 0.1347517730496454 | |
| Best Recall: 0.09950248756218906 | |
| Best F1: 0.11235955056179776 | |
| Epoch: 13 | |
| Precision: 0.11976047904191617 | |
| Recall: 0.09950248756218906 | |
| F1: 0.10869565217391305 | |
| Best Precision: 0.1347517730496454 | |
| Best Recall: 0.09950248756218906 | |
| Best F1: 0.11235955056179776 | |
| Epoch: 14 | |
| Average actions: 2.1119790077209473 | |
| Average target actions: 2.7213542461395264 | |
| Precision: 0.16778523489932887 | |
| Recall: 0.12437810945273632 | |
| F1: 0.14285714285714285 | |
| <<dialog policy>> epoch 14: saved network to mdl | |
| Best Precision: 0.16778523489932887 | |
| Best Recall: 0.12437810945273632 | |
| Best F1: 0.14285714285714285 | |
| Epoch: 15 | |
| Precision: 0.16778523489932887 | |
| Recall: 0.12437810945273632 | |
| F1: 0.14285714285714285 | |
| Best Precision: 0.16778523489932887 | |
| Best Recall: 0.12437810945273632 | |
| Best F1: 0.14285714285714285 | |
| Epoch: 16 | |
| Average actions: 1.7994792461395264 | |
| Average target actions: 2.5520834922790527 | |
| Precision: 0.10135135135135136 | |
| Recall: 0.07462686567164178 | |
| F1: 0.08595988538681948 | |
| Best Precision: 0.16778523489932887 | |
| Best Recall: 0.12437810945273632 | |
| Best F1: 0.14285714285714285 | |
| Epoch: 17 | |
| Precision: 0.10135135135135136 | |
| Recall: 0.07462686567164178 | |
| F1: 0.08595988538681948 | |
| Best Precision: 0.16778523489932887 | |
| Best Recall: 0.12437810945273632 | |
| Best F1: 0.14285714285714285 | |
| Epoch: 18 | |
| Average actions: 2.0572915077209473 | |
| Average target actions: 2.7552084922790527 | |
| Precision: 0.13548387096774195 | |
| Recall: 0.1044776119402985 | |
| F1: 0.11797752808988765 | |
| Best Precision: 0.16778523489932887 | |
| Best Recall: 0.12437810945273632 | |
| Best F1: 0.14285714285714285 | |
| Epoch: 19 | |
| Precision: 0.13548387096774195 | |
| Recall: 0.1044776119402985 | |
| F1: 0.11797752808988765 | |
| Best Precision: 0.16778523489932887 | |
| Best Recall: 0.12437810945273632 | |
| Best F1: 0.14285714285714285 | |
| Epoch: 20 | |
| Average actions: 1.9661457538604736 | |
| Average target actions: 2.7213542461395264 | |
| Precision: 0.1118421052631579 | |
| Recall: 0.0845771144278607 | |
| F1: 0.0963172804532578 | |
| Best Precision: 0.16778523489932887 | |
| Best Recall: 0.12437810945273632 | |
| Best F1: 0.14285714285714285 | |
| Epoch: 21 | |
| Precision: 0.1118421052631579 | |
| Recall: 0.0845771144278607 | |
| F1: 0.0963172804532578 | |
| Best Precision: 0.16778523489932887 | |
| Best Recall: 0.12437810945273632 | |
| Best F1: 0.14285714285714285 | |
| Epoch: 22 | |
| Average actions: 1.9557292461395264 | |
| Average target actions: 2.5520834922790527 | |
| Precision: 0.07741935483870968 | |
| Recall: 0.05970149253731343 | |
| F1: 0.06741573033707865 | |
| Best Precision: 0.16778523489932887 | |
| Best Recall: 0.12437810945273632 | |
| Best F1: 0.14285714285714285 | |
| Epoch: 23 | |
| Precision: 0.07741935483870968 | |
| Recall: 0.05970149253731343 | |
| F1: 0.06741573033707865 | |
| Best Precision: 0.16778523489932887 | |
| Best Recall: 0.12437810945273632 | |
| Best F1: 0.14285714285714285 | |
| Epoch: 24 | |
| Average actions: 2.0833334922790527 | |
| Average target actions: 2.8229167461395264 | |
| Precision: 0.09090909090909091 | |
| Recall: 0.06965174129353234 | |
| F1: 0.07887323943661972 | |
| Best Precision: 0.16778523489932887 | |
| Best Recall: 0.12437810945273632 | |
| Best F1: 0.14285714285714285 | |
| Epoch: 25 | |
| Precision: 0.09090909090909091 | |
| Recall: 0.06965174129353234 | |
| F1: 0.07887323943661972 | |
| Best Precision: 0.16778523489932887 | |
| Best Recall: 0.12437810945273632 | |
| Best F1: 0.14285714285714285 | |
| Epoch: 26 | |
| Average actions: 1.7135417461395264 | |
| Average target actions: 2.6197917461395264 | |
| Precision: 0.145985401459854 | |
| Recall: 0.09950248756218906 | |
| F1: 0.1183431952662722 | |
| Best Precision: 0.16778523489932887 | |
| Best Recall: 0.12437810945273632 | |
| Best F1: 0.14285714285714285 | |
| Epoch: 27 | |
| Precision: 0.145985401459854 | |
| Recall: 0.09950248756218906 | |
| F1: 0.1183431952662722 | |
| Best Precision: 0.16778523489932887 | |
| Best Recall: 0.12437810945273632 | |
| Best F1: 0.14285714285714285 | |
| Epoch: 28 | |
| Average actions: 2.0364584922790527 | |
| Average target actions: 2.5520834922790527 | |
| Precision: 0.16891891891891891 | |
| Recall: 0.12437810945273632 | |
| F1: 0.14326647564469916 | |
| <<dialog policy>> epoch 28: saved network to mdl | |
| Best Precision: 0.16891891891891891 | |
| Best Recall: 0.12437810945273632 | |
| Best F1: 0.14326647564469916 | |
| Epoch: 29 | |
| Precision: 0.16891891891891891 | |
| Recall: 0.12437810945273632 | |
| F1: 0.14326647564469916 | |
| Best Precision: 0.16891891891891891 | |
| Best Recall: 0.12437810945273632 | |
| Best F1: 0.14326647564469916 | |
| Epoch: 30 | |
| Average actions: 2.0026040077209473 | |
| Average target actions: 2.3828125 | |
| Precision: 0.16216216216216217 | |
| Recall: 0.11940298507462686 | |
| F1: 0.13753581661891118 | |
| Best Precision: 0.16891891891891891 | |
| Best Recall: 0.12437810945273632 | |
| Best F1: 0.14326647564469916 | |
| Epoch: 31 | |
| Precision: 0.16216216216216217 | |
| Recall: 0.11940298507462686 | |
| F1: 0.13753581661891118 | |
| Best Precision: 0.16891891891891891 | |
| Best Recall: 0.12437810945273632 | |
| Best F1: 0.14326647564469916 | |
| Epoch: 32 | |
| Average actions: 1.8046875 | |
| Average target actions: 2.6875 | |
| Precision: 0.12142857142857143 | |
| Recall: 0.0845771144278607 | |
| F1: 0.09970674486803519 | |
| Best Precision: 0.16891891891891891 | |
| Best Recall: 0.12437810945273632 | |
| Best F1: 0.14326647564469916 | |
| Epoch: 33 | |
| Precision: 0.12142857142857143 | |
| Recall: 0.0845771144278607 | |
| F1: 0.09970674486803519 | |
| Best Precision: 0.16891891891891891 | |
| Best Recall: 0.12437810945273632 | |
| Best F1: 0.14326647564469916 | |
| Epoch: 34 | |
| Average actions: 1.9348957538604736 | |
| Average target actions: 2.6875 | |
| Precision: 0.12162162162162163 | |
| Recall: 0.08955223880597014 | |
| F1: 0.10315186246418337 | |
| Best Precision: 0.16891891891891891 | |
| Best Recall: 0.12437810945273632 | |
| Best F1: 0.14326647564469916 | |
| Epoch: 35 | |
| Precision: 0.12162162162162163 | |
| Recall: 0.08955223880597014 | |
| F1: 0.10315186246418337 | |
| Best Precision: 0.16891891891891891 | |
| Best Recall: 0.12437810945273632 | |
| Best F1: 0.14326647564469916 | |
| Epoch: 36 | |
| Average actions: 2.0989584922790527 | |
| Average target actions: 2.484375 | |
| Precision: 0.14743589743589744 | |
| Recall: 0.11442786069651742 | |
| F1: 0.1288515406162465 | |
| Best Precision: 0.16891891891891891 | |
| Best Recall: 0.12437810945273632 | |
| Best F1: 0.14326647564469916 | |
| Epoch: 37 | |
| Precision: 0.14743589743589744 | |
| Recall: 0.11442786069651742 | |
| F1: 0.1288515406162465 | |
| Best Precision: 0.16891891891891891 | |
| Best Recall: 0.12437810945273632 | |
| Best F1: 0.14326647564469916 | |
| Epoch: 38 | |
| Average actions: 2.0260415077209473 | |
| Average target actions: 2.5520834922790527 | |
| Precision: 0.1456953642384106 | |
| Recall: 0.10945273631840796 | |
| F1: 0.12499999999999997 | |
| Best Precision: 0.16891891891891891 | |
| Best Recall: 0.12437810945273632 | |
| Best F1: 0.14326647564469916 | |
| Epoch: 39 | |
| Precision: 0.1456953642384106 | |
| Recall: 0.10945273631840796 | |
| F1: 0.12499999999999997 | |
| Best Precision: 0.16891891891891891 | |
| Best Recall: 0.12437810945273632 | |
| Best F1: 0.14326647564469916 | |