kyujinpy/KO-Platypus2-7B-ex

@@ -9,7 +9,7 @@ pipeline_tag: text-generation
 license: cc-by-nc-4.0
 ---
-# **Ko-Platypus2-13B**
 **More detail repo(Github): [KO-Platypus](https://github.com/Marker-Inc-Korea/KO-Platypus)**
 ![KO-Platypus2-13B](./KO_platypus.png)
@@ -64,7 +64,7 @@ I use A100 GPU 40GB and COLAB, when training.
 > Question Answering (QA)
 ### COPA (F1)
 | Model | 0-shot | 5-shot | 10-shot | 50-shot |
 | --- | --- | --- | --- | --- |
 | [Polyglot-ko-1.3b](https://huggingface.co/EleutherAI/polyglot-ko-1.3b) | 0.7196 | 0.7193 | 0.7204 | 0.7206 |
@@ -74,11 +74,11 @@ I use A100 GPU 40GB and COLAB, when training.
 | [Llama-2-Ko-7b 20B](https://huggingface.co/beomi/llama-2-ko-7b) | 0.7388 | 0.7626 | 0.7808 | 0.7979 |
 | [Llama-2-Ko-7b 40B](https://huggingface.co/beomi/llama-2-ko-7b) | 0.7436 | 0.7927 | 0.8037 | 0.8259 |
 | [KO-platypus2-13B](https://huggingface.co/kyujinpy/KO-Platypus2-13B) | 0.5820 | 0.6269 | 0.6267 | 0.6527 |
-| **KO-platypus2-7B-EX(ours)** | NaN | NaN | NaN | NaN |
 > Natural Language Inference (NLI) evaluation
 ### HellaSwag (F1)
 | Model | 0-shot | 5-shot | 10-shot | 50-shot |
 | --- | --- | --- | --- | --- |
 | [Polyglot-ko-1.3b](https://huggingface.co/EleutherAI/polyglot-ko-1.3b) | 0.5247 | 0.5260 | 0.5278 | 0.5427 |
@@ -88,11 +88,11 @@ I use A100 GPU 40GB and COLAB, when training.
 | [Llama-2-Ko-7b 20B](https://huggingface.co/beomi/llama-2-ko-7b) | 0.4518 | 0.4668 | 0.4726 | 0.4828 |
 | [Llama-2-Ko-7b 40B](https://huggingface.co/beomi/llama-2-ko-7b) | 0.4562 | 0.4657 | 0.4698 | 0.4774 |
 | [KO-platypus2-13B](https://huggingface.co/kyujinpy/KO-Platypus2-13B) | 0.3912 | 0.4129 | 0.4144 | 0.4330 |
-| **KO-platypus2-7B-EX(ours)** | NaN | NaN | NaN | NaN |
 > Question Answering (QA)
 ### BoolQ (F1)
 | Model | 0-shot | 5-shot | 10-shot | 50-shot |
 | --- | --- | --- | --- | --- |
 | [Polyglot-ko-1.3b](https://huggingface.co/EleutherAI/polyglot-ko-1.3b) | 0.3552 | 0.4751 | 0.4109 | 0.4038 |
@@ -102,11 +102,11 @@ I use A100 GPU 40GB and COLAB, when training.
 | [Llama-2-Ko-7b 20B](https://huggingface.co/beomi/llama-2-ko-7b) | 0.3607 | 0.6797 | 0.6801 | 0.6622 |
 | [Llama-2-Ko-7b 40B](https://huggingface.co/beomi/llama-2-ko-7b) | 0.5786 | 0.6977 | 0.7084 | 0.7144 |
 | [KO-platypus2-13B](https://huggingface.co/kyujinpy/KO-Platypus2-13B) | 0.3539 | 0.7168 | 0.7328 | 0.7172 |
-| **KO-platypus2-7B-EX(ours)** | NaN | NaN | NaN | NaN |
 > Classification
 ### SentiNeg (F1)
 | Model | 0-shot | 5-shot | 10-shot | 50-shot |
 | --- | --- | --- | --- | --- |
 | [Polyglot-ko-1.3b](https://huggingface.co/EleutherAI/polyglot-ko-1.3b) | 0.6790 | 0.6257 | 0.5514 | 0.7851 |
@@ -116,7 +116,7 @@ I use A100 GPU 40GB and COLAB, when training.
 | [Llama-2-Ko-7b 20B](https://huggingface.co/beomi/llama-2-ko-7b) | 0.4855 | 0.8295 | 0.8711 | 0.8513 |
 | [Llama-2-Ko-7b 40B](https://huggingface.co/beomi/llama-2-ko-7b) | 0.4594 | 0.7611 | 0.7276 | 0.9370 |
 | [KO-platypus2-13B](https://huggingface.co/kyujinpy/KO-Platypus2-13B) | 0.5216 | 0.8236 | 0.8487 | 0.8789 |
-| **KO-platypus2-7B-EX(ours)** | NaN | NaN | NaN | NaN |
 # Implementation Code
 ```python

 license: cc-by-nc-4.0
 ---
+# **Ko-Platypus2-7B-EX**
 **More detail repo(Github): [KO-Platypus](https://github.com/Marker-Inc-Korea/KO-Platypus)**
 ![KO-Platypus2-13B](./KO_platypus.png)
 > Question Answering (QA)
 ### COPA (F1)
+![jpg](./results/copa.jpg)
 | Model | 0-shot | 5-shot | 10-shot | 50-shot |
 | --- | --- | --- | --- | --- |
 | [Polyglot-ko-1.3b](https://huggingface.co/EleutherAI/polyglot-ko-1.3b) | 0.7196 | 0.7193 | 0.7204 | 0.7206 |
 | [Llama-2-Ko-7b 20B](https://huggingface.co/beomi/llama-2-ko-7b) | 0.7388 | 0.7626 | 0.7808 | 0.7979 |
 | [Llama-2-Ko-7b 40B](https://huggingface.co/beomi/llama-2-ko-7b) | 0.7436 | 0.7927 | 0.8037 | 0.8259 |
 | [KO-platypus2-13B](https://huggingface.co/kyujinpy/KO-Platypus2-13B) | 0.5820 | 0.6269 | 0.6267 | 0.6527 |
+| **KO-platypus2-7B-EX(ours)** | 0.7509 | 0.7899 | 0.8029 | 0.8290 |
 > Natural Language Inference (NLI) evaluation
 ### HellaSwag (F1)
+![jpg](./results/hella.jpg)
 | Model | 0-shot | 5-shot | 10-shot | 50-shot |
 | --- | --- | --- | --- | --- |
 | [Polyglot-ko-1.3b](https://huggingface.co/EleutherAI/polyglot-ko-1.3b) | 0.5247 | 0.5260 | 0.5278 | 0.5427 |
 | [Llama-2-Ko-7b 20B](https://huggingface.co/beomi/llama-2-ko-7b) | 0.4518 | 0.4668 | 0.4726 | 0.4828 |
 | [Llama-2-Ko-7b 40B](https://huggingface.co/beomi/llama-2-ko-7b) | 0.4562 | 0.4657 | 0.4698 | 0.4774 |
 | [KO-platypus2-13B](https://huggingface.co/kyujinpy/KO-Platypus2-13B) | 0.3912 | 0.4129 | 0.4144 | 0.4330 |
+| **KO-platypus2-7B-EX(ours)** | 0.4571 | 0.4461 | 0.4371 | 0.4525 |
 > Question Answering (QA)
 ### BoolQ (F1)
+![jpg](./results/bool.jpg)
 | Model | 0-shot | 5-shot | 10-shot | 50-shot |
 | --- | --- | --- | --- | --- |
 | [Polyglot-ko-1.3b](https://huggingface.co/EleutherAI/polyglot-ko-1.3b) | 0.3552 | 0.4751 | 0.4109 | 0.4038 |
 | [Llama-2-Ko-7b 20B](https://huggingface.co/beomi/llama-2-ko-7b) | 0.3607 | 0.6797 | 0.6801 | 0.6622 |
 | [Llama-2-Ko-7b 40B](https://huggingface.co/beomi/llama-2-ko-7b) | 0.5786 | 0.6977 | 0.7084 | 0.7144 |
 | [KO-platypus2-13B](https://huggingface.co/kyujinpy/KO-Platypus2-13B) | 0.3539 | 0.7168 | 0.7328 | 0.7172 |
+| **KO-platypus2-7B-EX(ours)** | 0.6028 | 0.6979 | 0.7016 | NaN |
 > Classification
 ### SentiNeg (F1)
+![jpg](./results/senti.jpg)
 | Model | 0-shot | 5-shot | 10-shot | 50-shot |
 | --- | --- | --- | --- | --- |
 | [Polyglot-ko-1.3b](https://huggingface.co/EleutherAI/polyglot-ko-1.3b) | 0.6790 | 0.6257 | 0.5514 | 0.7851 |
 | [Llama-2-Ko-7b 20B](https://huggingface.co/beomi/llama-2-ko-7b) | 0.4855 | 0.8295 | 0.8711 | 0.8513 |
 | [Llama-2-Ko-7b 40B](https://huggingface.co/beomi/llama-2-ko-7b) | 0.4594 | 0.7611 | 0.7276 | 0.9370 |
 | [KO-platypus2-13B](https://huggingface.co/kyujinpy/KO-Platypus2-13B) | 0.5216 | 0.8236 | 0.8487 | 0.8789 |
+| **KO-platypus2-7B-EX(ours)** | 0.5821 | 0.7653 | 0.7991 | NaN |
 # Implementation Code
 ```python

results/copa.jpg ADDED

results/hella.jpg ADDED