gardarjuto committed on
Commit 8874217 · 1 Parent(s): a5bd804

switch to flat inflection benchmark

Files changed (1)
  1. src/about.py +4 -10
src/about.py CHANGED
@@ -14,9 +14,7 @@ class Tasks(Enum):
  # task_key in the json file, metric_key in the json file, name to display in the leaderboard
  task0 = Task("icelandic_winogrande_stringmatch", "exact_match,get-answer", "WinoGrande-IS (3-shot)")
  task1 = Task("icelandic_sentences_ged_stringmatch", "exact_match,get-answer", "GED")
- task2 = Task("icelandic_inflection_easy", "json_metric,get-answer", "Inflection (common)")
- task3 = Task("icelandic_inflection_medium", "json_metric,get-answer", "Inflection (uncommon)")
- task4 = Task("icelandic_inflection_hard", "json_metric,get-answer", "Inflection (rare)")
+ task2 = Task("icelandic_inflection_all", "exact_match,get-answer", "Inflection (1-shot)")
  task5 = Task("icelandic_belebele", "exact_match,get-answer", "Belebele (IS)")
  task6 = Task("icelandic_arc_challenge", "exact_match,get-answer", "ARC-Challenge-IS")
 
@@ -54,13 +52,9 @@ This is a benchmark for binary sentence-level Icelandic grammatical error detect
  Each example consists of a sentence that may contain one or more grammatical errors, and the task is to predict whether the sentence contains an error.
  - Link to dataset: https://huggingface.co/datasets/mideind/icelandic-sentences-gec
 
- ### Inflection benchmarks
- The inflection benchmarks test the model's ability to generate inflected forms of Icelandic adjective-noun pairs. They are divided into three levels of difficulty by
- commonness: common (100 examples), uncommon (100 examples), and rare (100 examples). The model gets a point for an example if it generates error-free json with the
- correct inflected forms in all cases, singular and plural.
- - Link to dataset (common): https://huggingface.co/datasets/mideind/icelandic-inflection-easy
- - Link to dataset (uncommon): https://huggingface.co/datasets/mideind/icelandic-inflection-medium
- - Link to dataset (rare): https://huggingface.co/datasets/mideind/icelandic-inflection-hard
+ ### Inflection benchmark
+ The inflection benchmark tests models' ability to generate inflected forms of 300 Icelandic adjective-noun pairs for all four cases, singular and plural.
+ - Link to dataset: https://huggingface.co/datasets/mideind/icelandic-inflection-all-flat
 
  ### Belebele (IS)
  This is the Icelandic subset (900 examples) of the Belebele benchmark, a multiple-choice reading comprehension task. The task is to answer questions about a given passage.
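
For readers unfamiliar with how the `Tasks` entries are consumed, here is a minimal sketch of the post-commit enum and a lookup against a results file. The diff does not show the `Task` definition, so the field names `benchmark`, `metric`, and `col_name` are assumptions based only on the comment "task_key in the json file, metric_key in the json file, name to display in the leaderboard". The `leaderboard_row` helper and the shape of `example_results` are hypothetical and are not part of this commit.

```python
from dataclasses import dataclass
from enum import Enum


@dataclass(frozen=True)
class Task:
    # Field names are assumed; the diff only shows positional arguments.
    benchmark: str  # task_key in the json file
    metric: str     # metric_key in the json file
    col_name: str   # name to display in the leaderboard


class Tasks(Enum):
    # State of the enum after this commit: one flat inflection task
    # replaces the former easy/medium/hard entries (task2..task4).
    task0 = Task("icelandic_winogrande_stringmatch", "exact_match,get-answer", "WinoGrande-IS (3-shot)")
    task1 = Task("icelandic_sentences_ged_stringmatch", "exact_match,get-answer", "GED")
    task2 = Task("icelandic_inflection_all", "exact_match,get-answer", "Inflection (1-shot)")
    task5 = Task("icelandic_belebele", "exact_match,get-answer", "Belebele (IS)")
    task6 = Task("icelandic_arc_challenge", "exact_match,get-answer", "ARC-Challenge-IS")


def leaderboard_row(results: dict) -> dict:
    """Hypothetical helper: map display names to scores, None if a task is missing."""
    row = {}
    for task in Tasks:
        t = task.value
        row[t.col_name] = results.get(t.benchmark, {}).get(t.metric)
    return row


# Hypothetical results payload keyed by task_key, then metric_key.
example_results = {"icelandic_inflection_all": {"exact_match,get-answer": 0.5}}
row = leaderboard_row(example_results)
```

Under these assumptions, a results file that still reports only the old `icelandic_inflection_easy`, `_medium`, or `_hard` keys would yield `None` in the "Inflection (1-shot)" column, because the enum no longer references those keys.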