Kundyzka
/

XLM-Roberta-large-informatics-kaz

@@ -5,18 +5,18 @@ datasets:
 language:
 - kk
 metrics:
-  - name: exact_match
     type: Exact Match
     value: 13.116
-  - name: f1
     type: F1 Score
-    value: 26.950
-  - name: exact_match_test
-    type: Exact Match (Test)
-    value: 49.740
-  - name: f1_test
-    type: F1 Score (Test)
     value: 70.127
 base_model:
 - FacebookAI/xlm-roberta-large
 pipeline_tag: question-answering
@@ -26,7 +26,7 @@ tags:
 # Description
-This model was developed by **Kundyz Maksutova**, PhD Candidate, as part of research on question-answering systems in the Kazakh language. It is a fine-tuned version of `FacebookAI/xlm-roberta-large` on the `Kundyzka/informatics_kaz` dataset, tailored for the domain of computer science.
 ### Key Features:
 - **Base Model**: `FacebookAI/xlm-roberta-large`
@@ -34,17 +34,30 @@ This model was developed by **Kundyz Maksutova**, PhD Candidate, as part of rese
 - **Language**: Kazakh (`kk`)
 - **Task**: Question Answering
 - **Performance**:
-  - Validation:
     - F1 Score: 26.950
     - Exact Match: 13.116
-  - Test:
     - F1 Score: 70.127
     - Exact Match: 49.740
 ### Intended Use:
-This model is designed for use in question-answering systems, particularly in the domain of computer science and related fields. It can handle questions and provide accurate answers in the Kazakh language, making it an ideal tool for educational, research, and application development purposes.
 ### Tags:
 - `computerscience`
-For more details, fine-tuning instructions, or adaptation to other tasks, refer to the model repository.

 language:
 - kk
 metrics:
+  - name: F1 (Before Training)
+    type: F1 Score
+    value: 26.950
+  - name: Exact Match (Before Training)
     type: Exact Match
     value: 13.116
+  - name: F1 (After Training)
     type: F1 Score
     value: 70.127
+  - name: Exact Match (After Training)
+    type: Exact Match
+    value: 49.740
 base_model:
 - FacebookAI/xlm-roberta-large
 pipeline_tag: question-answering
 # Description
+This model was developed by **Kundyz Maksutova**, PhD Candidate, as part of research on question-answering systems in the Kazakh language. It is a fine-tuned version of `FacebookAI/xlm-roberta-large` on the `Kundyzka/informatics_kaz` dataset, specifically optimized for handling questions in the domain of computer science.
 ### Key Features:
 - **Base Model**: `FacebookAI/xlm-roberta-large`
 - **Language**: Kazakh (`kk`)
 - **Task**: Question Answering
 - **Performance**:
+  - **Before Training**:
     - F1 Score: 26.950
     - Exact Match: 13.116
+  - **After Training**:
     - F1 Score: 70.127
     - Exact Match: 49.740
+### Dataset:
+The `Kundyzka/informatics_kaz` dataset is designed to provide a diverse set of questions and answers in Kazakh, specifically covering topics in computer science. This dataset ensures that the model effectively handles domain-specific queries and terminology.
 ### Intended Use:
+This model is intended for answering questions in the Kazakh language, with potential applications in:
+- **Educational Platforms**: Assisting students with computer science-related questions.
+- **Research Projects**: Supporting the study and development of Kazakh natural language processing tools.
+- **AI Applications**: Enhancing chatbots or intelligent systems requiring domain-specific question-answering capabilities.
+### Limitations and Ethical Considerations:
+- **Domain-Specific Bias**: The model performs best on computer science queries and may not generalize well to other domains.
+- **Dataset Bias**: The dataset may introduce biases that affect model predictions.
+- **Language Support**: The model is optimized for Kazakh and does not handle other languages.
 ### Tags:
 - `computerscience`
+- `question-answering`
+- `Kazakh`
+This model represents a significant contribution to improving natural language processing tools for low-resource languages like Kazakh. For further details or customization, refer to the model repository.