MatthiasPicard committed on
Commit
4b19318
·
verified ·
1 Parent(s): 5edbe5b

Update README.md

Files changed (1)
  1. README.md +2 -45
README.md CHANGED
@@ -12,20 +12,8 @@ pinned: false
12
 
13
  ## Model Description
14
 
15
- This is a random baseline model for the Frugal AI Challenge 2024, specifically for the text classification task of identifying climate disinformation. The model serves as a performance floor, randomly assigning labels to text inputs without any learning.
16
-
17
- ### Intended Use
18
-
19
- - **Primary intended uses**: Baseline comparison for climate disinformation classification models
20
- - **Primary intended users**: Researchers and developers participating in the Frugal AI Challenge
21
- - **Out-of-scope use cases**: Not intended for production use or real-world classification tasks
22
-
23
- ## Training Data
24
-
25
- The model uses the QuotaClimat/frugalaichallenge-text-train dataset:
26
- - Size: ~6000 examples
27
- - Split: 80% train, 20% test
28
- - 8 categories of climate disinformation claims
29
 
30
  ### Labels
31
  0. No relevant claim detected
@@ -37,35 +25,4 @@ The model uses the QuotaClimat/frugalaichallenge-text-train dataset:
37
  6. Proponents are biased
38
  7. Fossil fuels are needed
39
 
40
- ## Performance
41
-
42
- ### Metrics
43
- - **Accuracy**: ~12.5% (random chance with 8 classes)
44
- - **Environmental Impact**:
45
- - Emissions tracked in gCO2eq
46
- - Energy consumption tracked in Wh
47
-
48
- ### Model Architecture
49
- The model implements a random choice between the 8 possible labels, serving as the simplest possible baseline.
50
-
51
- ## Environmental Impact
52
-
53
- Environmental impact is tracked using CodeCarbon, measuring:
54
- - Carbon emissions during inference
55
- - Energy consumption during inference
56
-
57
- This tracking helps establish a baseline for the environmental impact of model deployment and inference.
58
-
59
- ## Limitations
60
- - Makes completely random predictions
61
- - No learning or pattern recognition
62
- - No consideration of input text
63
- - Serves only as a baseline reference
64
- - Not suitable for any real-world applications
65
-
66
- ## Ethical Considerations
67
 
68
- - Dataset contains sensitive topics related to climate disinformation
69
- - Model makes random predictions and should not be used for actual classification
70
- - Environmental impact is tracked to promote awareness of AI's carbon footprint
71
- ```
 
12
 
13
  ## Model Description
14
 
15
+ This Space is for the text task of the Frugal AI Challenge. The final model is a ModernBERT fine-tuned on roughly 95,000 samples, a mix of real and synthetic data.
16
+ The dataset is open-sourced at MatthiasPicard/Frugal-AI-Train-Data-88k. The fine-tuned model, along with its training logs, is open-sourced at MatthiasPicard/ModernBERT_frugal_88k.
17
 
18
  ### Labels
19
  0. No relevant claim detected
 
25
  6. Proponents are biased
26
  7. Fossil fuels are needed
27
 
28
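
Review note: the description removed by this commit put the random baseline at roughly 12.5% accuracy, i.e. chance level for 8 labels. The sketch below illustrates where that figure comes from. It is an illustration only, not the Space's actual code: the function names and the synthetic, uniformly distributed gold labels are assumptions made for the example.

```python
import random

NUM_LABELS = 8  # labels 0..7, as listed in the README


def random_baseline(texts, seed=None):
    """Hypothetical baseline: pick one of the 8 labels at random per input.

    The content of each text is never inspected.
    """
    rng = random.Random(seed)
    return [rng.randrange(NUM_LABELS) for _ in texts]


def accuracy(preds, golds):
    """Fraction of predictions that match the gold labels."""
    return sum(p == g for p, g in zip(preds, golds)) / len(golds)


# Synthetic gold labels (an assumption; the real dataset is not loaded here).
label_rng = random.Random(0)
golds = [label_rng.randrange(NUM_LABELS) for _ in range(20000)]

# The baseline ignores its inputs, so any sequence of the right length works.
preds = random_baseline(golds, seed=1)

print(f"chance-level accuracy: {accuracy(preds, golds):.3f}")
```

With uniformly distributed labels the measured accuracy should land close to 1/8. On a dataset with an imbalanced label distribution the expected accuracy of a uniform random guess is still 1/8, but per-class metrics would differ.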