Update README.md
Browse files
README.md
CHANGED
@@ -175,7 +175,14 @@ AI adoption is evident across various sectors, including:
|
|
175 |
|
176 |
## Training Details
|
177 |
|
178 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
179 |
|
180 |
## Model Limitations
|
181 |
Potential Biases: With its fine-tuning centered on AI news sources, inherent biases from these sources may reflect in the model's outputs.
|
@@ -187,7 +194,16 @@ The Llama2-7B-AIVision360 model, developed in collaboration with CeADAR Connect
|
|
187 |
For any further inquiries or feedback concerning Llama2-7B-AIVision360, please forward your communications to [email protected]
|
188 |
|
189 |
### Out-of-Scope Use
|
|
|
190 |
|
191 |
-
|
|
|
|
|
192 |
|
193 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
175 |
|
176 |
## Training Details
|
177 |
|
178 |
+
### Training Hyperparameters
|
179 |
+
- per_device_train_batch_size = 10
|
180 |
+
- gradient_accumulation_steps = 4
|
181 |
+
- optim = "paged_adamw_32bit"
|
182 |
+
- warmup_steps = 100
|
183 |
+
- learning_rate = 2e-4
|
184 |
+
- max_grad_norm = 0.3
|
185 |
+
- warmup_ratio = 0.03
|
186 |
|
187 |
## Model Limitations
|
188 |
Potential Biases: With its fine-tuning centered on AI news sources, inherent biases from these sources may reflect in the model's outputs.
|
|
|
194 |
For any further inquiries or feedback concerning Llama2-7B-AIVision360, please forward your communications to [email protected]
|
195 |
|
196 |
### Out-of-Scope Use
|
197 |
+
Llama2-7B-AIVision360 is specifically tailored for AI news discussions. It is not optimized for:
|
198 |
|
199 |
+
- General, non-AI-related conversations.
|
200 |
+
- Domain-specific tasks outside AI news.
|
201 |
+
- Direct interfacing with physical devices or applications.
|
202 |
|
203 |
+
|
204 |
+
### Bias, Risks, and Limitations
|
205 |
+
- Dataset Biases: The AIVision360-8k dataset may contain inherent biases that influence the model's outputs.
|
206 |
+
- Over-reliance: The model is an aid, not a replacement for human expertise. Decisions should be made with careful consideration.
|
207 |
+
- Content Understanding: The model lacks human-like understanding and cannot judge the veracity of news.
|
208 |
+
- Language Limitations: The model's primary language is English. Performance may decrease with other languages.
|
209 |
+
- Knowledge Cut-off: The model may not be aware of events or trends post its last training update.
|