ceadar-ie commited on
Commit
03ac24a
·
1 Parent(s): 5cd1dc4

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +19 -3
README.md CHANGED
@@ -175,7 +175,14 @@ AI adoption is evident across various sectors, including:
175
 
176
  ## Training Details
177
 
178
- #### Training Hyperparameters
 
 
 
 
 
 
 
179
 
180
  ## Model Limitations
181
  Potential Biases: With its fine-tuning centered on AI news sources, inherent biases from these sources may reflect in the model's outputs.
@@ -187,7 +194,16 @@ The Llama2-7B-AIVision360 model, developed in collaboration with CeADAR Connect
187
  For any further inquiries or feedback concerning Llama2-7B-AIVision360, please forward your communications to [email protected]
188
 
189
  ### Out-of-Scope Use
 
190
 
191
- ## Bias, Risks, and Limitations
 
 
192
 
193
- ## Citation
 
 
 
 
 
 
 
175
 
176
  ## Training Details
177
 
178
+ ### Training Hyperparameters
179
+ - per_device_train_batch_size = 10
180
+ - gradient_accumulation_steps = 4
181
+ - optim = "paged_adamw_32bit"
182
+ - warmup_steps = 100
183
+ - learning_rate = 2e-4
184
+ - max_grad_norm = 0.3
185
+ - warmup_ratio = 0.03
186
 
187
  ## Model Limitations
188
  Potential Biases: With its fine-tuning centered on AI news sources, inherent biases from these sources may reflect in the model's outputs.
 
194
  For any further inquiries or feedback concerning Llama2-7B-AIVision360, please forward your communications to [email protected]
195
 
196
  ### Out-of-Scope Use
197
+ Llama2-7B-AIVision360 is specifically tailored for AI news discussions. It is not optimized for:
198
 
199
+ - General, non-AI-related conversations.
200
+ - Domain-specific tasks outside AI news.
201
+ - Direct interfacing with physical devices or applications.
202
 
203
+
204
+ ### Bias, Risks, and Limitations
205
+ - Dataset Biases: The AIVision360-8k dataset may contain inherent biases that influence the model's outputs.
206
+ - Over-reliance: The model is an aid, not a replacement for human expertise. Decisions should be made with careful consideration.
207
+ - Content Understanding: The model lacks human-like understanding and cannot judge the veracity of news.
208
+ - Language Limitations: The model's primary language is English. Performance may decrease with other languages.
209
+ - Knowledge Cut-off: The model may not be aware of events or trends post its last training update.