mhenrichsen/danskgpt-tiny-chat

mhenrichsen committed on Jan 6, 2024

Commit d4c79c0 · 1 Parent(s): 4b3db6c

Update README.md

Files changed (1)

README.md +2 -20

@@ -87,26 +87,8 @@ print("AI:", chat_response)
 ```
-## Training procedure
-### Training hyperparameters
-The following hyperparameters were used during training:
-- learning_rate: 5e-05
-- train_batch_size: 16
-- eval_batch_size: 16
-- seed: 42
-- distributed_type: multi-GPU
-- num_devices: 4
-- gradient_accumulation_steps: 4
-- total_train_batch_size: 256
-- total_eval_batch_size: 64
-- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
-- lr_scheduler_type: cosine
-- lr_scheduler_warmup_steps: 10
-- num_epochs: 3
-### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|

 ```
+## Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
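
The hyperparameter list removed by this commit contains two derived values, `total_train_batch_size: 256` and `total_eval_batch_size: 64`. A minimal sketch of how they follow from the other entries, assuming `train_batch_size` and `eval_batch_size` are per-device sizes (an assumption that is consistent with the listed totals); the helper name `effective_batch_size` is hypothetical and not part of the repository:

```python
# Values copied from the removed "Training hyperparameters" list.
train_batch_size = 16            # assumed to be per device
eval_batch_size = 16             # assumed to be per device
num_devices = 4
gradient_accumulation_steps = 4


def effective_batch_size(per_device, devices, accumulation_steps=1):
    """Hypothetical helper: samples contributing to one step across all devices."""
    return per_device * devices * accumulation_steps


# Training accumulates gradients over several forward/backward passes
# before each optimizer update, so the accumulation factor applies.
total_train_batch_size = effective_batch_size(
    train_batch_size, num_devices, gradient_accumulation_steps
)

# Evaluation has no optimizer step, so there is no accumulation factor.
total_eval_batch_size = effective_batch_size(eval_batch_size, num_devices)

print(total_train_batch_size)  # 256
print(total_eval_batch_size)   # 64
```

Both results match the totals in the removed list, so the three inputs are enough to reconstruct them if the section is ever restored.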