leafspark
/

wikichat

Text Generation

Model card Files Files and versions

leafspark commited on Apr 17, 2024

Commit

d2c4ea1

·

verified ·

1 Parent(s): 03b0dbd

Update README.md

Files changed (1) hide show

README.md +3 -3

README.md CHANGED Viewed

@@ -20,7 +20,7 @@ The GGUFs uploaded are full FP16 precision.
 - 40M parameters
 - 8 attention heads
 - 28 layers
-- 4096 context (upgraded from 1536, please expect a temporary performance drop)
 ## Prompt Format:
 ```
@@ -38,8 +38,8 @@ Ensure clarity and practicality, allowing readers to easily follow and apply the
 ## Training Details:
 - 1x RTX 3070 8GB
 - 1x Ryzen 3 3700x
-- 7660 iterations
-- Approx 100 million tokens/120k samples (>0.01 epoches)
 - Training data = 1 billion tokens
 ## Notes:

 - 40M parameters
 - 8 attention heads
 - 28 layers
+- 4096/1536 context (refer to model name; upgraded from 1536, please expect a temporary performance drop)
 ## Prompt Format:
 ```
 ## Training Details:
 - 1x RTX 3070 8GB
 - 1x Ryzen 3 3700x
+- 8590 iterations
+- Approx 200 million tokens/140k samples (>0.05 epoches)
 - Training data = 1 billion tokens
 ## Notes: