crumb committed
Commit 7314377 · 1 Parent(s): fb51487

Update README.md

Files changed (1)
  1. README.md +10 -0
README.md CHANGED
@@ -31,6 +31,16 @@ prompt.
  - Sharded model: [crumbly/gpt2-linear-xl-sharded](https://hf.co/crumbly/gpt2-linear-xl-sharded)
  - Sharded + Brain-float 16bit model: [crumbly/gpt2-linear-xl-sharded-bf16](https://hf.co/crumbly/gpt2-linear-xl-sharded-bf16)
 
+ Config:
+
+ ```
+ {
+ "n_embd": 1600,
+ "n_head": 25,
+ "n_layer": 48,
+ "n_positions": 1024,
+ }
+ ```
 
  ### Usage
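
To make the added Config values concrete, here is a minimal Python sketch that derives the per-head width and a rough parameter count from them. It is an illustration, not the author's code: the helper names `head_dim` and `estimate_params` are hypothetical, the vocabulary size of 50257 is an assumption (the standard GPT-2 value, not stated in the README), and the count assumes a standard GPT-2 block layout with a tied output embedding, which may not match this model exactly. The snippet as committed ends with a trailing comma, so a strict JSON parser would reject it unchanged; the values are therefore written out as a Python dict.

```python
# Values copied from the Config block added in this commit.
config = {
    "n_embd": 1600,
    "n_head": 25,
    "n_layer": 48,
    "n_positions": 1024,
}

# Assumption (not stated in the README): the standard GPT-2 vocabulary size.
ASSUMED_VOCAB_SIZE = 50257


def head_dim(cfg):
    """Width of each attention head; n_embd must divide evenly by n_head."""
    if cfg["n_embd"] % cfg["n_head"] != 0:
        raise ValueError("n_embd must be divisible by n_head")
    return cfg["n_embd"] // cfg["n_head"]


def estimate_params(cfg, vocab_size=ASSUMED_VOCAB_SIZE):
    """Rough parameter count, assuming a standard GPT-2 layout with tied output embedding."""
    d = cfg["n_embd"]
    # Token embeddings plus learned position embeddings.
    embeddings = vocab_size * d + cfg["n_positions"] * d
    # Fused q/k/v projection plus attention output projection (weights + biases).
    attention = (d * 3 * d + 3 * d) + (d * d + d)
    # Feed-forward up-projection to 4*d and down-projection back to d.
    mlp = (d * 4 * d + 4 * d) + (4 * d * d + d)
    # Two LayerNorms per block, each with a scale and a bias vector.
    layer_norms = 2 * 2 * d
    per_block = attention + mlp + layer_norms
    # Final LayerNorm after the last block.
    final_norm = 2 * d
    return embeddings + cfg["n_layer"] * per_block + final_norm


if __name__ == "__main__":
    print("head_dim:", head_dim(config))
    print("estimated parameters:", estimate_params(config))
```

Under these assumptions each of the 25 heads is 64 wide, and the estimate comes to roughly 1.56 billion parameters, in line with the "xl" in the model name.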