mphi commited on
Commit
457bdb7
·
1 Parent(s): 76933a6

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +5 -1
README.md CHANGED
@@ -10,7 +10,11 @@ model-index:
10
 
11
  A GPT model for Estonian (base-size), trained from scratch on 2.2 billion words (Estonian National Corpus + News Crawl + Common Crawl). Trained for 3 epochs.
12
 
13
- Model details:
 
 
 
 
14
  - num. of layers: 12
15
  - num. of heads: 12
16
  - embedding size: 768
 
10
 
11
  A GPT model for Estonian (base-size), trained from scratch on 2.2 billion words (Estonian National Corpus + News Crawl + Common Crawl). Trained for 3 epochs.
12
 
13
+ ### Format
14
+
15
+ For training data was prepended with a text domain tag, and it should be added as prefix when using the model: >general<, >web<, >news<, >doaj< and >wiki< (standing for general texts, web crawled texts, news, article abstracts and wikipedia texts). Use the prefixes like this, e.g: ">web< Kas tead, et".
16
+
17
+ ### Model details
18
  - num. of layers: 12
19
  - num. of heads: 12
20
  - embedding size: 768