Update README.md
Browse files
README.md
CHANGED
@@ -1,3 +1,71 @@
|
|
1 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
2 |
license: apache-2.0
|
|
|
|
|
3 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
---
|
2 |
+
language:
|
3 |
+
- en
|
4 |
+
- zh
|
5 |
+
- de
|
6 |
+
- fr
|
7 |
+
- es
|
8 |
+
- pt
|
9 |
+
- ru
|
10 |
+
- it
|
11 |
+
- ja
|
12 |
+
- ko
|
13 |
+
- vi
|
14 |
+
- ar
|
15 |
+
tags:
|
16 |
+
- pytorch
|
17 |
+
- text-generation
|
18 |
+
- causal-lm
|
19 |
+
- rwkv
|
20 |
license: apache-2.0
|
21 |
+
datasets:
|
22 |
+
- khalidalt/Joud
|
23 |
---
|
24 |
+
|
25 |
+
# RWKV-4-World-7b-Arabic
|
26 |
+
|
27 |
+
## Model Description
|
28 |
+
|
29 |
+
|
30 |
+
RWKV-4-World-7b-Arabic is a pretrinaed version of RWKV-4-world that finetuned on Arabic datasets.
|
31 |
+
|
32 |
+
RWKV-4
|
33 |
+
|
34 |
+
How to use:
|
35 |
+
* use https://github.com/josStorer/RWKV-Runner for GUI
|
36 |
+
* use latest rwkv pip package (0.8.0+)
|
37 |
+
* use https://github.com/BlinkDL/ChatRWKV/blob/main/v2/benchmark_world.py and https://github.com/BlinkDL/ChatRWKV/blob/main/API_DEMO_WORLD.py to test it
|
38 |
+
|
39 |
+
The differences between World & Raven:
|
40 |
+
* set pipeline = PIPELINE(model, "rwkv_vocab_v20230424") instead of 20B_tokenizer.json (EXACTLY AS WRITTEN HERE. "rwkv_vocab_v20230424" is included in rwkv 0.7.4+)
|
41 |
+
* use Question/Answer or User/AI or Human/Bot for chat. **DO NOT USE Bob/Alice or Q/A**
|
42 |
+
|
43 |
+
For 0.1/0.4/1.5B models, use **fp32** for first layer (will overflow in fp16 at this moment - fixable in future), or bf16 if you have 30xx/40xx GPUs. Example strategy: cuda fp32 *1 -> cuda fp16
|
44 |
+
|
45 |
+
NOTE: the new greedy tokenizer (https://github.com/BlinkDL/ChatRWKV/blob/main/tokenizer/rwkv_tokenizer.py) will tokenize '\n\n' as one single token instead of ['\n','\n']
|
46 |
+
|
47 |
+
QA prompt (replace \n\n in xxx to \n):
|
48 |
+
```
|
49 |
+
Question: xxx
|
50 |
+
|
51 |
+
Answer:
|
52 |
+
```
|
53 |
+
and
|
54 |
+
```
|
55 |
+
Instruction: xxx
|
56 |
+
|
57 |
+
Input: xxx
|
58 |
+
|
59 |
+
Response:
|
60 |
+
```
|
61 |
+
|
62 |
+
A good chat prompt (replace \n\n in xxx to \n):
|
63 |
+
```
|
64 |
+
User: hi
|
65 |
+
|
66 |
+
Assistant: Hi. I am your assistant and I will provide expert full response in full details. Please feel free to ask any question and I will always answer it.
|
67 |
+
|
68 |
+
User: xxx
|
69 |
+
|
70 |
+
Assistant:
|
71 |
+
```
|