beyoru commited on
Commit
587bd67
·
verified ·
1 Parent(s): e44610a

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +9 -9
README.md CHANGED
@@ -1,10 +1,9 @@
1
  ---
2
- base_model: unsloth/Qwen2.5-1.5B-Instruct
3
  tags:
4
  - text-generation-inference
5
  - transformers
6
  - unsloth
7
- - qwen2
8
  - trl
9
  - grpo
10
  license: apache-2.0
@@ -13,11 +12,12 @@ language:
13
  ---
14
 
15
  # Uploaded model
 
 
 
 
 
 
 
16
 
17
- - **Developed by:** beyoru
18
- - **License:** apache-2.0
19
- - **Finetuned from model :** unsloth/Qwen2.5-1.5B-Instruct
20
-
21
- This qwen2 model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.
22
-
23
- [<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)
 
1
  ---
2
+ base_model: Qwen/Qwen2.5-1.5B-Instruct
3
  tags:
4
  - text-generation-inference
5
  - transformers
6
  - unsloth
 
7
  - trl
8
  - grpo
9
  license: apache-2.0
 
12
  ---
13
 
14
  # Uploaded model
15
+ ```
16
+ config = {
17
+ "rank": 8,
18
+ "alpha": 16,
19
+ "learning_rate": 2e-5,
20
+ "target_modules": ["mlps"]
21
+ }
22
 
23
+ ```