cloudyu/google-gemma-7b-it-dpo-v1

cloudyu committed on Feb 23, 2024

Commit 46c6384 · verified · 1 Parent(s): 1ae78f5

Update README.md

Files changed (1)

README.md +3 -2

@@ -16,7 +16,9 @@ TRL supports the DPO Trainer for training language models from preference data,
  target_modules=[ "gate_proj", "up_proj", "down_proj"]
 ```
-sample codeimport torch
+sample code
+```
+import torch
 from transformers import AutoTokenizer, AutoModelForCausalLM
 import math
@@ -40,4 +42,3 @@ while len(prompt) > 0:
 ```
-```