fla-hub/rwkv7-168M-pile


yzhangcs committed on Jan 22 · Commit f1a8e95 · verified · 1 Parent(s): 04a0812

Update README.md

Files changed (1)

README.md +5 -7


@@ -11,7 +11,7 @@ base_model:
 pipeline_tag: text-generation
 ---
-# rwkv7-168m-pile
+# rwkv7-168M-pile
 <!-- Provide a quick summary of what the model is/does. -->
@@ -26,7 +26,7 @@ This is RWKV-7 model under flash-linear attention format.
 - **Developed by:** Bo Peng, Yu Zhang, Songlin Yang, Ruochong Zhang
 - **Funded by:** Shenzhen Yuanshi Intelligent Co. Ltd.
-- **Model type:** RWKV-7
+- **Model type:** RWKV7
 - **Language(s) (NLP):** English
 - **License:** Apache-2.0
 - **Parameter count:** 165M
@@ -46,9 +46,7 @@ This is RWKV-7 model under flash-linear attention format.
 Install flash-linear-attention before using this model:
 ```bash
-git clone https://github.com/fla-org/flash-linear-attention
-cd flash-linear-attention
-pip install -e .
+pip install git+https://github.com/fla-org/flash-linear-attention
 ```
 ### Direct Use
@@ -57,8 +55,8 @@ pip install -e .
 You can use this model just as any other HuggingFace models:
 ```python
 from transformers import AutoModelForCausalLM, AutoTokenizer
-model = AutoModelForCausalLM.from_pretrained('fla-hub/rwkv7-168m-pile', trust_remote_code=True)
-tokenizer = AutoTokenizer.from_pretrained('fla-hub/rwkv7-168m-pile', trust_remote_code=True)
+model = AutoModelForCausalLM.from_pretrained('fla-hub/rwkv7-168M-pile', trust_remote_code=True)
+tokenizer = AutoTokenizer.from_pretrained('fla-hub/rwkv7-168M-pile', trust_remote_code=True)
 ```
 ## Training Details
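
Note on the Direct Use snippet: the README code touched by this commit only loads the model and tokenizer. A minimal sketch of going one step further to text generation might look like the following. It is not part of the commit. The helper name `generate`, the prompt, and the `max_new_tokens` value are illustrative assumptions, and it assumes flash-linear-attention is installed as described in the diff. A GPU is probably needed in practice, so the sketch prefers CUDA when available.

```python
# Repository id as it reads after this commit (capital "M").
MODEL_ID = "fla-hub/rwkv7-168M-pile"


def generate(prompt, max_new_tokens=32):
    """Hypothetical helper: load the model and return a continuation of `prompt`."""
    # Imported lazily so the module can be imported without the heavy dependencies.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    # Assumption: prefer a CUDA device when one is available.
    device = "cuda" if torch.cuda.is_available() else "cpu"

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID, trust_remote_code=True)
    model = AutoModelForCausalLM.from_pretrained(MODEL_ID, trust_remote_code=True)
    model = model.to(device).eval()

    # Tokenize the prompt and move the input ids to the model's device.
    input_ids = tokenizer(prompt, return_tensors="pt")["input_ids"].to(device)

    # Default (greedy unless the model config says otherwise) decoding.
    with torch.no_grad():
        output_ids = model.generate(input_ids, max_new_tokens=max_new_tokens)

    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `generate("The Pile is a dataset that")` would download the weights on first use and return the prompt followed by the model's continuation.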