wangkevin02 committed on
Commit c0586b9 · verified · 1 Parent(s): bd5dba5

Upload folder using huggingface_hub

Files changed (1)
  1. README.md +5 -6
README.md CHANGED
@@ -13,12 +13,11 @@ base_model:
 
  Our User Simulator with Implicit Profiles (USP) replicates human-like conversational behavior in interactions with large language models (LLMs). By emulating diverse user dynamics based on predefined profiles, it reconstructs realistic user-LLM dialogues, leveraging the LLaMA-3-base-8B architecture with **Conditional Supervised Fine-Tuning (SFT)** and **Reinforcement Learning with Cycle Consistency (RLCC)**. For a detailed methodology and insights, refer to [Our Paper](#).
 
- ## Constraints
-
- 1. **Maximum Context Length**: Supports up to **4,096 tokens**. Exceeding this may degrade performance; keep inputs within this limit for best results.
-
- 1. **Language Limitation**: Optimized for English. Non-English performance may vary due to limited training data.
-
+ > *Note*: Our model is subject to the following constraints:
+ >
+ > 1. **Maximum Context Length**: Supports up to **4,096 tokens**. Exceeding this may degrade performance; keep inputs within this limit for best results.
+ >
+ > 1. **Language Limitation**: Optimized for English. Non-English performance may vary due to limited training data.
 
  ## Sample Implementation
 