Upload README.md with huggingface_hub
Browse files
README.md
CHANGED
@@ -11,6 +11,8 @@ base_model:
|
|
11 |
|
12 |
## Model Description
|
13 |
|
|
|
|
|
14 |
The **Profile Generator** is a model designed to extract and generate detailed user profiles from given dialogues, particularly those simulated by our User Simulator for Reinforcement Learning with Cycle Consistency (RLCC) as described in [our paper](https://tongyi.aliyun.com/qianwen/?sessionId=ea3bbcf36a2346a0a7819b06fcb36a1c#). Built upon the **LLaMA-3-Instruct** architecture, this model has been fine-tuned through knowledge distillation of the user profile generation capabilities of **GPT-4o**. As demonstrated in the table below, the distilled Profile Generator achieves dialogue profile consistency (DPC) nearly equivalent to GPT-4o.
|
15 |
|
16 |
| Dataset | Profile Source | DP.P | Avg DP.P # Fact | DPR | Avg DPR # Fact | DPC | SC Val.Score |
|
|
|
11 |
|
12 |
## Model Description
|
13 |
|
14 |
+
> **GitHub repository** for exploring the source code and additional resources: https://github.com/wangkevin02/USP
|
15 |
+
|
16 |
The **Profile Generator** is a model designed to extract and generate detailed user profiles from given dialogues, particularly those simulated by our User Simulator for Reinforcement Learning with Cycle Consistency (RLCC) as described in [our paper](https://tongyi.aliyun.com/qianwen/?sessionId=ea3bbcf36a2346a0a7819b06fcb36a1c#). Built upon the **LLaMA-3-Instruct** architecture, this model has been fine-tuned through knowledge distillation of the user profile generation capabilities of **GPT-4o**. As demonstrated in the table below, the distilled Profile Generator achieves dialogue profile consistency (DPC) nearly equivalent to GPT-4o.
|
17 |
|
18 |
| Dataset | Profile Source | DP.P | Avg DP.P # Fact | DPR | Avg DPR # Fact | DPC | SC Val.Score |
|