Update README.md
Browse files
README.md
CHANGED
|
@@ -1,213 +1,74 @@
|
|
| 1 |
-
|
| 2 |
-
|
| 3 |
-
|
| 4 |
-
|
| 5 |
-
|
| 6 |
-
|
| 7 |
-
|
| 8 |
-
|
| 9 |
-
|
| 10 |
-
|
| 11 |
-
|
| 12 |
-
|
| 13 |
-
|
| 14 |
-
|
| 15 |
-
|
| 16 |
-
|
| 17 |
-
|
| 18 |
-
|
| 19 |
-
|
| 20 |
-
|
| 21 |
-
|
| 22 |
-
|
| 23 |
-
|
| 24 |
-
|
| 25 |
-
|
| 26 |
-
|
| 27 |
-
|
| 28 |
-
|
| 29 |
-
|
| 30 |
-
|
| 31 |
-
|
| 32 |
-
|
| 33 |
-
|
| 34 |
-
|
| 35 |
-
|
| 36 |
-
|
| 37 |
-
|
| 38 |
-
|
| 39 |
-
|
| 40 |
-
|
| 41 |
-
|
| 42 |
-
|
| 43 |
-
|
| 44 |
-
|
| 45 |
-
|
| 46 |
-
|
| 47 |
-
|
| 48 |
-
|
| 49 |
-
|
| 50 |
-
|
| 51 |
-
|
| 52 |
-
|
| 53 |
-
|
| 54 |
-
|
| 55 |
-
|
| 56 |
-
|
| 57 |
-
|
| 58 |
-
|
| 59 |
-
## 环境配置
|
| 60 |
-
以下指令需在 Python 版本大于3.8的环境中执行。
|
| 61 |
-
|
| 62 |
-
### Windows/Linux/MacOS等平台通用方法
|
| 63 |
-
下列方法任选其一。
|
| 64 |
-
#### 1. 通过 pip 安装依赖
|
| 65 |
-
1. 安装Pytorch及其核心依赖,若已安装则跳过。参考自: https://pytorch.org/get-started/locally/
|
| 66 |
-
```bash
|
| 67 |
-
pip install torch torchvision torchaudio
|
| 68 |
-
```
|
| 69 |
-
2. 如果是 win 系统 + Nvidia Ampere 架构(RTX30xx),根据 #21 的经验,需要指定 pytorch 对应的 cuda 版本
|
| 70 |
```bash
|
| 71 |
-
|
| 72 |
```
|
| 73 |
-
|
| 74 |
-
- N卡
|
| 75 |
```bash
|
| 76 |
-
|
| 77 |
```
|
| 78 |
-
- A卡/I卡
|
| 79 |
-
```bash
|
| 80 |
-
pip install -r requirements-dml.txt
|
| 81 |
-
```
|
| 82 |
-
- A卡ROCM(Linux)
|
| 83 |
-
```bash
|
| 84 |
-
pip install -r requirements-amd.txt
|
| 85 |
-
```
|
| 86 |
-
- I卡IPEX(Linux)
|
| 87 |
-
```bash
|
| 88 |
-
pip install -r requirements-ipex.txt
|
| 89 |
-
```
|
| 90 |
-
|
| 91 |
-
#### 2. 通过 poetry 来安装依赖
|
| 92 |
-
安装 Poetry 依赖管理工具,若已安装则跳过。参考自: https://python-poetry.org/docs/#installation
|
| 93 |
-
```bash
|
| 94 |
-
curl -sSL https://install.python-poetry.org | python3 -
|
| 95 |
-
```
|
| 96 |
-
|
| 97 |
-
通过 Poetry 安装依赖时,python 建议使用 3.7-3.10 版本,其余版本在安装 llvmlite==0.39.0 时会出现冲突
|
| 98 |
-
```bash
|
| 99 |
-
poetry init -n
|
| 100 |
-
poetry env use "path to your python.exe"
|
| 101 |
-
poetry run pip install -r requirments.txt
|
| 102 |
-
```
|
| 103 |
-
|
| 104 |
-
### MacOS
|
| 105 |
-
可以通过 `run.sh` 来安装依赖
|
| 106 |
-
```bash
|
| 107 |
-
sh ./run.sh
|
| 108 |
-
```
|
| 109 |
-
|
| 110 |
-
## 其他预模型准备
|
| 111 |
-
RVC需要其他一些预模型来推理和训练。
|
| 112 |
|
| 113 |
-
|
| 114 |
-
|
| 115 |
-
|
| 116 |
-
|
| 117 |
-
|
| 118 |
-
- ./assets/hubert/hubert_base.pt
|
| 119 |
-
|
| 120 |
-
- ./assets/pretrained
|
| 121 |
-
|
| 122 |
-
- ./assets/uvr5_weights
|
| 123 |
-
|
| 124 |
-
想使用v2版本模型的话,需要额外下载
|
| 125 |
-
|
| 126 |
-
- ./assets/pretrained_v2
|
| 127 |
-
|
| 128 |
-
### 2. 安装 ffmpeg
|
| 129 |
-
若ffmpeg和ffprobe已安装则跳过。
|
| 130 |
-
|
| 131 |
-
#### Ubuntu/Debian 用户
|
| 132 |
-
```bash
|
| 133 |
-
sudo apt install ffmpeg
|
| 134 |
-
```
|
| 135 |
-
#### MacOS 用户
|
| 136 |
-
```bash
|
| 137 |
-
brew install ffmpeg
|
| 138 |
-
```
|
| 139 |
-
#### Windows 用户
|
| 140 |
-
下载后放置在根目录。
|
| 141 |
-
- 下载[ffmpeg.exe](https://huggingface.co/lj1995/VoiceConversionWebUI/blob/main/ffmpeg.exe)
|
| 142 |
-
|
| 143 |
-
- 下载[ffprobe.exe](https://huggingface.co/lj1995/VoiceConversionWebUI/blob/main/ffprobe.exe)
|
| 144 |
-
|
| 145 |
-
### 3. 下载 rmvpe 人声音高提取算法所需文件
|
| 146 |
-
|
| 147 |
-
如果你想使用最新的RMVPE人声音高提取算法,则你需要下载音高提取模型参数并放置于RVC根目录。
|
| 148 |
-
|
| 149 |
-
- 下载[rmvpe.pt](https://huggingface.co/lj1995/VoiceConversionWebUI/blob/main/rmvpe.pt)
|
| 150 |
-
|
| 151 |
-
#### 下载 rmvpe 的 dml 环境(可选, A卡/I卡用户)
|
| 152 |
-
|
| 153 |
-
- 下载[rmvpe.onnx](https://huggingface.co/lj1995/VoiceConversionWebUI/blob/main/rmvpe.onnx)
|
| 154 |
-
|
| 155 |
-
### 4. AMD显卡Rocm(可选, 仅Linux)
|
| 156 |
-
|
| 157 |
-
如果你想基于AMD的Rocm技术在Linux系统上运行RVC,请先在[这里](https://rocm.docs.amd.com/en/latest/deploy/linux/os-native/install.html)安装所需的驱动。
|
| 158 |
-
|
| 159 |
-
若你使用的是Arch Linux,可以使用pacman来安装所需驱动:
|
| 160 |
-
````
|
| 161 |
-
pacman -S rocm-hip-sdk rocm-opencl-sdk
|
| 162 |
-
````
|
| 163 |
-
对于某些型号的显卡,你可能需要额外配置如下的环境变量(如:RX6700XT):
|
| 164 |
-
````
|
| 165 |
-
export ROCM_PATH=/opt/rocm
|
| 166 |
-
export HSA_OVERRIDE_GFX_VERSION=10.3.0
|
| 167 |
-
````
|
| 168 |
-
同时确保你的当前用户处于`render`与`video`用户组内:
|
| 169 |
-
````
|
| 170 |
-
sudo usermod -aG render $USERNAME
|
| 171 |
-
sudo usermod -aG video $USERNAME
|
| 172 |
-
````
|
| 173 |
-
|
| 174 |
-
## 开始使用
|
| 175 |
-
### 直接启动
|
| 176 |
-
使用以下指令来启动 WebUI
|
| 177 |
-
```bash
|
| 178 |
-
python infer-web.py
|
| 179 |
-
```
|
| 180 |
-
|
| 181 |
-
若先前使用 Poetry 安装依赖,则可以通过以下方式启动WebUI
|
| 182 |
-
```bash
|
| 183 |
-
poetry run python infer-web.py
|
| 184 |
-
```
|
| 185 |
-
|
| 186 |
-
### 使用整合包
|
| 187 |
-
下载并解压`RVC-beta.7z`
|
| 188 |
-
#### Windows 用户
|
| 189 |
-
双击`go-web.bat`
|
| 190 |
-
#### MacOS 用户
|
| 191 |
-
```bash
|
| 192 |
-
sh ./run.sh
|
| 193 |
-
```
|
| 194 |
-
### 对于需要使用IPEX技术的I卡用户(仅Linux)
|
| 195 |
-
```bash
|
| 196 |
-
source /opt/intel/oneapi/setvars.sh
|
| 197 |
-
```
|
| 198 |
|
| 199 |
-
## 参考项目
|
| 200 |
-
+ [ContentVec](https://github.com/auspicious3000/contentvec/)
|
| 201 |
-
+ [VITS](https://github.com/jaywalnut310/vits)
|
| 202 |
-
+ [HIFIGAN](https://github.com/jik876/hifi-gan)
|
| 203 |
-
+ [Gradio](https://github.com/gradio-app/gradio)
|
| 204 |
-
+ [FFmpeg](https://github.com/FFmpeg/FFmpeg)
|
| 205 |
-
+ [Ultimate Vocal Remover](https://github.com/Anjok07/ultimatevocalremovergui)
|
| 206 |
-
+ [audio-slicer](https://github.com/openvpi/audio-slicer)
|
| 207 |
-
+ [Vocal pitch extraction:RMVPE](https://github.com/Dream-High/RMVPE)
|
| 208 |
-
+ The pretrained model is trained and tested by [yxlllc](https://github.com/yxlllc/RMVPE) and [RVC-Boss](https://github.com/RVC-Boss).
|
| 209 |
|
| 210 |
-
|
| 211 |
-
<a href="https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI/graphs/contributors" target="_blank">
|
| 212 |
-
<img src="https://contrib.rocks/image?repo=RVC-Project/Retrieval-based-Voice-Conversion-WebUI" />
|
| 213 |
-
</a>
|
|
|
|
| 1 |
+
---
|
| 2 |
+
license: mit
|
| 3 |
+
---
|
| 4 |
+
|
| 5 |
+
|
| 6 |
+
# easyGUI
|
| 7 |
+
|
| 8 |
+
`easyGUI` is a user-friendly voice conversion framework based on VITS, designed to eliminate timbre leakage by replacing input features with those from the training set. It's efficient even on lower-end GPUs, requiring only about 10 minutes of low-noise speech data for good results. The framework features a simple web interface, supports A card and I card acceleration, and uses the advanced RMVPE algorithm for pitch extraction.
|
| 9 |
+
|
| 10 |
+
## Installation
|
| 11 |
+
|
| 12 |
+
### Prerequisites
|
| 13 |
+
- Python 3.8 or higher
|
| 14 |
+
|
| 15 |
+
### Installation Steps
|
| 16 |
+
1. **Install Pytorch**:
|
| 17 |
+
```bash
|
| 18 |
+
pip install torch torchvision torchaudio
|
| 19 |
+
```
|
| 20 |
+
For Nvidia Ampere (RTX30xx):
|
| 21 |
+
```bash
|
| 22 |
+
pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu117
|
| 23 |
+
```
|
| 24 |
+
|
| 25 |
+
2. **Install Dependencies**:
|
| 26 |
+
- For Nvidia GPUs:
|
| 27 |
+
```bash
|
| 28 |
+
pip install -r requirements.txt
|
| 29 |
+
```
|
| 30 |
+
- For AMD/Intel GPUs:
|
| 31 |
+
```bash
|
| 32 |
+
pip install -r requirements-dml.txt
|
| 33 |
+
```
|
| 34 |
+
- For AMD ROCm (Linux):
|
| 35 |
+
```bash
|
| 36 |
+
pip install -r requirements-amd.txt
|
| 37 |
+
```
|
| 38 |
+
- For Intel IPEX (Linux):
|
| 39 |
+
```bash
|
| 40 |
+
pip install -r requirements-ipex.txt
|
| 41 |
+
```
|
| 42 |
+
|
| 43 |
+
3. **Install Optional Dependencies** (if needed):
|
| 44 |
+
```bash
|
| 45 |
+
sh ./run.sh # For MacOS
|
| 46 |
+
```
|
| 47 |
+
|
| 48 |
+
### Additional Setup
|
| 49 |
+
- **Download Assets**:
|
| 50 |
+
Download necessary models and files using the scripts in the `tools` directory.
|
| 51 |
+
- **Install FFmpeg**:
|
| 52 |
+
```bash
|
| 53 |
+
sudo apt install ffmpeg # Ubuntu/Debian
|
| 54 |
+
brew install ffmpeg # MacOS
|
| 55 |
+
```
|
| 56 |
+
|
| 57 |
+
## Usage
|
| 58 |
+
Start the WebUI:
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 59 |
```bash
|
| 60 |
+
python demo.py
|
| 61 |
```
|
| 62 |
+
If using Poetry:
|
|
|
|
| 63 |
```bash
|
| 64 |
+
poetry run python demo.py
|
| 65 |
```
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 66 |
|
| 67 |
+
## Features
|
| 68 |
+
- Top1 retrieval to replace input features
|
| 69 |
+
- Fast training on less powerful GPUs
|
| 70 |
+
- Model merging to change timbre
|
| 71 |
+
- Advanced pitch extraction with RMVPE
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 72 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 73 |
|
| 74 |
+
---
|
|
|
|
|
|
|
|
|