<div align="center">

<h1>Retrieval-based-Voice-Conversion-WebUI</h1>
An easy-to-use SVC framework based on VITS.<br><br>

[![madewithlove](https://forthebadge.com/images/badges/built-with-love.svg)](https://github.com/liujing04/Retrieval-based-Voice-Conversion-WebUI)

<img src="https://counter.seku.su/cmoe?name=rvc&theme=r34" /><br>

[![Open In Colab](https://img.shields.io/badge/Colab-F9AB00?style=for-the-badge&logo=googlecolab&color=525252)](https://colab.research.google.com/github/liujing04/Retrieval-based-Voice-Conversion-WebUI/blob/main/Retrieval_based_Voice_Conversion_WebUI.ipynb)
[![Licence](https://img.shields.io/github/license/liujing04/Retrieval-based-Voice-Conversion-WebUI?style=for-the-badge)](https://github.com/liujing04/Retrieval-based-Voice-Conversion-WebUI/blob/main/%E4%BD%BF%E7%94%A8%E9%9C%80%E9%81%B5%E5%AE%88%E7%9A%84%E5%8D%8F%E8%AE%AE-LICENSE.txt)
[![Huggingface](https://img.shields.io/badge/🤗%20-Spaces-blue.svg?style=for-the-badge)](https://huggingface.co/lj1995/VoiceConversionWebUI/tree/main/)

</div>

------
[**Changelog**](https://github.com/liujing04/Retrieval-based-Voice-Conversion-WebUI/blob/main/Changelog_CN.md)

[**English**](./README.md) | [**中文简体**](./README_zh_CN.md)

> Check out our [Demo Video](https://www.bilibili.com/video/BV1pm4y1z7Gm/)!

> Realtime voice conversion software using RVC: [w-okada/voice-changer](https://github.com/w-okada/voice-changer)

## Summary
This repository has the following features:
+ Top-1 feature retrieval to reduce tone leakage;
+ Easy and fast training, even on relatively weak graphics cards;
+ Relatively good results even when training on a small amount of data;
+ Model fusion to change timbres;
+ An easy-to-use WebUI;
+ The UVR5 model for quickly separating vocals and instruments;
+ A pre-trained model whose dataset uses nearly 50 hours of the high-quality, open-source VCTK corpus. High-quality licensed song datasets will be added over time, so you can use them without worrying about copyright infringement.

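As a rough illustration of the retrieval idea in the first bullet, and not the repository's actual implementation, the sketch below assumes that each audio frame is represented by a feature vector and that "top-1 retrieval" means swapping it for the single closest vector drawn from the training set. The function names, the Euclidean metric, and the linear scan are all assumptions made for the example; the `faiss` entry in `requirements.txt` suggests the project uses a proper vector index instead.

```python
from math import dist
from typing import List, Sequence

Vector = Sequence[float]


def top1_retrieve(query: Vector, index: List[Vector]) -> Vector:
    """Return the vector in `index` closest to `query` (Euclidean distance)."""
    if not index:
        raise ValueError("index must contain at least one vector")
    return min(index, key=lambda candidate: dist(query, candidate))


def replace_with_top1(frames: List[Vector], index: List[Vector]) -> List[Vector]:
    """Swap every source frame for its nearest neighbour in the training index."""
    return [top1_retrieve(frame, index) for frame in frames]
```

Because every output frame comes from the training index, characteristics of the source speaker that are absent from the training data cannot pass through unchanged, which is the intuition behind reduced leakage.
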
## Preparing the environment
We recommend installing the dependencies with Poetry.

Run the following commands in an environment with Python 3.8 or higher:
```bash
# Install PyTorch-related core dependencies, skip if installed
# Reference: https://pytorch.org/get-started/locally/
pip install torch torchvision torchaudio

# For Windows with a 30-series Nvidia card, specify the CUDA version that matches PyTorch,
# as reported in https://github.com/liujing04/Retrieval-based-Voice-Conversion-WebUI/issues/21
pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu117

# Install the Poetry dependency management tool, skip if installed
# Reference: https://python-poetry.org/docs/#installation
curl -sSL https://install.python-poetry.org | python3 -

# Install the project dependencies
poetry install
```

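Before or after running the install commands, a quick check can confirm that the interpreter meets the stated Python 3.8 minimum and that the PyTorch packages are visible. This is a minimal sketch: `check_environment` is a hypothetical helper written for this README and is not part of the repository.

```python
import importlib.util
import sys

MIN_PYTHON = (3, 8)  # minimum version stated in this README


def check_environment(packages=("torch", "torchvision", "torchaudio")):
    """Return (python_ok, missing_packages) without importing the packages."""
    python_ok = sys.version_info[:2] >= MIN_PYTHON
    # find_spec returns None when a top-level package cannot be located.
    missing = [name for name in packages if importlib.util.find_spec(name) is None]
    return python_ok, missing


if __name__ == "__main__":
    python_ok, missing = check_environment()
    print("Python >= 3.8:", python_ok)
    print("Missing packages:", ", ".join(missing) or "none")
```

The check only looks the packages up and does not import them, so it stays fast even when PyTorch is installed.
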
You can also use pip to install the dependencies.

**Notice**: On macOS, `faiss 1.7.2` raises `Segmentation fault: 11`. Change the corresponding line in `requirements.txt` to `faiss-cpu==1.7.0`.

```bash
pip install -r requirements.txt
```

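The macOS workaround above can be scripted. The sketch below is hypothetical and makes one loud assumption: the faiss requirement sits on its own line in `requirements.txt` and that line starts with `faiss`. Check the real file before relying on it.

```python
import re
import sys

# Assumption: the faiss requirement is a single line beginning with "faiss".
FAISS_LINE = re.compile(r"^faiss[^\n]*$", flags=re.MULTILINE)


def pin_faiss_for_macos(requirements_text: str, platform: str = sys.platform) -> str:
    """Return the requirements text with faiss pinned to faiss-cpu==1.7.0 on macOS."""
    if platform != "darwin":
        return requirements_text  # other platforms are left untouched
    return FAISS_LINE.sub("faiss-cpu==1.7.0", requirements_text)
```

To apply it, read `requirements.txt`, pass its contents through `pin_faiss_for_macos`, and write the result back before running `pip install -r requirements.txt`.
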
## Preparing other pre-trained models
RVC requires other pre-trained models for inference and training.

You need to download them from our [Huggingface space](https://huggingface.co/lj1995/VoiceConversionWebUI/tree/main/).

Here's a list of the pre-trained models and other files that RVC needs:
```bash
hubert_base.pt

./pretrained

./uvr5_weights

# If you are using Windows, you may also need this file, skip if FFmpeg is installed
ffmpeg.exe
```

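A small check can report which of the files listed above are still missing before you launch. This is a sketch under assumptions: `missing_assets` is a hypothetical helper, and it assumes that `hubert_base.pt`, `pretrained`, and `uvr5_weights` all sit directly in the directory you pass in (typically the repository root).

```python
import shutil
import sys
from pathlib import Path

REQUIRED_FILES = ("hubert_base.pt",)
REQUIRED_DIRS = ("pretrained", "uvr5_weights")


def missing_assets(root, platform: str = sys.platform) -> list:
    """List the required files and folders that are absent under `root`."""
    root = Path(root)
    missing = [name for name in REQUIRED_FILES if not (root / name).is_file()]
    missing += [name for name in REQUIRED_DIRS if not (root / name).is_dir()]
    if platform.startswith("win"):
        # ffmpeg.exe is only needed when FFmpeg is not already on PATH.
        has_ffmpeg = (root / "ffmpeg.exe").is_file() or shutil.which("ffmpeg") is not None
        if not has_ffmpeg:
            missing.append("ffmpeg.exe")
    return missing
```

Calling `missing_assets(".")` from the repository root returns an empty list once everything is in place.
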
Then use this command to start the WebUI:
```bash
python infer-web.py
```

If you are using Windows, you can download and extract `RVC-beta.7z` to use RVC directly, then run `go-web.bat` to start the WebUI.

We plan to release an English version of the WebUI within two weeks.

There is also a tutorial on RVC in Chinese; check it out if needed.

## Credits

## Thanks to all contributors for their efforts

<a href="https://github.com/liujing04/Retrieval-based-Voice-Conversion-WebUI/graphs/contributors" target="_blank">
<img src="https://contrib.rocks/image?repo=liujing04/Retrieval-based-Voice-Conversion-WebUI" />
</a>