Cool_Tools/Retrieval-based-Voice-Conversion-WebUI

mirror of synced 2024-11-14 18:57:39 +01:00

Voice data <= 10 mins can also be used to train a good VC model!

audio-analysis change conversational-ai conversion converter retrieval-model retrieve-data rvc sovits so-vits-svc vc vits voice voice-conversion voiceconversion voice-converter

liujing04 b951bdb895 Update infer-web.py		2023-04-14 13:56:50 +00:00
.github/workflows	Optimize the notebook, translate, and reintroduce automatic push (#48)	2023-04-13 13:32:08 +00:00
configs	Add files via upload	2023-03-31 17:49:09 +08:00
infer	fix: Segmentation fault: 11 during CPU-only inference on macOS	2023-04-10 18:28:39 +08:00
infer_pack	optimize: trim unused config options and add initial MPS support to feature extraction (#32)	2023-04-11 18:14:55 +08:00
locale	Fix hubert model choosing and unused faiss search (#46 )	2023-04-14 13:00:31 +00:00
logs/mute	Delete a	2023-04-09 16:00:56 +00:00
pretrained	Remove unused files, add --colab launch option	2023-04-01 15:02:53 +08:00
train	Merge pull request #7 from fumiama/main	2023-04-10 10:32:35 +08:00
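uvr5_pack	optimize: trim unused config options and add initial MPS support to feature extraction (#32)	2023-04-11 18:14:55 +08:00
uvr5_weights	Remove unused files, add --colab launch option	2023-04-01 15:02:53 +08:00
weights	Remove unused files, add --colab launch option	2023-04-01 15:02:53 +08:00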
.gitignore	fix: train step2a & add arg --port --pycmd --noparallel	2023-04-01 16:42:19 +08:00
Changelog_CN.md	Update Changelog_CN.md	2023-04-09 15:57:14 +00:00
config.py	fix: i18n preventing startup & add --noautoopen argument (#39)	2023-04-12 16:53:50 +08:00
envfilescheck.bat	Add a batch file for windows user to download requearment files automated by using aria2.	2023-04-04 00:23:39 +08:00
export_onnx.py	Add files via upload	2023-04-08 23:36:25 +08:00
extract_f0_print.py	Add files via upload	2023-04-13 23:57:27 +08:00
extract_feature_print.py	Add files via upload	2023-04-13 23:57:27 +08:00
extract_locale.py	fix: i18n preventing startup & add --noautoopen argument (#39)	2023-04-12 16:53:50 +08:00
go-web.bat	Add files via upload	2023-04-13 23:57:27 +08:00
gui.py	Fix hubert model choosing and unused faiss search (#46 )	2023-04-14 13:00:31 +00:00
infer_uvr5.py	optimize: trim unused config options and add initial MPS support to feature extraction (#32)	2023-04-11 18:14:55 +08:00
infer-web.py	Update infer-web.py	2023-04-14 13:56:50 +00:00
LICENSE	Initial commit	2023-03-27 17:59:11 +08:00
my_utils.py	Update my_utils.py	2023-04-09 16:05:04 +00:00
poetry.lock	add poetry dependency file	2023-04-05 10:44:07 +08:00
pyproject.toml	add poetry dependency file	2023-04-05 10:44:07 +08:00
README.en.md	Make the README well-organized, and all that (#42)	2023-04-12 13:33:11 +00:00
README.ja.md	Polish the README and its Japanese translation (#52)	2023-04-14 14:59:05 +08:00
README.md	Polish the README and its Japanese translation (#52)	2023-04-14 14:59:05 +08:00
requirements-win-for-realtime_vc_gui.txt	Fix GUI i18n (#38 )	2023-04-12 14:28:28 +08:00
requirements.txt	edit README	2023-04-10 18:46:58 +08:00
Retrieval_based_Voice_Conversion_WebUI.ipynb	Optimize the notebook, translate, and reintroduce automatic push (#48)	2023-04-13 13:32:08 +00:00
slicer2.py	optimize: trim unused config options and add initial MPS support to feature extraction (#32)	2023-04-11 18:14:55 +08:00
train_nsf_sim_cache_sid_load_pretrain.py	Update train_nsf_sim_cache_sid_load_pretrain.py	2023-04-10 11:08:26 +08:00
trainset_preprocess_pipeline_print.py	Add files via upload	2023-04-13 23:57:27 +08:00
vc_infer_pipeline.py	Add files via upload	2023-04-13 23:57:27 +08:00
webui_locale.py	Add files via upload	2023-04-13 23:57:27 +08:00
使用需遵守的协议-LICENSE.txt	fix: train step2a & add arg --port --pycmd --noparallel	2023-04-01 16:42:19 +08:00
小白简易教程.doc	Add files via upload	2023-03-31 18:52:30 +08:00

README.en.md

Retrieval-based-Voice-Conversion-WebUI

An easy-to-use SVC framework based on VITS.

Changelog

English | 中文简体 | 日本語

Check our Demo Video here!

Realtime voice conversion software using RVC: w-okada/voice-changer

Summary

This repository has the following features:

Reduces tone leakage by replacing each source feature with the closest training-set feature, found by top-1 retrieval;
Fast, easy training, even on relatively low-end graphics cards;
Good results even with a small amount of training data (at least 10 minutes of low-noise speech is recommended);
Supports model fusion to change timbre (ckpt processing tab -> ckpt merge);
Easy-to-use WebUI;
Uses the UVR5 model to quickly separate vocals from instruments.
The pre-trained base model is trained on nearly 50 hours of the high-quality, open-source VCTK dataset. High-quality licensed song datasets will be added to the training set over time, so you can use the models without worrying about copyright infringement.
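The top-1 retrieval idea in the feature list can be illustrated with a small sketch. This is not the repository's implementation: the project depends on faiss, whereas the code below uses a brute-force nearest-neighbour search in pure Python, and the names `top1_replace` and `ratio` are hypothetical. It only shows the principle: each source feature frame is swapped for (or blended with) its nearest neighbour among the training-set features.

```python
from typing import List, Sequence

Vector = Sequence[float]


def squared_distance(a: Vector, b: Vector) -> float:
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def top1_replace(source: List[Vector], index: List[Vector],
                 ratio: float = 1.0) -> List[List[float]]:
    """Replace each source frame with its nearest training-set frame.

    `ratio` is a hypothetical blending knob: 1.0 means full replacement,
    0.0 leaves the source frame unchanged.
    """
    if not index:
        raise ValueError("the training-set index must not be empty")
    result = []
    for frame in source:
        # Top-1 retrieval: the single closest training-set frame.
        nearest = min(index, key=lambda cand: squared_distance(frame, cand))
        result.append([ratio * n + (1.0 - ratio) * s
                       for n, s in zip(nearest, frame)])
    return result


# Toy 2-dimensional features; real feature vectors are much larger.
training_features = [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]]
source_features = [[0.9, 0.1], [0.2, 0.8]]
print(top1_replace(source_features, training_features))
# prints [[1.0, 0.0], [0.0, 1.0]]
```

Because every output frame is drawn from the training set, less of the source speaker's timbre can carry over into the converted features, which is the intuition behind the reduced tone leakage.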

Preparing the environment

We recommend installing the dependencies with Poetry.

Run the following commands in an environment with Python 3.8 or higher:

# Install PyTorch-related core dependencies, skip if installed
# Reference: https://pytorch.org/get-started/locally/
pip install torch torchvision torchaudio

# On Windows with an Nvidia Ampere GPU (RTX 30xx), specify the CUDA version that matches your PyTorch build; see https://github.com/liujing04/Retrieval-based-Voice-Conversion-WebUI/issues/21

pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu117

# Install the Poetry dependency management tool, skip if installed
# Reference: https://python-poetry.org/docs/#installation
curl -sSL https://install.python-poetry.org | python3 -

# Install the project dependencies
poetry install

You can also install the dependencies with pip.

Note: faiss 1.7.2 raises Segmentation fault: 11 on macOS. If you are on macOS, change the corresponding line in requirements.txt to faiss-cpu==1.7.0 before installing.

pip install -r requirements.txt
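The macOS faiss pin described above can also be applied with a short script instead of by hand. The following is a minimal sketch: the exact form of the faiss line in requirements.txt is not shown here, so the pattern assumes it starts with `faiss` or `faiss-cpu`, optionally followed by a version specifier. The function name `pin_faiss_for_macos` is hypothetical.

```python
import re

# Assumption: the requirement is written as "faiss" or "faiss-cpu",
# optionally followed by a version specifier such as "==1.7.2".
_FAISS_LINE = re.compile(r"^\s*faiss(-cpu)?\s*([=<>~!].*)?$", re.IGNORECASE)


def pin_faiss_for_macos(requirements_text: str,
                        pinned: str = "faiss-cpu==1.7.0") -> str:
    """Return the requirements text with the faiss line replaced by the pin."""
    lines = requirements_text.splitlines()
    patched = [pinned if _FAISS_LINE.match(line) else line for line in lines]
    # Preserve a trailing newline if the input had one.
    trailing = "\n" if requirements_text.endswith("\n") else ""
    return "\n".join(patched) + trailing
```

To use it, read requirements.txt, pass its contents through `pin_faiss_for_macos`, write the result back, and then run the pip command above. Lines that do not match, including other packages whose names merely begin with `faiss-`, are left untouched.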

Preparation of other Pre-models

RVC requires several additional pre-trained models for inference and training.

You need to download them from our Hugging Face space.

Here is a list of the pre-models and other files that RVC needs:

hubert_base.pt

./pretrained 

./uvr5_weights

# If you are using Windows, you may also need this file; skip if FFmpeg is already installed
ffmpeg.exe
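Before starting the WebUI, it can help to confirm that the files listed above are in place. The sketch below checks for `hubert_base.pt` and for non-empty `pretrained` and `uvr5_weights` folders relative to the repository root. It is an illustration only: the function name `missing_assets` is hypothetical, and treating a folder that holds only hidden files (such as a placeholder `.gitignore`) as empty is an assumption. `ffmpeg.exe` is left out because it is optional.

```python
from pathlib import Path
from typing import List

# Names taken from the list above.
REQUIRED_FILES = ["hubert_base.pt"]
REQUIRED_DIRS = ["pretrained", "uvr5_weights"]


def missing_assets(root: str = ".") -> List[str]:
    """Return the required files and folders that are absent under `root`."""
    base = Path(root)
    missing = [name for name in REQUIRED_FILES if not (base / name).is_file()]
    for name in REQUIRED_DIRS:
        folder = base / name
        # Assumption: a folder counts as populated only if it contains
        # at least one entry whose name does not start with a dot.
        populated = folder.is_dir() and any(
            not entry.name.startswith(".") for entry in folder.iterdir()
        )
        if not populated:
            missing.append(name + "/")
    return missing


if __name__ == "__main__":
    for item in missing_assets("."):
        print("missing:", item)
```

Run it from the repository root; if nothing is printed, the listed files and folders are in place.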

Then start the WebUI with this command:

python infer-web.py

If you are using Windows, you can instead download and extract RVC-beta.7z, which lets you use RVC directly, and start the WebUI with go-web.bat.

We plan to release an English version of the WebUI within two weeks.

A Chinese-language tutorial on RVC is also available if you need it.

Credits

Thanks to all contributors for their efforts