ultimatevocalremovergui/README.md

162 lines
8.1 KiB
Markdown
Raw Normal View History

2022-07-23 11:03:35 +02:00
# Ultimate Vocal Remover GUI v5.4.0
<img src="https://raw.githubusercontent.com/Anjok07/ultimatevocalremovergui/master/img/UVR_v54.png?raw=true" />
2022-05-11 02:54:58 +02:00
[![Release](https://img.shields.io/github/release/anjok07/ultimatevocalremovergui.svg)](https://github.com/anjok07/ultimatevocalremovergui/releases/latest)
[![Downloads](https://img.shields.io/github/downloads/anjok07/ultimatevocalremovergui/total.svg)](https://github.com/anjok07/ultimatevocalremovergui/releases)
2022-07-23 14:23:00 +02:00
English | [简体中文](README_CN.md)
2022-05-11 02:54:58 +02:00
## About
2022-07-28 06:03:53 +02:00
This application uses state-of-the-art source separation models to remove vocals from audio files. UVR's core developers trained all of the models provided in this package (except for the Demucs v3 4-stem models).
2022-05-11 02:54:58 +02:00
- **Core Developers**
2022-05-11 08:41:49 +02:00
- [Anjok07](https://github.com/anjok07)
- [aufr33](https://github.com/aufr33)
2022-05-11 02:54:58 +02:00
2022-07-26 10:17:07 +02:00
- **Support the Project**
2022-07-28 05:36:12 +02:00
- [Donate](https://www.buymeacoffee.com/uvr5)
2022-07-26 10:17:07 +02:00
2022-05-11 02:54:58 +02:00
## Installation
### Windows Installation
2022-07-23 12:13:49 +02:00
This installation bundle contains the UVR interface, Python, PyTorch, and other dependencies needed to run the application effectively. No prerequisites are required.
2022-05-11 02:54:58 +02:00
2022-05-11 23:00:33 +02:00
- Please Note:
2022-05-11 21:31:33 +02:00
- This installer is intended for those running Windows 10 or higher.
- Application functionality for systems running Windows 7 or lower is not guaranteed.
2022-07-23 12:13:49 +02:00
- Application functionality for Intel Pentium & Celeron CPUs systems is not guaranteed.
2022-05-11 21:27:48 +02:00
2022-06-14 00:49:26 +02:00
- Download the UVR installer via the link below:
2022-07-26 07:51:48 +02:00
- [Main Download Link](https://uvr.uvr.workers.dev/UVR_v5.4.0_setup.exe)
2022-08-01 02:43:18 +02:00
- [Main Download Link mirror](https://www.mediafire.com/file/nrakuh8t8p993y8/UVR_v5.4.0_setup.exe)
2022-07-26 07:51:48 +02:00
- Update Package instructions for those who have UVR already installed:
2022-07-24 11:47:32 +02:00
- [Update Package](https://github.com/Anjok07/ultimatevocalremovergui/releases/download/v5.4.0/UVR_v5.4_Update_Package.exe)
2022-05-11 08:41:49 +02:00
- **Optional**
2022-07-23 12:13:49 +02:00
- Additional models and application patches can be downloaded via the "Settings" menu within the application.
2022-05-11 02:54:58 +02:00
2022-07-23 11:03:35 +02:00
- **Please Note:** See the latest release page for more recent updates [here](https://github.com/Anjok07/ultimatevocalremovergui/releases/tag/v5.4.0)
2022-05-13 01:58:38 +02:00
2022-05-11 21:27:48 +02:00
### Other Platforms
This application can be run on Mac & Linux by performing a manual install (see the **Manual Developer Installation** section below for more information). Some features may not be available on non-Windows platforms.
2022-05-11 02:54:58 +02:00
## Application Manual
**General Options**
2022-07-23 11:03:35 +02:00
<img src="https://github.com/Anjok07/ultimatevocalremovergui/blob/master/img/gen_opt.png?raw=true" />
2022-05-11 02:54:58 +02:00
**VR Architecture Options**
2022-07-23 11:03:35 +02:00
<img src="https://github.com/Anjok07/ultimatevocalremovergui/blob/master/img/vr_opt.png?raw=true" />
2022-05-11 02:54:58 +02:00
**MDX-Net Options**
2022-06-13 10:13:28 +02:00
<img src="https://github.com/Anjok07/ultimatevocalremovergui/blob/master/img/mdx_opt.png?raw=true" />
**Demucs v3 Options**
2022-07-23 11:03:35 +02:00
<img src="https://github.com/Anjok07/ultimatevocalremovergui/blob/master/img/demucs_opt.png?raw=true" />
2022-05-11 02:54:58 +02:00
**Ensemble Options**
2022-06-13 10:13:28 +02:00
<img src="https://github.com/Anjok07/ultimatevocalremovergui/blob/master/img/ense_opt.png?raw=true" />
2022-05-11 02:54:58 +02:00
**User Ensemble**
2022-06-13 10:13:28 +02:00
<img src="https://github.com/Anjok07/ultimatevocalremovergui/blob/master/img/user_ens_opt.png?raw=true" />
2022-05-11 02:54:58 +02:00
2022-05-11 23:00:33 +02:00
### Other Application Notes
2022-05-11 02:54:58 +02:00
2022-05-11 23:00:33 +02:00
- Nvidia GPUs with at least 8GBs of V-RAM are recommended.
- This application is only compatible with 64-bit platforms.
2022-05-11 21:27:48 +02:00
- This application relies on Sox - Sound Exchange for Noise Reduction.
2022-05-11 21:00:07 +02:00
- This application relies on FFmpeg to process non-wav audio files.
2022-05-11 23:00:33 +02:00
- The application will automatically remember your settings when closed.
- Conversion times will significantly depend on your hardware.
2022-05-12 10:15:29 +02:00
- These models are computationally intensive. Please proceed with caution and pay attention to your PC to ensure it doesn't overheat. ***We are not responsible for any hardware damage.***
2022-05-11 03:19:47 +02:00
## Change Log
- **v4 vs. v5**
- The v5 models significantly outperform the v4 models.
2022-05-11 08:41:49 +02:00
- The extraction's aggressiveness can be adjusted using the "Aggression Setting." The default value of 10 is optimal for most tracks.
2022-05-11 03:19:47 +02:00
- All v2 and v4 models have been removed.
2022-05-11 08:41:49 +02:00
- Ensemble Mode added - This allows the user to get the most robust result from each model.
2022-05-11 03:19:47 +02:00
- Stacked models have been entirely removed.
2022-05-11 08:41:49 +02:00
The new aggression setting and model ensembling have replaced the stacked model feature.
2022-05-11 03:19:47 +02:00
- The NFFT, HOP_SIZE, and SR values are now set internally.
- The MDX-NET AI engine and models have been added.
- This is a brand new feature added to the UVR GUI.
2022-05-12 10:15:29 +02:00
- 4 MDX-Net models are included in this package.
- The MDX-Net models provided were trained by the core UVR developers
2022-05-11 08:41:49 +02:00
- This network is less resource-intensive but incredibly powerful.
- MDX-Net is a Hybrid Waveform/Spectrogram network.
2022-06-15 21:18:40 +02:00
- The Demucs v3 AI engine and models have been added.
- The ability to separate all 4 stems through Demucs v3.
2022-05-11 03:19:47 +02:00
2022-05-11 02:54:58 +02:00
## Troubleshooting
### Common Issues
- If FFmpeg is not installed, the application will throw an error if the user attempts to convert a non-WAV file.
2022-05-12 10:15:29 +02:00
- Memory allocation errors can usually be resolved by lowering the "Chunk Size".
2022-05-11 02:54:58 +02:00
### Issue Reporting
2022-05-11 23:00:33 +02:00
Please be as detailed as possible when posting a new issue.
2022-06-15 21:18:40 +02:00
If possible, click the "Settings Button" to the left of the "Start Processing" button and click the "Error Log" button for detailed error information that can be provided to us.
2022-05-11 02:54:58 +02:00
2022-05-11 21:00:07 +02:00
## Manual Installation (For Developers)
These instructions are for those installing UVR v5.2.0 **manually** only.
1. Download & install Python 3.9 or lower (but no lower than 3.6) [here](https://www.python.org/downloads/)
- **Note:** Ensure the *"Add Python to PATH"* box is checked
2022-05-11 23:22:31 +02:00
2. Download the Source code [here](https://github.com/Anjok07/ultimatevocalremovergui/archive/refs/heads/master.zip)
2022-07-23 11:03:35 +02:00
3. Download the models via the "Settings" menu within the application.
2022-05-11 21:00:07 +02:00
4. Extract the *ultimatevocalremovergui-master* folder within ultimatevocalremovergui-master.zip where ever you wish.
2022-07-23 11:03:35 +02:00
5. Download the SoX archive [here](https://sourceforge.net/projects/sox/files/sox/14.4.2/sox-14.4.2-win32.zip/download) and extract the contents into the *ultimatevocalremovergui-master/lib_v5/sox* directory.
6. Open the command prompt from the ultimatevocalremovergui-master directory and run the following commands, separately -
2022-05-11 21:00:07 +02:00
```
pip install --no-cache-dir -r requirements.txt
```
```
pip install torch==1.9.0+cu111 torchvision==0.10.0+cu111 torchaudio==0.9.0 -f https://download.pytorch.org/whl/torch_stable.html
```
2022-05-12 07:24:32 +02:00
From here you should be able to open and run the UVR.py file
2022-05-11 21:00:07 +02:00
2022-05-12 07:24:32 +02:00
- FFmpeg
2022-05-11 21:00:07 +02:00
2022-05-12 07:33:10 +02:00
- FFmpeg must be installed and configured for the application to process any track that isn't a *.wav* file. You will need to look up instruction on how to configure it on your operating system.
2022-05-11 21:00:07 +02:00
2022-05-11 02:54:58 +02:00
## License
The **Ultimate Vocal Remover GUI** code is [MIT-licensed](LICENSE).
2022-05-11 03:19:47 +02:00
- **Please Note:** For all third-party application developers who wish to use our models, please honor the MIT license by providing credit to UVR and its developers.
## Credits
2022-05-11 21:00:07 +02:00
- [DilanBoskan](https://github.com/DilanBoskan) - Your contributions at the start of this project were essential to the success of UVR. Thank you!
2022-06-15 21:18:40 +02:00
- [Bas Curtiz](https://www.youtube.com/user/bascurtiz) - Designed the official UVR logo, icon, banner, and splash screen.
2022-05-11 21:00:07 +02:00
- [tsurumeso](https://github.com/tsurumeso) - Developed the original VR Architecture code.
- [Kuielab & Woosung Choi](https://github.com/kuielab) - Developed the original MDX-Net AI code.
2022-05-13 10:35:56 +02:00
- [Adefossez & Demucs](https://github.com/facebookresearch/demucs) - Developed the original Demucs AI code.
2022-07-25 08:32:23 +02:00
- [KimberleyJSN](https://github.com/KimberleyJensen) - Advised and aided the implementation of the training scripts for MDX-Net and Demucs. Thank you!
2022-05-11 21:27:48 +02:00
- [Hv](https://github.com/NaJeongMo/Colab-for-MDX_B) - Helped implement chunks into the MDX-Net AI code. Thank you!
2022-05-11 02:54:58 +02:00
## Contributing
2022-06-15 21:18:40 +02:00
- For anyone interested in the ongoing development of **Ultimate Vocal Remover GUI**, please send us a pull request, and we will review it.
- This project is 100% open-source and free for anyone to use and modify as they wish.
2022-05-11 03:19:47 +02:00
- We only maintain the development and support for the **Ultimate Vocal Remover GUI** and the models provided.
2022-05-11 02:54:58 +02:00
## References
- [1] Takahashi et al., "Multi-scale Multi-band DenseNets for Audio Source Separation", https://arxiv.org/pdf/1706.09588.pdf