ultimatevocalremovergui/README.md

# Ultimate Vocal Remover GUI v4.0.0
<img src="https://raw.githubusercontent.com/Anjok07/ultimatevocalremovergui/beta/img/UVRVP4.png" />

[![Release](https://img.shields.io/github/release/anjok07/ultimatevocalremovergui.svg)](https://github.com/anjok07/ultimatevocalremovergui/releases/latest)
[![Downloads](https://img.shields.io/github/downloads/anjok07/ultimatevocalremovergui/total.svg)](https://github.com/anjok07/ultimatevocalremovergui/releases)

## About

This application is a GUI version of the vocal remover AI created and posted by GitHub user [tsurumeso](https://github.com/tsurumeso). You can find tsurumeso's original command line version [here](https://github.com/tsurumeso/vocal-remover). 

- **Special Thanks**
    - [tsurumeso](https://github.com/tsurumeso) - The engineer who authored the AI code. Thank you for the hard work and dedication you put into the AI application this GUI is built around!
    - [DilanBoskan](https://github.com/DilanBoskan) - The main GUI code contributor. Thank you for helping bring this GUI to life! Your hard work and continued support is greatly appreciated.

## Installation

The application was made with Tkinter for cross-platform compatibility, so it should work with Windows, Mac, and Linux systems. However, this application has only been tested on Windows 10 & Linux Ubuntu.

### Install Required Applications & Packages

1. Download & install Python 3.7 [here](https://www.python.org/ftp/python/3.7.0/python-3.7.0-amd64.exe) (Windows link)
    - **Note:** Ensure the *"Add Python 3.7 to PATH"* box is checked
2. Once Python has installed, download **Ultimate Vocal Remover GUI Version 4.0.0** [here](https://github.com/Anjok07/ultimatevocalremovergui/releases/download/v4.0.0/UVR-V4GUI.zip)
3. Place the UVR-V4GUI folder contained within the *.zip* file where ever you wish. 
    - Your documents folder or home directory is recommended for easy access.
4. From the UVR-V4GUI directory, open the Windows Command Prompt and run the following installs -

```
pip install --no-cache-dir -r requirements.txt
pip install torch==1.6.0+cu101 torchvision==0.7.0+cu101 -f https://download.pytorch.org/whl/torch_stable.html
```

### FFmpeg 

FFmpeg must be installed and configured in order for the application to be able to process any track that isn't a *.wav* file. Instructions for installing FFmpeg can be found on YouTube, WikiHow, Reddit, GitHub, and many other sources around the web.

- **Note:** If you are experiencing any errors when attempting to process any media files that are not in the *.wav* format, please ensure FFmpeg is installed & configured correctly.

### Running the Vocal Remover GUI & Models

- Open the file labeled *'VocalRemover.py'*.
   - It's recommended that you create a shortcut for the file labeled *'VocalRemover.py'* to your desktop for easy access.
     - **Note:** If you are unable to open the *'VocalRemover.py'* file, please go to the [**troubleshooting**](https://github.com/Anjok07/ultimatevocalremovergui/tree/beta#troubleshooting) section below.
- **Note:** All output audio files will be in the *'.wav'* format.

## Option Guide

### Choose AI Engine:

- This option allows you to toggle between tsurumeso's v2 & v4 AI engines. 
  - **Note:** Each engine comes with it's own set of models.
  - **Note:** The TTA option and the ability to set the N_FFT value is limited to the v4 engine only.
  
### Model Selections:

The v2 & v4 AI engines use different sets of models. When selected, the models available for v2 or v4 will automatically populate within the model selection dropdowns. 

- **Choose Main Model** - Here is where you choose the main model to perform a deep vocal removal.
  - Each of the models provided were trained on different parameters, though they can convert tracks of all genres. 
  - Each model differs in the way they process given tracks.  
     - The [*'Model Test Mode'*](https://github.com/Anjok07/ultimatevocalremovergui/tree/beta#checkboxes) option makes it easier for the user to test different models on given tracks.
- **Choose Stacked Model** - These models are meant to clean up vocal artifacts from instrumental outputs. 
  - The stacked models provided are only meant to process instrumental outputs created by a main model. 
  - Selecting the [*'Stack Passes'*](https://github.com/Anjok07/ultimatevocalremovergui/tree/beta#checkboxes) option will enable you to select a stacked model to run with a main model. 
    - If you wish to only run a stacked model on a track, make sure the [*'Stack Conversion Only'*](https://github.com/Anjok07/ultimatevocalremovergui/tree/beta#checkboxes) option is checked.
  - The wide range of main model/stacked model combinations gives the user more flexibility in discovering what model blend works best for the track(s) they are proessing.
    - To reiterate, the [*'Model Test Mode'*](https://github.com/Anjok07/ultimatevocalremovergui/tree/beta#checkboxes) option streamlines the process of testing different main model/stacked model combinations on a given track. More information on this option can be found in the next section.

### Checkboxes
- **GPU Conversion** - Selecting this option ensures the GPU is used to process conversions. 
  - **Note:** This option will not work if you don't have a Cuda compatible GPU.
    - Nividia GPU's are most compatible with Cuda.
  - **Note:** CPU conversions are much slower compared to those processed through the GPU. 
- **Post-process** - This option can potentially identify leftover instrumental artifacts within the vocal outputs. This option may improve the separation on *some* songs. 
  - **Note:** Having this option selected can potentially have an adverse effect on the conversion process, depending on the track. Because of this, it's only recommended as a last resort.
- **TTA** - This option performs Test-Time-Augmentation to improve the separation quality. 
  - **Note:** Having this selected will increase the time it takes to complete a conversion.
  - **Note:** This option is ***not*** compatible with the *v2* AI engine.
- **Output Image** - Selecting this option will include the spectrograms in *.jpg* format for the instrumental & vocal audio outputs.
- **Stack Passes** - This option activates the stacked model conversion process and allows the user to set the number of times a track runs through a stacked model.
  - **Note:** Unless you have the *'Save All Stacked Outputs'* option selected, the following outputs will be saved - 
    - Instrumental generated after the last stack pass & 
    - The vocal track generated by the main model
  - **Note:** The best range is 3-7 passes. 8 or more passes can result in degraded sound quality for the track.
- **Stack Conversion Only** - Selecting this option allows the user to bypass the main model and run a track through a stacked model only.
- **Save All Stacked Outputs** - Having this option selected will auto-generate a new folder named after the track being processed to your *'Save to'* path. The new folder will contain all of the outputs that were generated after each stack pass. The amount of audio outputs will depend on the number of stack passes chosen.
  - **Note:** Each output audio file will be appended with the number of passes it has had.
    - **Example:** If 5 stack passes are chosen, the application will provide you with all 5 pairs of audio outputs generated after each pass, if this option is enabled.
  - This option can be very useful in determining the optimal number of passes needed to clean a track.
  - The *'stacked vocal'* tracks will contain the audio of the vocal artifacts that were removed from the instrumental. 
    - These files can be used to verify artifact removal.
- **Model Test Mode** - This option makes it easier for users to test the results of different models, and model combinations, by eliminating the hassel of having to manually change the filenames and/or create new folders when processing the same track through multiple models. This option structures the model testing process.
  - When *'Model Test Mode'* is selected, the application will auto-generate a new folder in the *'Save to'* path you have chosen.
    - The new auto-generated folder will be named after the model(s) selected.
    - The output audio files will be saved to the auto-generated directory.
    - The filenames for the instrumental & vocal outputs will have the selected model(s) name(s) appended to them. 

### Parameter Values

All models released here will have the values they were trained with appended to the end of their filenames like so, **'MGM-HIGHEND_sr44100_hl512_w512_nf2048.pth'**. The *'_sr44100_hl512_w512_nf2048'* portion automatically sets the *SR*, *HOP LENGNTH*, *WINDOW SIZE*, & *N_FFT* values within the application. If there are no values appended to the end of a selected model filename, the *SR*, *HOP LENGNTH*, *WINDOW SIZE*, & *N_FFT* fields will be editable and auto-populate with default values. 

- **Default Values:**
  - **SR** - 44100
  - **HOP LENGTH** - 1024
  - **WINDOW SIZE** - 512
  - **N_FFT** - 2048

### Other Buttons:

- **Add New Model** - This button will automatically open the models folder. 
  - **Note:** If you are adding a new model, make sure to add it accordingly based on the AI engine it was trained on.
    - **Example:** If you wish to add a model trained on the v4 engine, add it to the correct folder located in the 'models/v4/' directory.
  - **Note:** The application will automatically detect any models added the correct directories without needing a restart.
- **Restart Button** - If the application hangs for any reason, you can hit the circular arrow button immediately to the right of the *'Start Conversion'* button.

## Models Included

All of the models included in the release were trained on large datasets containing diverse sets of music genres.

**PLEASE NOTE:** Do not change the name of the models provided! The required parameters are specified and appended to the end of the filenames.

Here's a list of the models included within the package -

- **v4 AI Engine**
    - **Main Models**
        - **MGM_MAIN_v4_sr44100_hl512_w512_nf2048.pth** - This is the main model that does an excellent job removing vocals from most tracks.
        - **MGM_LOWEND_A_v4_sr32000_hl512_w512_nf2048.pth** - This model focuses a bit more on removing vocals from lower frequencies.
        - **MGM_LOWEND_B_v4_sr33075_hl384_w512_nf2048.pth** - This is also a model that focuses on lower end frequencies, but trained with different parameters.
        - **MGM_HIGHEND_v4_sr44100_hl1024_nf2048.pth** - This model slightly focuses a bit more on higher end frequencies.
        - **MODEL_BVKARAOKE_by_aufr33_v4_sr33075_hl384_w512_nf1536.pth** - This is a beta model that removes main vocals while leaving background vocals intact.
    - **Stacked Models**
        - **StackedMGM_MM_v4_sr44100_hl512_w512_nf2048.pth** - This is a strong vocal artifact removal model. This model was made to run with *'MGM_MAIN_v4_sr44100_hl512_w512_nf2048.pth'*. However, any combination may yield the desired results.
        - **StackedMGM_MLA_v4_sr32000_hl512_w512_nf2048.pth** - This is a strong vocal artifact removal model. This model was made to run with *'MGM_MAIN_v4_sr44100_hl512_w512_nf2048.pth'*. However, any combination may yield a desired results.
        - **StackedMGM_LL_v4_sr32000_hl512_w512_nf2048.pth** - This is a strong vocal artifact removal model. This model was made to run with *'MGM_LOWEND_A_v4_sr32000_hl512_w512_nf2048.pth'*. However, any combination may yield a desired results.

- **v2 AI Engine**
    - **Main Models**
        - **Multi_Genre_Model_v2_sr44100_hl1024_w512.pth** - This model yields excellent results for most tracks processed through it.
    - **Stacked Models**
        - **StackedRegA_v2_sr44100_hl1024_w512.pth** - This is a standard vocal artifact removal model.
        - **StackedRegB_v2_sr44100_hl1024_w512.pth** - This is a standard vocal artifact removal model.
        - **StackedArg_v2_sr44100_hl1024_w512.pth** - This model removes vocal artifacts a bit more aggressively, but may greatly degrade the audio quality of the output audio. 

A special thank you to aufr33 for helping me expand the dataset used to train some of these models and for the helpful training tips.

## Other GUI Notes

- The application will automatically remember your *'save to'* path upon closing and reopening until it's changed.
  - **Note:** The last directory accessed within the application will also be remembered.
- Multiple conversions are supported.
- The ability to drag & drop audio files to convert has also been added.
- Conversion times will greatly depend on your hardware. 
  - **Note:** This application will *not* be friendly to older or budget hardware. Please proceed with caution! Pay attention to your PC and make sure it doesn't overheat. ***We are not responsible for any hardware damage.***

## Troubleshooting

### Common Issues

- This application is not compatible with 32-bit versions of Python. Please make sure your version of Python is 64-bit. 
- If FFmpeg is not installed, the application will throw an error if the user attempts to convert a non-WAV file.

### Issue Reporting

Please be as detailed as possible when posting a new issue. Make sure to provide any error outputs and/or screenshots/gif's to give us a clearer understanding of the issue you are experiencing.

If the *'VocalRemover.py'* file won't open *under any circumstances* and all other resources have been exhausted, please do the following - 

1. Open the cmd prompt from the UVR-V4GUI directory
2. Run the following command - 
```
python VocalRemover.py
```
3. Copy and paste the error output in the cmd prompt to the issues center on the GitHub repository.

## License

The **Ultimate Vocal Remover GUI** code is [MIT-licensed](LICENSE). 

## Contributing

- For anyone interested in the ongoing development of **Ultimate Vocal Remover GUI** please send us a pull request and we will review it. This project is 100% open-source and free for anyone to use and/or modify as they wish. 
- Please note that we do not maintain or directly support any of tsurumesos AI application code. We only maintain the development and support for the **Ultimate Vocal Remover GUI**. 

## References
- [1] Takahashi et al., "Multi-scale Multi-band DenseNets for Audio Source Separation", https://arxiv.org/pdf/1706.09588.pdf
Update README.md 2020-11-12 12:57:55 +01:00			`# Ultimate Vocal Remover GUI v4.0.0`
Update README.md 2020-11-13 09:38:50 +01:00			`<img src="https://raw.githubusercontent.com/Anjok07/ultimatevocalremovergui/beta/img/UVRVP4.png" />`
Update README.md 2020-11-10 12:20:37 +01:00
Update README.md 2020-11-09 19:49:25 +01:00			`[![Release](https://img.shields.io/github/release/anjok07/ultimatevocalremovergui.svg)](https://github.com/anjok07/ultimatevocalremovergui/releases/latest)`
Update README.md 2020-11-09 19:48:10 +01:00			`[![Downloads](https://img.shields.io/github/downloads/anjok07/ultimatevocalremovergui/total.svg)](https://github.com/anjok07/ultimatevocalremovergui/releases)`
Add files via upload 2020-11-09 11:40:12 +01:00
Update README.md 2020-11-10 10:21:11 +01:00			`## About`
Add files via upload 2020-11-09 11:40:12 +01:00
Update README.md 2020-11-11 02:29:24 +01:00			`This application is a GUI version of the vocal remover AI created and posted by GitHub user [tsurumeso](https://github.com/tsurumeso). You can find tsurumeso's original command line version [here](https://github.com/tsurumeso/vocal-remover).`
Add files via upload 2020-11-09 11:40:12 +01:00
Update README.md 2020-11-10 10:21:11 +01:00			`- Special Thanks`
Update README.md 2020-11-11 02:29:24 +01:00			`- [tsurumeso](https://github.com/tsurumeso) - The engineer who authored the AI code. Thank you for the hard work and dedication you put into the AI application this GUI is built around!`
Update README.md 2020-11-11 03:41:08 +01:00			`- [DilanBoskan](https://github.com/DilanBoskan) - The main GUI code contributor. Thank you for helping bring this GUI to life! Your hard work and continued support is greatly appreciated.`
Add files via upload 2020-11-09 11:40:12 +01:00
			`## Installation`

Update README.md 2020-11-11 02:29:24 +01:00			`The application was made with Tkinter for cross-platform compatibility, so it should work with Windows, Mac, and Linux systems. However, this application has only been tested on Windows 10 & Linux Ubuntu.`
Add files via upload 2020-11-09 11:40:12 +01:00
			`### Install Required Applications & Packages`

Update README.md 2020-11-10 10:21:11 +01:00			`1. Download & install Python 3.7 [here](https://www.python.org/ftp/python/3.7.0/python-3.7.0-amd64.exe) (Windows link)`
Update README.md 2020-11-11 03:41:08 +01:00			`- Note: Ensure the "Add Python 3.7 to PATH" box is checked`
Update README.md 2020-11-13 12:52:27 +01:00			`2. Once Python has installed, download Ultimate Vocal Remover GUI Version 4.0.0 [here](https://github.com/Anjok07/ultimatevocalremovergui/releases/download/v4.0.0/UVR-V4GUI.zip)`
Update README.md 2020-11-10 09:19:02 +01:00			`3. Place the UVR-V4GUI folder contained within the .zip file where ever you wish.`
Update README.md 2020-11-11 03:41:08 +01:00			`- Your documents folder or home directory is recommended for easy access.`
Update README.md 2020-11-10 04:56:35 +01:00			`4. From the UVR-V4GUI directory, open the Windows Command Prompt and run the following installs -`
Add files via upload 2020-11-09 11:40:12 +01:00
			```
Update README.md 2020-11-09 13:12:14 +01:00			`pip install --no-cache-dir -r requirements.txt`
Add files via upload 2020-11-09 11:40:12 +01:00			`pip install torch==1.6.0+cu101 torchvision==0.7.0+cu101 -f https://download.pytorch.org/whl/torch_stable.html`
			```

Update README.md 2020-11-11 03:59:27 +01:00			`### FFmpeg`

			`FFmpeg must be installed and configured in order for the application to be able to process any track that isn't a .wav file. Instructions for installing FFmpeg can be found on YouTube, WikiHow, Reddit, GitHub, and many other sources around the web.`

Update README.md 2020-11-11 10:33:17 +01:00			`- Note: If you are experiencing any errors when attempting to process any media files that are not in the .wav format, please ensure FFmpeg is installed & configured correctly.`
Update README.md 2020-11-11 03:59:27 +01:00
Update README.md 2020-11-10 04:56:35 +01:00			`### Running the Vocal Remover GUI & Models`
Add files via upload 2020-11-09 11:40:12 +01:00
Update README.md 2020-11-10 10:21:11 +01:00			`- Open the file labeled 'VocalRemover.py'.`
			`- It's recommended that you create a shortcut for the file labeled 'VocalRemover.py' to your desktop for easy access.`
Update README.md 2020-11-11 02:29:24 +01:00			`- Note: If you are unable to open the 'VocalRemover.py' file, please go to the [troubleshooting](https://github.com/Anjok07/ultimatevocalremovergui/tree/beta#troubleshooting) section below.`
Update README.md 2020-11-11 10:35:31 +01:00			`- Note: All output audio files will be in the '.wav' format.`
Add files via upload 2020-11-09 11:40:12 +01:00
			`## Option Guide`

			`### Choose AI Engine:`

			`- This option allows you to toggle between tsurumeso's v2 & v4 AI engines.`
Update README.md 2020-11-11 03:41:08 +01:00			`- Note: Each engine comes with it's own set of models.`
			`- Note: The TTA option and the ability to set the N_FFT value is limited to the v4 engine only.`

Add files via upload 2020-11-09 11:40:12 +01:00			`### Model Selections:`

Update README.md 2020-11-11 03:41:08 +01:00			`The v2 & v4 AI engines use different sets of models. When selected, the models available for v2 or v4 will automatically populate within the model selection dropdowns.`
Update README.md 2020-11-10 09:19:02 +01:00
Update README.md 2020-11-11 03:41:08 +01:00			`- Choose Main Model - Here is where you choose the main model to perform a deep vocal removal.`
Update README.md 2020-11-10 09:19:02 +01:00			`- Each of the models provided were trained on different parameters, though they can convert tracks of all genres.`
Update README.md 2020-11-11 10:01:40 +01:00			`- Each model differs in the way they process given tracks.`
Update README.md 2020-11-11 03:41:08 +01:00			`- The ['Model Test Mode'](https://github.com/Anjok07/ultimatevocalremovergui/tree/beta#checkboxes) option makes it easier for the user to test different models on given tracks.`
Update README.md 2020-11-10 09:19:02 +01:00			`- Choose Stacked Model - These models are meant to clean up vocal artifacts from instrumental outputs.`
			`- The stacked models provided are only meant to process instrumental outputs created by a main model.`
Update README.md 2020-11-11 02:29:24 +01:00			`- Selecting the ['Stack Passes'](https://github.com/Anjok07/ultimatevocalremovergui/tree/beta#checkboxes) option will enable you to select a stacked model to run with a main model.`
			`- If you wish to only run a stacked model on a track, make sure the ['Stack Conversion Only'](https://github.com/Anjok07/ultimatevocalremovergui/tree/beta#checkboxes) option is checked.`
Update README.md 2020-11-11 03:41:08 +01:00			`- The wide range of main model/stacked model combinations gives the user more flexibility in discovering what model blend works best for the track(s) they are proessing.`
			`- To reiterate, the ['Model Test Mode'](https://github.com/Anjok07/ultimatevocalremovergui/tree/beta#checkboxes) option streamlines the process of testing different main model/stacked model combinations on a given track. More information on this option can be found in the next section.`
Add files via upload 2020-11-09 11:40:12 +01:00
Update README.md 2020-11-10 10:21:11 +01:00			`### Checkboxes`
Update README.md 2020-11-11 03:41:08 +01:00			`- GPU Conversion - Selecting this option ensures the GPU is used to process conversions.`
			`- Note: This option will not work if you don't have a Cuda compatible GPU.`
			`- Nividia GPU's are most compatible with Cuda.`
			`- Note: CPU conversions are much slower compared to those processed through the GPU.`
Update README.md 2020-11-10 10:46:17 +01:00			`- Post-process - This option can potentially identify leftover instrumental artifacts within the vocal outputs. This option may improve the separation on some songs.`
Update README.md 2020-11-11 02:29:24 +01:00			`- Note: Having this option selected can potentially have an adverse effect on the conversion process, depending on the track. Because of this, it's only recommended as a last resort.`
Update README.md 2020-11-10 09:19:02 +01:00			`- TTA - This option performs Test-Time-Augmentation to improve the separation quality.`
Update README.md 2020-11-11 02:29:24 +01:00			`- Note: Having this selected will increase the time it takes to complete a conversion.`
Update README.md 2020-11-11 03:41:08 +01:00			`- Note: This option is *not* compatible with the v2 AI engine.`
			`- Output Image - Selecting this option will include the spectrograms in .jpg format for the instrumental & vocal audio outputs.`
			`- Stack Passes - This option activates the stacked model conversion process and allows the user to set the number of times a track runs through a stacked model.`
Update README.md 2020-11-11 10:33:17 +01:00			`- Note: Unless you have the 'Save All Stacked Outputs' option selected, the following outputs will be saved -`
			`- Instrumental generated after the last stack pass &`
			`- The vocal track generated by the main model`
Update README.md 2020-11-11 03:41:08 +01:00			`- Note: The best range is 3-7 passes. 8 or more passes can result in degraded sound quality for the track.`
Update README.md 2020-11-10 09:19:02 +01:00			`- Stack Conversion Only - Selecting this option allows the user to bypass the main model and run a track through a stacked model only.`
Update README.md 2020-11-11 10:33:17 +01:00			`- Save All Stacked Outputs - Having this option selected will auto-generate a new folder named after the track being processed to your 'Save to' path. The new folder will contain all of the outputs that were generated after each stack pass. The amount of audio outputs will depend on the number of stack passes chosen.`
			`- Note: Each output audio file will be appended with the number of passes it has had.`
Update README.md 2020-11-11 03:41:08 +01:00			`- Example: If 5 stack passes are chosen, the application will provide you with all 5 pairs of audio outputs generated after each pass, if this option is enabled.`
Update README.md 2020-11-10 09:19:02 +01:00			`- This option can be very useful in determining the optimal number of passes needed to clean a track.`
Update README.md 2020-11-11 10:33:17 +01:00			`- The 'stacked vocal' tracks will contain the audio of the vocal artifacts that were removed from the instrumental.`
			`- These files can be used to verify artifact removal.`
			`- Model Test Mode - This option makes it easier for users to test the results of different models, and model combinations, by eliminating the hassel of having to manually change the filenames and/or create new folders when processing the same track through multiple models. This option structures the model testing process.`
			`- When 'Model Test Mode' is selected, the application will auto-generate a new folder in the 'Save to' path you have chosen.`
Update README.md 2020-11-11 10:01:40 +01:00			`- The new auto-generated folder will be named after the model(s) selected.`
			`- The output audio files will be saved to the auto-generated directory.`
Update README.md 2020-11-11 03:41:08 +01:00			`- The filenames for the instrumental & vocal outputs will have the selected model(s) name(s) appended to them.`

			`### Parameter Values`

			`All models released here will have the values they were trained with appended to the end of their filenames like so, 'MGM-HIGHEND_sr44100_hl512_w512_nf2048.pth'. The '_sr44100_hl512_w512_nf2048' portion automatically sets the SR, HOP LENGNTH, WINDOW SIZE, & N_FFT values within the application. If there are no values appended to the end of a selected model filename, the SR, HOP LENGNTH, WINDOW SIZE, & N_FFT fields will be editable and auto-populate with default values.`

			`- Default Values:`
			`- SR - 44100`
			`- HOP LENGTH - 1024`
			`- WINDOW SIZE - 512`
			`- N_FFT - 2048`
Add files via upload 2020-11-09 11:40:12 +01:00
			`### Other Buttons:`

Update README.md 2020-11-11 03:41:08 +01:00			`- Add New Model - This button will automatically open the models folder.`
			`- Note: If you are adding a new model, make sure to add it accordingly based on the AI engine it was trained on.`
			`- Example: If you wish to add a model trained on the v4 engine, add it to the correct folder located in the 'models/v4/' directory.`
Update README.md 2020-11-11 02:29:24 +01:00			`- Note: The application will automatically detect any models added the correct directories without needing a restart.`
Update README.md 2020-11-10 05:02:53 +01:00			`- Restart Button - If the application hangs for any reason, you can hit the circular arrow button immediately to the right of the 'Start Conversion' button.`
Add files via upload 2020-11-09 11:40:12 +01:00
Update README.md 2020-11-10 10:21:11 +01:00			`## Models Included`
Add files via upload 2020-11-09 11:40:12 +01:00
Update README.md 2020-11-13 12:32:14 +01:00			`All of the models included in the release were trained on large datasets containing diverse sets of music genres.`

Update README.md 2020-11-10 10:46:17 +01:00			`PLEASE NOTE: Do not change the name of the models provided! The required parameters are specified and appended to the end of the filenames.`
Add files via upload 2020-11-09 11:40:12 +01:00
Update README.md 2020-11-10 10:46:17 +01:00			`Here's a list of the models included within the package -`
Update README.md 2020-11-09 11:42:47 +01:00
Update README.md 2020-11-13 12:32:14 +01:00			`- v4 AI Engine`
Update README.md 2020-11-10 10:46:17 +01:00			`- Main Models`
Update README.md 2020-11-13 12:32:14 +01:00			`- MGM_MAIN_v4_sr44100_hl512_w512_nf2048.pth - This is the main model that does an excellent job removing vocals from most tracks.`
			`- MGM_LOWEND_A_v4_sr32000_hl512_w512_nf2048.pth - This model focuses a bit more on removing vocals from lower frequencies.`
			`- MGM_LOWEND_B_v4_sr33075_hl384_w512_nf2048.pth - This is also a model that focuses on lower end frequencies, but trained with different parameters.`
			`- MGM_HIGHEND_v4_sr44100_hl1024_nf2048.pth - This model slightly focuses a bit more on higher end frequencies.`
			`- MODEL_BVKARAOKE_by_aufr33_v4_sr33075_hl384_w512_nf1536.pth - This is a beta model that removes main vocals while leaving background vocals intact.`
Update README.md 2020-11-10 10:46:17 +01:00			`- Stacked Models`
Update README.md 2020-11-13 12:32:14 +01:00			`- StackedMGM_MM_v4_sr44100_hl512_w512_nf2048.pth - This is a strong vocal artifact removal model. This model was made to run with 'MGM_MAIN_v4_sr44100_hl512_w512_nf2048.pth'. However, any combination may yield the desired results.`
			`- StackedMGM_MLA_v4_sr32000_hl512_w512_nf2048.pth - This is a strong vocal artifact removal model. This model was made to run with 'MGM_MAIN_v4_sr44100_hl512_w512_nf2048.pth'. However, any combination may yield a desired results.`
			`- StackedMGM_LL_v4_sr32000_hl512_w512_nf2048.pth - This is a strong vocal artifact removal model. This model was made to run with 'MGM_LOWEND_A_v4_sr32000_hl512_w512_nf2048.pth'. However, any combination may yield a desired results.`

			`- v2 AI Engine`
Update README.md 2020-11-10 10:46:17 +01:00			`- Main Models`
Update README.md 2020-11-13 12:32:14 +01:00			`- Multi_Genre_Model_v2_sr44100_hl1024_w512.pth - This model yields excellent results for most tracks processed through it.`
Update README.md 2020-11-10 10:46:17 +01:00			`- Stacked Models`
Update README.md 2020-11-13 12:32:14 +01:00			`- StackedRegA_v2_sr44100_hl1024_w512.pth - This is a standard vocal artifact removal model.`
			`- StackedRegB_v2_sr44100_hl1024_w512.pth - This is a standard vocal artifact removal model.`
			`- StackedArg_v2_sr44100_hl1024_w512.pth - This model removes vocal artifacts a bit more aggressively, but may greatly degrade the audio quality of the output audio.`
Update README.md 2020-11-10 05:02:53 +01:00
Update README.md 2020-11-13 12:35:01 +01:00			`A special thank you to aufr33 for helping me expand the dataset used to train some of these models and for the helpful training tips.`
Update README.md 2020-11-10 04:56:35 +01:00
Update README.md 2020-11-10 10:21:11 +01:00			`## Other GUI Notes`

Update README.md 2020-11-11 03:41:08 +01:00			`- The application will automatically remember your 'save to' path upon closing and reopening until it's changed.`
			`- Note: The last directory accessed within the application will also be remembered.`
Update README.md 2020-11-10 10:21:11 +01:00			`- Multiple conversions are supported.`
			`- The ability to drag & drop audio files to convert has also been added.`
Update README.md 2020-11-10 10:46:17 +01:00			`- Conversion times will greatly depend on your hardware.`
Update README.md 2020-11-11 02:29:24 +01:00			`- Note: This application will not be friendly to older or budget hardware. Please proceed with caution! Pay attention to your PC and make sure it doesn't overheat. *We are not responsible for any hardware damage.*`
Update README.md 2020-11-10 10:21:11 +01:00
			`## Troubleshooting`
Add files via upload 2020-11-09 11:40:12 +01:00
Update README.md 2020-11-11 10:52:29 +01:00			`### Common Issues`

			`- This application is not compatible with 32-bit versions of Python. Please make sure your version of Python is 64-bit.`
			`- If FFmpeg is not installed, the application will throw an error if the user attempts to convert a non-WAV file.`

			`### Issue Reporting`

Update README.md 2020-11-11 03:59:27 +01:00			`Please be as detailed as possible when posting a new issue. Make sure to provide any error outputs and/or screenshots/gif's to give us a clearer understanding of the issue you are experiencing.`

Update README.md 2020-11-10 10:59:02 +01:00			`If the 'VocalRemover.py' file won't open under any circumstances and all other resources have been exhausted, please do the following -`
Add files via upload 2020-11-09 11:40:12 +01:00
			`1. Open the cmd prompt from the UVR-V4GUI directory`
			`2. Run the following command -`
			```
Update README.md 2020-11-09 11:53:22 +01:00			`python VocalRemover.py`
Add files via upload 2020-11-09 11:40:12 +01:00			```
Update README.md 2020-11-10 05:02:53 +01:00			`3. Copy and paste the error output in the cmd prompt to the issues center on the GitHub repository.`
Add files via upload 2020-11-09 11:40:12 +01:00
Update README.md 2020-11-10 10:21:11 +01:00			`## License`
Add files via upload 2020-11-09 11:40:12 +01:00
Update README.md 2020-11-10 10:21:11 +01:00			`The Ultimate Vocal Remover GUI code is [MIT-licensed](LICENSE).`

			`## Contributing`

Update README.md 2020-11-11 02:29:24 +01:00			`- For anyone interested in the ongoing development of Ultimate Vocal Remover GUI please send us a pull request and we will review it. This project is 100% open-source and free for anyone to use and/or modify as they wish.`
			`- Please note that we do not maintain or directly support any of tsurumesos AI application code. We only maintain the development and support for the Ultimate Vocal Remover GUI.`
Add files via upload 2020-11-09 11:40:12 +01:00
			`## References`
Update README.md 2020-11-10 10:21:11 +01:00			`- [1] Takahashi et al., "Multi-scale Multi-band DenseNets for Audio Source Separation", https://arxiv.org/pdf/1706.09588.pdf`