From f975550ed96128ba628fdac646b20b0fe75f73c2 Mon Sep 17 00:00:00 2001 From: aufr33 <65520685+aufr33@users.noreply.github.com> Date: Sun, 22 Aug 2021 06:18:25 +0300 Subject: [PATCH] Update README.md --- README.md | 10 +++++----- 1 file changed, 5 insertions(+), 5 deletions(-) diff --git a/README.md b/README.md index 9ffa960..c63e862 100644 --- a/README.md +++ b/README.md @@ -38,12 +38,12 @@ Please note, this version is based on vocal-remover 4.0.0 of tsurumeso's origina - New format of spectrograms. Instead of a single spectrogram with a fixed FFT size, combined spectrograms are now used. This version combines several different types of spectrograms within specific frequency ranges. This approach allowed for a clearer view of the high frequencies and good resolutions at low frequencies, thus allowing for more targeted vocal removals. - The arguments --sr, --n_fft, --hop_length are removed. JSON files are now used instead. - The following new features were added - - **--high_end_process** - This argument restores the high frequencies of the output audio. It is intended for models with a narrow bandwidth, 16 kHz and below. The 5 choices for this argument are: - - *none* - No processing (default) - - *bypass* - This copies the missing frequencies from the input. - - *correlation* - This also copies missing frequencies from the input, however, the magnitude of the copied frequency will depend on the magnitude of the generated instrumental's high frequencies. It will be removed in the final release. - - *mirroring* - This algorithm is more advanced than *correlation*. It uses the high frequencies from the input and mirrored instrumental's frequencies. + - **--high_end_process** - This argument restores the high-end frequencies of the output audio. In order to optimize performance, some models have bandwidths below 20 kHz. This argument allows you to choose one of 5 post-processing options: + - *mirroring* - This is the default value. This algorithm uses the high-end frequencies from the input and mirrored outout's frequencies. - *mirroring2* - This version of mirroring is optimized for better performance. + - *bypass* - This copies the missing frequencies from the input without any modification. + - *none* - No processing. + - *correlation* - Obsolete. It will be removed in the final release. - **--aggressiveness** - This argument allows you to set how strong the vocal removal will be. The range is 0.0-1.0 The higher the value, the more the vocals will be removed. Please note, the highest value can result in muddy sounding instrumentals depending on the track being converted, so this isn't always recommended. The default is 0.1. For the vocal model specifically, the recommended value is 0.5-0.6. - **--deepextraction** - This argument generates an additional instrumental output with deep artifact vocal removal. This option is experimental and is more suited for acoustic or other light types of tracks with stubborn vocals. Many others might sound bad.