-
I am currently looking for community recommendations on the best karaoke models. Could you please share your favorite model for creating karaoke files, Example: (me) separator = Separator(
output_single_stem='Instrumental',
output_format='MP3',
sample_rate=44100,
mdx_params={"hop_length": 1024, "segment_size": 256, "overlap": 0.25, "batch_size": 1, "enable_denoise": True}
)
separator.load_model(model_filename='model_bs_roformer_ep_317_sdr_12.9755.ckpt') Roformer model because this issue What about you? |
Beta Was this translation helpful? Give feedback.
Replies: 2 comments 1 reply
-
So, I actually make karaoke videos every day, and I built this helper tool to automate part of my process: and, because I'm pretty much the only person using it, I tend to update the CLI to default to my own preferred settings whenever those change 😅 for every track I start making, when I run
I choose those because:
There is no single model which is the best for all input audio! It really does depend quite a lot on the track you're working on. |
Beta Was this translation helpful? Give feedback.
-
Oh, and I always use FLAC for the input audio whenever I can find a lossless input audio for the track I'm making, and I always use FLAC as the output format because I want to avoid quality loss by re-encoding multiple times with lossy formats. Because for my use case I'm generating videos in H264/AAC MP4 format for the highest compatibility with video players / youtube etc. - whatever audio I output will be re-encoded to AAC at least once anyway. Avoiding multiple lossy encodings is vastly preferably for audio quality compared to encoding to e.g. MP3 and then re-encoding that lossy file to AAC afterwards. There's a bit more explanation here: https://interviewfor.red/en/transcodes.html and here: https://interviewfor.red/en/spectrals.html |
Beta Was this translation helpful? Give feedback.
So, I actually make karaoke videos every day, and I built this helper tool to automate part of my process:
https://github.com/karaokenerds/karaoke-prep
and, because I'm pretty much the only person using it, I tend to update the CLI to default to my own preferred settings whenever those change 😅
so, if you see here: https://github.com/karaokenerds/karaoke-prep/blob/main/karaoke_prep/utils/prep_cli.py#L62
for every track I start making, when I run
karaoke-prep
it basically runsaudio-separator
4 times, once for each of these models, all using default settings:model_mel_band_roformer_ep_3005_sdr_11.4360.ckpt
UVR_MDXNET_KARA_2.onnx
2_HP-UVR.pth
MDX23C-8KFFT-InstVoc_HQ_2.ckpt
I choose those…