Pass your audio in, AudioSR will make it high fidelity!
Work on all types of audio (e.g., music, speech, dog, raining, ...) & all sampling rates.
- Create an Anaconda environment:
conda create -n audiosr python=3.9
- Activate the environment:
conda activate audiosr
- Clone this repository to your local machine:
git clone https://github.com/ORI-Muchim/AudioSR-Upsampling.git
- Navigate to the cloned directory:
cd AudioSR-Upsampling
- Install the necessary dependencies:
pip install -r requirements.txt
Place the audio files as follows.
.mp3 or .wav files are okay.
AudioSR-Upsampling
├────datasets
│ ├───speaker0
│ │ ├────1.mp3
│ │ └────2.mp3
│ ├───speaker1
│ │ ├───1.wav
│ │ └───1.wav
│ ├───speaker2
│ │ ├────1.wav
│ └───└────1.wav
├────audiosr
├────.gitignore
├────main.py
├────Readme.md
└────requirements.txt
This is just an example, and it's okay to add more speakers.
python main.py
Thank you for falsewinnet for helping me create ./audiosr/__main__.py
.
If you find this repo useful, please consider citing:
@article{liu2023audiosr,
title={{AudioSR}: Versatile Audio Super-resolution at Scale},
author={Liu, Haohe and Chen, Ke and Tian, Qiao and Wang, Wenwu and Plumbley, Mark D},
journal={arXiv preprint arXiv:2309.07314},
year={2023}
}