Repository for Group Project of CS570, KAIST, 2023 Spring
Sorted by Korean alphabetical order
Please refer to the syllabus
Diffusion-GAN Model for Audio Synthesis
Our Diffusion-GAN mel-spectrogram results (100 samples) are in diffgan_output.zip. You can convert each mel-spectrogram .png file into a .wav file with mel2wav.py.
Use wav2mel.py to convert audio.wav into mel.png:
$ python wav2mel.py --input audio.wav --save mel.png
We utilized the publicly accessible source code of Diffwave and Diffusion-GAN for our project.
Our experimental setup is described in our report.
Use mel2wav.py to convert mel.png into output.wav:
$ python mel2wav.py --input mel.png --save output.wav
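Since diffgan_output.zip holds 100 samples, running the command above once per file is tedious. The following is a minimal sketch of a batch wrapper, not part of the repository. It assumes mel2wav.py accepts only the --input and --save flags shown above and is run from the repository root. The function names (build_commands, convert_archive) and the directory names (diffgan_output, diffgan_wav) are hypothetical, and the internal layout of the archive is not known, so the sketch searches for .png files recursively.

```python
import subprocess
import sys
import zipfile
from pathlib import Path


def build_commands(png_dir, wav_dir, script="mel2wav.py"):
    """Return one mel2wav.py command per .png file found under png_dir.

    Only the --input and --save flags from the README are used.
    Files in different subfolders that share a name would map to the
    same .wav path; this sketch does not handle that case.
    """
    png_dir, wav_dir = Path(png_dir), Path(wav_dir)
    commands = []
    for png in sorted(png_dir.rglob("*.png")):
        wav = wav_dir / (png.stem + ".wav")
        commands.append(
            [sys.executable, script, "--input", str(png), "--save", str(wav)]
        )
    return commands


def convert_archive(archive="diffgan_output.zip",
                    png_dir="diffgan_output",   # hypothetical folder name
                    wav_dir="diffgan_wav"):     # hypothetical folder name
    """Extract the archive, then run mel2wav.py on every extracted .png."""
    with zipfile.ZipFile(archive) as zf:
        zf.extractall(png_dir)
    Path(wav_dir).mkdir(parents=True, exist_ok=True)
    for cmd in build_commands(png_dir, wav_dir):
        # check=True stops at the first failed conversion
        subprocess.run(cmd, check=True)
```

Calling convert_archive() from the repository root would extract the archive and write one .wav file per mel-spectrogram into diffgan_wav. Passing script="wav2mel.py" to build_commands does not work as-is for the opposite direction, because the sketch only looks for .png inputs.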