This repo provides a uniform interface for developing various vocoders.
Configs:
- hifigan (baseline): HiFiGAN generator with UnivNet discriminators.
- bigvgan: BigVGAN generator.
- vocos: Vocos (ConvNext) generator.
- refinegan: RefineGAN generator.
- firefly-gan: ConvNext encoder + HiFiGAN generator.
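
As a rough illustration of how the configs above relate to one another, the list can be restated as a small lookup table. This is only a sketch: `VocoderConfig`, `CONFIGS`, and `describe` are hypothetical names invented for this example and are not part of this repo's API. Fields the list does not mention (for example, the discriminators used by configs other than `hifigan`) are left as `None` rather than guessed.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class VocoderConfig:
    """Hypothetical summary record for one config (not the repo's own class)."""
    name: str
    generator: str
    encoder: Optional[str] = None          # None: no encoder mentioned in the list
    discriminators: Optional[str] = None   # None: not stated in the list


# Entries restate the config list above; nothing here is read from the repo.
CONFIGS = {
    cfg.name: cfg
    for cfg in [
        VocoderConfig("hifigan", generator="HiFiGAN", discriminators="UnivNet"),
        VocoderConfig("bigvgan", generator="BigVGAN"),
        VocoderConfig("vocos", generator="Vocos (ConvNext)"),
        VocoderConfig("refinegan", generator="RefineGAN"),
        VocoderConfig("firefly-gan", generator="HiFiGAN", encoder="ConvNext"),
    ]
}


def describe(name: str) -> str:
    """Return a one-line description of a config; raises KeyError if unknown."""
    cfg = CONFIGS[name]
    parts = []
    if cfg.encoder:
        parts.append(f"{cfg.encoder} encoder")
    parts.append(f"{cfg.generator} generator")
    if cfg.discriminators:
        parts.append(f"{cfg.discriminators} discriminators")
    return f"{cfg.name}: " + " + ".join(parts)


if __name__ == "__main__":
    for config_name in CONFIGS:
        print(describe(config_name))
```

Keying the table by config name mirrors the idea of selecting a vocoder through a single uniform entry point; the actual config files in the repo may be organized differently.
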
- TIMM: https://github.com/huggingface/pytorch-image-models
- BigVGAN: https://github.com/NVIDIA/BigVGAN
- Vocos: https://github.com/charactr-platform/vocos
- UnivNet: https://github.com/mindslab-ai/univnet
- ConvNext: https://github.com/facebookresearch/ConvNeXt
- HiFiGAN: https://github.com/jik876/hifi-gan
- Fish Diffusion: https://github.com/fishaudio/fish-diffusion
- RefineGAN: https://arxiv.org/abs/2111.00962
- Encodec: https://github.com/facebookresearch/encodec
- EVA-GAN: https://arxiv.org/abs/2402.00892