A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)
-
Updated
Apr 2, 2023 - Python
A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)
PyTorch implementation of DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (focused on DiffSpeech)
Singing Voice Synthesis based on VITS, different from VISinger
Multispeaker Community Vocoder Model for DiffSinger
A fork of genon2nnsvs with modifications made for english speakers and diffsinger users
Convert the UTAU Voicebank to a configuration compatible with DiffSinger Dataset
A Streamlit-based web application that converts Japanese text (romaji/hiragana/katakana) into MIDI files with customizable parameters. Perfect for UTAU/DIFFSINGER and other voice synthesis development workflows.
Add a description, image, and links to the diffsinger topic page so that developers can more easily learn about it.
To associate your repository with the diffsinger topic, visit your repo's landing page and select "manage topics."