AI Voice Synthesis and stuff

awoo@burggit.moe · 1 year ago

awoo@burggit.moe · edit-2 1 year ago

Personal thought:

RVC is apparently the big thing right now, judging by the number of available models: https://docs.google.com/spreadsheets/d/1tAUaQrEHYgRsm1Lvrnj14HFHDwJWl0Bd9x0QePewNco/edit#gid=1227575351 Unfortunately, RVC is strictly a voice changer, so you either need to provide the source audio yourself or generate it with TTS if you want to hear your favorite anime girl whisper into your ears.

I’m a bit surprised by the lack of interest in TTS, but oh well, I guess people nowadays are more into virtual idols singing songs and shit.

Here is a modified webUI for RVC that incorporates Microsoft’s Edge TTS: https://huggingface.co/spaces/ArkanDash/rvc-models-new. Check the GitHub link there for the repo.
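If you would rather do the TTS half by hand and feed the result into RVC yourself, something like this rough sketch should work. It only builds the command line for the `edge-tts` CLI (from `pip install edge-tts`); the helper name, the default voice, and the output filename are my own placeholders, not anything taken from the webUI above. The RVC conversion step is not shown, since that depends on which RVC frontend you use.

```python
import shlex


def edge_tts_command(text, voice="ja-JP-NanamiNeural", out_path="tts_source.mp3"):
    """Build the argv for the edge-tts CLI.

    The helper name, default voice, and default output path are
    illustrative assumptions; swap in whatever voice and path you want.
    """
    return ["edge-tts", "--voice", voice, "--text", text, "--write-media", out_path]


# Build the command and print it in a copy-pasteable form.
cmd = edge_tts_command("Good morning, sensei.")
print(shlex.join(cmd))
```

Run the printed command in a shell (it needs internet access), then use the resulting audio file as the source audio for whatever RVC model you picked.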

Here is my test run using the Shiroko model found in the Google Sheet: