We present the conversions generated by our method to the 2018 voice converison challenge. Samples are provided for both Hub and Spoke tasks. The model was trained using 5 minutes of audio for each speaker. Each table cell links to the complete set of conversions for the matching source and target speakers.
TF1 | TF2 | TM1 | TM2 | |
---|---|---|---|---|
SF1 | SF1 → TF1 | SF1 → TF2 | SF1 → TM1 | SF1 → TM2 |
SF2 | SF2 → TF1 | SF2 → TF2 | SF2 → TM1 | SF2 → TM2 |
SM1 | SM1 → TF1 | SM1 → TF2 | SM1 → TM1 | SM1 → TM2 |
SM2 | SM2 → TF1 | SM2 → TF2 | SM2 → TM1 | SM2 → TM2 |