We present the conversions generated by our method to the 2018 voice converison challenge. Samples are provided for both Hub and Spoke tasks. The model was trained using 5 minutes of audio for each speaker. Each table cell links to the complete set of conversions for the matching source and target speakers.
| TF1 | TF2 | TM1 | TM2 | |
|---|---|---|---|---|
| SF1 | SF1 → TF1 | SF1 → TF2 | SF1 → TM1 | SF1 → TM2 |
| SF2 | SF2 → TF1 | SF2 → TF2 | SF2 → TM1 | SF2 → TM2 |
| SM1 | SM1 → TF1 | SM1 → TF2 | SM1 → TM1 | SM1 → TM2 |
| SM2 | SM2 → TF1 | SM2 → TF2 | SM2 → TM1 | SM2 → TM2 |