1. 3-D convolutional recurrent neural networks with attention model for speech emotion recognition;Chen;IEEE Signal Process. Lett.,2018
2. AGAIN-VC: A One-shot voice conversion using activation guidance and adaptive instance normalization;Chen,2021
3. CLUB: A Contrastive log-ratio upper bound of mutual information;Cheng,2020
4. One-shot voice conversion by separating speaker and content representations with instance normalization;Chou,2019
5. Multi-speaker and multi-domain emotional voice conversion using factorized hierarchical variational autoencoder;Elgaar,2020