1. Ba J.L., Kiros J.R., Hinton G.E., “Layer normalization,” arXiv preprint arXiv:1607.06450, 2016.
2. Baldi, “Understanding dropout,” 2013.
3. Bonaventura, “Phonetic annotation of a non-native speech corpus,” 2000.
4. Chen, “Completely unsupervised phoneme recognition by a generative adversarial network harmonized with iteratively refined hidden Markov models,” Interspeech, 2019.
5. Chou J., Yeh C., Lee H., et al., “Multi-target voice conversion without parallel data by adversarially learning disentangled audio representations,” arXiv preprint arXiv:1804.02812, 2018.