Author:
Shi Xian, Chen Yanni, Zhang Shiliang, Yan Zhijie
Publisher
Springer Nature Singapore
References: 21 articles.
1. Yang, Z., et al.: CaTT-KWS: A Multi-stage Customized Keyword Spotting Framework based on Cascaded Transducer-Transformer. arXiv preprint arXiv:2207.01267 (2022)
2. Ren, Y., et al.: FastSpeech: Fast, robust and controllable text to speech. In: Advances in Neural Information Processing Systems 32 (2019)
3. Labov, W., Rosenfelder, I., Fruehwald, J.: One hundred years of sound change in Philadelphia: Linear incrementation, reversal, and reanalysis. Language, pp. 30–65 (2013)
4. DiCanio, C., Nam, H., Whalen, D.H., Timothy Bunnell, H., Amith, J.D., García, R.C.: Using automatic alignment to analyze endangered language data: Testing the viability of untrained alignment. J. Acoustical Soc. Am. 134(3), 2235–2246 (2013)
5. Yuan, J., Liberman, M., Cieri, C.: Towards an integrated understanding of speaking rate in conversation. In: Ninth International Conference on Spoken Language Processing (2006)
Cited by
1 article.