Publisher
National Institute of Telecommunications
Reference
54 articles.
1. [1] J. Glass, "A probabilistic framework for segment-based speech recognition", Computer Speech & Language, vol. 17, no. 2-3, pp. 137-152, 2003 (DOI: 10.1016/S0885-2308(03)00006-8).
2. [2] D. T. Chappell and J. Hansen, "A comparison of spectral smoothing methods for segment concatenation based speech synthesis", Speech Commun., vol. 36, no. 3-4, pp. 343-373, 2002 (DOI: 10.1016/S0167-6393(01)00008-5).
3. [3] J. Adell and A. Bonafonte, "Towards phone segmentation for concatenative speech synthesis", in Proc. of the 5th ISCA Speech Synthesis Workshop (SSW5), Pittsburgh, PA, USA, 2004, pp. 139-144 [Online]. Available: https://nlp.lsi.upc.edu/papers/adell04b.pdf
4. [4] H. Wang, T. Lee, C. Leung, B. Ma, and H. Li, "Acoustic Segment Modeling with Spectral Clustering Methods", in IEEE/ACM Transac. on Audio, Speech, and Language Process., vol. 23, no. 2, pp. 264-277, 2015 (DOI: 10.1109/TASLP.2014.2387382).
5. [5] J. P. Hosom, "Speaker-independent phoneme alignment using transition-dependent states", Speech Commun., vol. 51, no. 4, pp. 352-368, 2008 (DOI: 10.1016/j.specom.2008.11.003).
Cited by
2 articles.