1. 3rd Generation Partnership Project. (2002). Technical Report 26.937 V1.2.0 Transparent end-to-end Packet-switched Streaming Service (PSS); Real-Time Transport Protocol (RTP) usage model.
http://www.3gpp.org
2. 3rd Generation Partnership Project. (2008). Technical Report 26.935 V8.0.0 (2008-12). Technical Specification Group Services and System Aspects; Packet-switched conversational multimedia applications; Performance characterization of default codecs (Release 8). http://www.3gpp.org
3. Ali, A. , & Renals, S. (2018). Word Error Rate Estimation for Speech Recognition: e-WER . In Proceedings of the 56th annual meeting of the Association for Computational Linguistics (Vol. 2: Short Papers, pp. 20–24). Association for Computational Linguistics. https://doi.org/10.18653/v1/P18-2004
4. Perceptual evaluation of speech quality (PESQ), the new ITU standard for end-to-end speech quality assessment, part II—Psychoacoustic model;Beerends J. G.;Journal of the Audio Engineering Society,2002
5. Beerends J. G. Larsen E. Iyer N. &
van Vugt J. M.
(2004). Measurement of speech intelligibility based on the PESQ approach. Proceedings of the Workshop Measurement of Speech and Audio Quality in Networks (MESAQIN) Prague Czech Republic.