1. Yu C (2017) Robust speaker modeling in non-neutral environments with application to large scale multi-speaker audio streams. Doctoral dissertation
2. Reynolds DA, Torres-Carrasquillo P (2005) Approaches and applications of audio diarization. In: Proceedings.(ICASSP'05). IEEE international conference on acoustics, speech, and signal processing, March 2005, vol 5, pp v-953. IEEE
3. Medennikov I, Korenevsky M, Prisyach T, Khokhlov Y, Korenevskaya M, Sorokin I, Timofeeva T, Mitrofanov A, Andrusenko A, Podluzhny I, Laptev A, Romanenko A (2020) Target-speaker voice activity detection: a novel approach for multi-speaker diarization in a dinner party scenario. arXiv:2005.07272
4. Lechevrel N, Gábor K, Tellier I, Charnois T, Zargayouna H, Buscaldi D (2017). Combining syntactic and sequential patterns for unsupervised semantic relation extraction. In DMNLP workshop@ ECML-PKDD, August 2017, pp 81–84
5. Wang D, Chen J (2018) Supervised speech separation based on deep learning: an overview. IEEE/ACM Trans Audio Speech Lang Process 26(10):1702–1726