1. Yu, Automatic Speech Recognition, vol. 1, 2016.
2. R. Tanabe, T. Endo, Y. Nikaido, T. Ichige, P. Nguyen, Y. Kawaguchi, K. Hamada, Multichannel acoustic scene classification by blind dereverberation, blind source separation, data augmentation, and model ensembling, 2018, DCASE 2018 Challenge.
3. Sigtia, An end-to-end neural network for polyphonic piano music transcription, 2016, IEEE/ACM Trans. Audio Speech Lang. Process.
4. Kelz, On the potential of simple framewise approaches to piano transcription, 2016.
5. Hawthorne, Onsets and frames: dual-objective piano transcription, 2018.