1. Arora, 2012. On-line melody extraction from polyphonic audio using harmonic cluster tracking. IEEE Transactions on Audio, Speech, and Language Processing.
2. Bahdanau, D., Cho, K., Bengio, Y., 2014. Neural machine translation by jointly learning to align and translate. arXiv preprint arXiv:1409.0473.
3. Bahdanau, D., Chorowski, J., Serdyuk, D., Brakel, P., Bengio, Y., 2016. End-to-end attention-based large vocabulary speech recognition. In: 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, pp. 4945–4949.
4. Basaran, D., Essid, S., Peeters, G., 2018. Main melody extraction with source-filter NMF and CRNN. In: 19th International Society for Music Information Retrieval Conference (ISMIR).
5. Bittner, R.M., McFee, B., Salamon, J., Li, P., Bello, J.P., 2017. Deep salience representations for F0 estimation in polyphonic music. In: 18th International Society for Music Information Retrieval Conference (ISMIR). pp. 63–70.