1. Rudder: Return decomposition for delayed rewards;Arjona-Medina;NIPS,2019
2. Neural machine translation by jointly learning to align and translate;Bahdanau;ICLR,2015
3. Verifiable reinforcement learning via policy extraction;Bastani;NIPS,2018
4. Understanding the role of individual units in a deep neural network