1. Abadi, M., Agarwal, A., Barham, P., Brevdo, E., Chen, Z., Citro, C., et al. (2016). TensorFlow: Large-scale machine learning on heterogeneous distributed systems. arXiv:1603.04467v2 [cs.DC].
2. Effective features selection and machine learning classifiers for improved wireless intrusion detection;Abdulhammed,2018
3. Falsification of cyber-physical systems using deep reinforcement learning;Akazaki,2018
4. Deep reinforcement learning: A brief survey;Arulkumaran;IEEE Signal Processing Magazine,2017
5. Bartlett, P. L., & Baxter, J. (2011). Infinite-horizon policy-gradient estimation. arXiv:1106.0665 [cs.AI].