1. Greedy layer-wise training of deep networks;Bengio,2007
2. Nonlinear system identification: NARMAX methods in the time, frequency, and spatio-temporal domains;Billings,2013
3. Christopher M. Bishop, Mixture density networks. (1994).
4. Davis Blalock, et al. What is the state of neural network pruning? arXiv preprint arXiv:2003.03033 (2020).
5. Box, George EP, et al. Time series analysis: forecasting and control. John Wiley & Sons, 2015.