1. Andrychowicz, M., Denil, M., Gomez, S., et al. (2016). Learning to learn by gradient descent by gradient descent. Advances in Neural Information Processing Systems, 29, 1–10.
2. Ba, JL., Kiros, JR., & Hinton, GE. (2016). Layer normalization. arXiv:1607.06450.
3. Camci, F., & Chinnam, R. B. (2010). Health-state estimation and prognostics in machining processes. IEEE Transactions on Automation Science and Engineering, 7(3), 581–597.
4. Chen, J., Jing, H., Chang, Y., et al. (2019). Gated recurrent unit based recurrent neural network for remaining useful life prediction of nonlinear deterioration process. Reliability Engineering & System Safety, 185, 372–382.
5. Da Costa, PRd. O., Akçay, A., Zhang, Y., et al. (2020). Remaining useful lifetime prediction via deep domain adaptation. Reliability Engineering & System Safety, 195(106), 682.