1. From thinking too little to thinking too much: a continuum of decision making;Ariely;WIREs Cognitive Science,2011
2. Layer normalization;Ba,2016
3. PonderNet: Learning to ponder;Banino,2021
4. Estimating or propagating gradients through stochastic neurons for conditional computation;Bengio,2013
5. A comparative analysis of gradient boosting algorithms;Bentéjac;Artificial Intelligence Review,2020