Abstract
ABSTRACTConvolutional neural networks are rapidly gaining popularity in regulatory genomics. Typically, these networks have a stack of convolutional and pooling layers, followed by one or more fully connected layers. In genomics, the same positional patterns are often present across multiple convolutional channels. Therefore, in current state-of-the-art networks, there exists significant redundancy in the representations learned by standard fully connected layers. We present a new separable fully connected layer that learns a weights tensor that is the outer product of positional weights and cross-channel weights, thereby allowing the same positional patterns to be applied across multiple convolutional channels. Decomposing positional and cross-channel weights further enables us to readily impose biologically-inspired constraints on positional weights, such as symmetry. We also propose a novel regularizer and constraint that act on curvature in the positional weights. Using experiments on simulated and in vivo datasets, we show that networks that incorporate our separable fully connected layer outperform conventional models with analogous architectures and the same number of parameters. Additionally, our networks are more robust to hyperparameter tuning, have more informative gradients, and produce importance scores that are more consistent with known biology than conventional deep neural networks.AvailabilityImplementation: https://github.com/kundajelab/keras/tree/keras_1A gist illustrating model setup is at: goo.gl/gYooaa
Publisher
Cold Spring Harbor Laboratory
Reference9 articles.
1. Babak Alipanahi , Andrew Delong , Matthew T Weirauch , and Brendan J Frey . Predicting the sequence specificities of dna-and rna-binding proteins by deep learning. Nature biotechnology, 2015.
2. François Chollet. Keras. 2017.
3. ENCODE Project Consortium. An integrated encyclopedia of dna elements in the human genome. Nature, 2012.
4. M Ryan Corces , Jason D Buenrostro , Beijing Wu , Peyton G Greenside , Steven M Chan , Julie L Koenig , Michael P Snyder , Jonathan K Pritchard , Anshul Kundaje , William J Greenleaf , Ravindra Majeti , and Howard Y Chang . Lineage-specific and single-cell chromatin accessibility charts human hematopoiesis and leukemia evolution. Nature Genetics, 2016.
5. Xavier Glorot and Yoshua Bengio . Understanding the difficulty of training deep feedforward neural networks. The International Conference on Artificial Intelligence and Statistics (AISTAT), 2010.