Separable Fully Connected Layers Improve Deep Learning Models For Genomics

Published: 2017-06-05

Author:

Alexandari Amr Mohamed, Shrikumar Avanti, Kundaje Anshul

Abstract

Convolutional neural networks are rapidly gaining popularity in regulatory genomics. Typically, these networks have a stack of convolutional and pooling layers, followed by one or more fully connected layers. In genomics, the same positional patterns are often present across multiple convolutional channels. Therefore, in current state-of-the-art networks, there exists significant redundancy in the representations learned by standard fully connected layers. We present a new separable fully connected layer that learns a weights tensor that is the outer product of positional weights and cross-channel weights, thereby allowing the same positional patterns to be applied across multiple convolutional channels. Decomposing positional and cross-channel weights further enables us to readily impose biologically-inspired constraints on positional weights, such as symmetry. We also propose a novel regularizer and constraint that act on curvature in the positional weights. Using experiments on simulated and in vivo datasets, we show that networks that incorporate our separable fully connected layer outperform conventional models with analogous architectures and the same number of parameters. Additionally, our networks are more robust to hyperparameter tuning, have more informative gradients, and produce importance scores that are more consistent with known biology than conventional deep neural networks.

Availability

Implementation: https://github.com/kundajelab/keras/tree/keras_1

A gist illustrating model setup is at: goo.gl/gYooaa
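The outer-product idea in the abstract can be illustrated with a minimal pure-Python sketch. It is not the authors' Keras implementation: the function names, the single-output-unit rank-1 form, the averaging used for symmetry, and the squared second-difference penalty are all assumptions made for illustration, since the abstract does not give the exact formulation.

```python
# Illustrative sketch only. All names and the exact forms of the symmetry
# constraint and curvature penalty are assumptions, not the authors' code.

def outer(pos_w, chan_w):
    """Full weight matrix W[l][c] = pos_w[l] * chan_w[c] (outer product)."""
    return [[p * c for c in chan_w] for p in pos_w]

def dense_fc(x, w):
    """One output unit of a standard fully connected layer (no bias)."""
    return sum(w[l][c] * x[l][c]
               for l in range(len(w)) for c in range(len(w[0])))

def separable_fc(x, pos_w, chan_w):
    """One output unit of a rank-1 separable fully connected layer (no bias).

    x      : L positions, each a list of C channel activations
    pos_w  : L positional weights, shared across channels
    chan_w : C cross-channel weights, shared across positions
    """
    return sum(
        pos_w[l] * sum(chan_w[c] * x[l][c] for c in range(len(chan_w)))
        for l in range(len(pos_w))
    )

def symmetrize(pos_w):
    """Assumed symmetry constraint: average each weight with its mirror."""
    n = len(pos_w)
    return [(pos_w[i] + pos_w[n - 1 - i]) / 2 for i in range(n)]

def curvature_penalty(pos_w):
    """Assumed curvature regularizer: sum of squared second differences."""
    return sum((pos_w[i - 1] - 2 * pos_w[i] + pos_w[i + 1]) ** 2
               for i in range(1, len(pos_w) - 1))
```

In this sketch, `separable_fc(x, pos_w, chan_w)` gives the same value as `dense_fc(x, outer(pos_w, chan_w))`, but for L positions and C channels it stores L + C weights per output unit instead of L × C.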

Publisher

Cold Spring Harbor Laboratory

References: 9 articles.

1. Babak Alipanahi, Andrew Delong, Matthew T Weirauch, and Brendan J Frey. Predicting the sequence specificities of DNA- and RNA-binding proteins by deep learning. Nature Biotechnology, 2015.

2. François Chollet. Keras. 2017.

3. ENCODE Project Consortium. An integrated encyclopedia of DNA elements in the human genome. Nature, 2012.

4. M Ryan Corces, Jason D Buenrostro, Beijing Wu, Peyton G Greenside, Steven M Chan, Julie L Koenig, Michael P Snyder, Jonathan K Pritchard, Anshul Kundaje, William J Greenleaf, Ravindra Majeti, and Howard Y Chang. Lineage-specific and single-cell chromatin accessibility charts human hematopoiesis and leukemia evolution. Nature Genetics, 2016.

5. Xavier Glorot and Yoshua Bengio. Understanding the difficulty of training deep feedforward neural networks. The International Conference on Artificial Intelligence and Statistics (AISTATS), 2010.

Cited by 6 articles.

1. DNA sequence classification based on MLP with PILAE algorithm;Soft Computing;2020-11-21

2. Towards a Better Understanding of Reverse-Complement Equivariance for Deep Learning Models in Regulatory Genomics;2020-11-04

3. Deciphering regulatory DNA sequences and noncoding genetic variants using neural network models of massively parallel reporter assays;PLOS ONE;2019-06-17

4. Deciphering regulatory DNA sequences and noncoding genetic variants using neural network models of massively parallel reporter assays;2018-08-17

5. Modeling positional effects of regulatory sequences with spline transformations increases prediction accuracy of deep neural networks;2017-07-18