
Chromatin accessibility prediction via convolutional long short-term memory networks with k-mer embedding




Bioinformatics 33(14): I92



Experimental techniques for measuring chromatin accessibility are expensive and time-consuming, which motivates the development of computational approaches to predict open chromatin regions from DNA sequences. Existing methods in this direction fall into two classes: one based on handcrafted k-mer features and the other based on convolutional neural networks. Although both classes have performed well in specific applications, a comprehensive framework that integrates useful k-mer co-occurrence information with recent advances in deep learning is still lacking. We fill this gap by addressing chromatin accessibility prediction with a convolutional Long Short-Term Memory (LSTM) network with k-mer embedding. We first split DNA sequences into k-mers and pre-train k-mer embedding vectors from the k-mer co-occurrence matrix using an unsupervised representation learning approach. We then construct a supervised deep learning architecture comprising an embedding layer, three convolutional layers and a Bidirectional LSTM (BLSTM) layer for feature learning and classification. We demonstrate that our method obtains high-quality fixed-length features from variable-length sequences and consistently outperforms baseline methods. By exploring different embedding strategies, we show that k-mer embedding can effectively enhance model performance. We also demonstrate the efficacy of both the convolutional and the BLSTM layers by comparing two variations of the network architecture, and we confirm the robustness of our model to hyper-parameters through sensitivity analysis. We hope our method can eventually strengthen the understanding of how deep learning can be employed in genomic studies and shed light on research into the mechanisms of chromatin accessibility.

The source code can be downloaded from https://github.com/minxueric/ismb2017_lstm.

Contact: tingchen@tsinghua.edu.cn or ruijiang@tsinghua.edu.cn.
Supplementary materials are available at Bioinformatics online.
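
As a rough illustration of the first preprocessing step described in the abstract (splitting DNA sequences into k-mers and counting their co-occurrences), the following is a minimal sketch in plain Python. It is not the authors' code: the function names `split_kmers` and `cooccurrence_counts`, the values of `k`, `stride` and `window`, and the symmetric-window definition of co-occurrence are all assumptions made for illustration. The unsupervised embedding training that would consume these counts is not shown.

```python
from collections import Counter


def split_kmers(seq, k=6, stride=1):
    """Split a DNA sequence into overlapping k-mers (hypothetical helper).

    k and stride are illustrative defaults, not values taken from the paper.
    """
    seq = seq.upper()
    return [seq[i:i + k] for i in range(0, len(seq) - k + 1, stride)]


def cooccurrence_counts(kmers, window=2):
    """Count ordered (center, context) k-mer pairs within a symmetric window.

    The window-based definition is an assumption; the abstract only states
    that embeddings are pre-trained from a k-mer co-occurrence matrix.
    """
    counts = Counter()
    for i, center in enumerate(kmers):
        lo = max(0, i - window)
        hi = min(len(kmers), i + window + 1)
        for j in range(lo, hi):
            if j != i:
                counts[(center, kmers[j])] += 1
    return counts


# Toy example on a short sequence.
kmers = split_kmers("ACGTACGT", k=3)
pairs = cooccurrence_counts(kmers, window=1)
```

Because every k-mer list is produced the same way regardless of sequence length, counts from many variable-length sequences can be summed into a single sparse matrix before any embedding is trained.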


Accession: 059498768


PMID: 28881969

DOI: 10.1093/bioinformatics/btx234

