+ Site Statistics
+ Search Articles
+ Subscribe to Site Feeds
EurekaMag Most Shared ContentMost Shared
EurekaMag PDF Full Text ContentPDF Full Text
+ PDF Full Text
Request PDF Full TextRequest PDF Full Text
+ Follow Us
Follow on FacebookFollow on Facebook
Follow on TwitterFollow on Twitter
Follow on LinkedInFollow on LinkedIn

+ Translate

A frameshift error detection algorithm for DNA sequencing projects

A frameshift error detection algorithm for DNA sequencing projects

Nucleic acids research, 23(15): 2900-2908

During the determination of DNA sequences, frame-shift errors are not the most frequent but they are the most bothersome as they corrupt the amino acid sequence over several residues. Detection of such errors by sequence alignment is only possible when related sequences are found in the databases. To avoid this limitation, we have developed a new tool based on the distribution of non-overlapping 3-tuples or 6-tuples in the three frames of an ORF. The method relies upon the result of a correspondence analysis. It has been extensively tested on Bacillus subtilis and Saccharomyces cerevisiae sequences and has also been examined with human sequences. The results indicate that it can detect frameshift errors affecting as few as 20 bp with a low rate of false positives (no more than 1.0/1000 bp scanned). The proposed algorithm can be used to scan a large collection of data, but it is mainly intended for laboratory practice as a tool for checking the quality of the sequences produced during a sequencing project.

Accession: 008039177

Download citation: RISBibTeXText

PMID: 7659513

DOI: 10.1093/nar/23.15.2900

Download PDF Full Text: A frameshift error detection algorithm for DNA sequencing projects

Related references

EDAR: an efficient error detection and removal algorithm for next generation sequencing data. Journal of Computational Biology 17(11): 1549-1560, 2011

A quality control algorithm for DNA sequencing projects. Nucleic Acids Research 21(16): 3829-3838, 1993

Analysis of 454 sequencing error rate, error sources, and artifact recombination for detection of Low-frequency drug resistance mutations in HIV-1 DNA. Retrovirology 10: 18-18, 2013

Error-free image compression algorithm using classifying-sequencing techniques. Applied Optics 31(14): 2554-2559, 1992

Error begat error: design error analysis and prevention in social infrastructure projects. Accident; Analysis and Prevention 48: 100-110, 2012

Rare Event Detection Using Error-corrected DNA and RNA Sequencing. Journal of Visualized Experiments 138), 2018

Error correction for phase detection by recursive algorithm real time DFT. Electrical Engineering in Japan 141(1): 8-17, 2002

Error analysis of coefficient-based regularized algorithm for density-level detection. Neural Computation 25(4): 1107-1121, 2013

Validation of a nonrigid registration error detection algorithm using clinical MRI brain data. IEEE Transactions on Medical Imaging 34(1): 86-96, 2015

Alternative Splicing Detection Tool-a novel PERL algorithm for sensitive detection of splicing events, based on next-generation sequencing data analysis. Annals of Translational Medicine 6(12): 244-244, 2018

High-specificity detection of rare alleles with Paired-End Low Error Sequencing (PELE-Seq). Bmc Genomics 17(): 464-464, 2016

ADEPT, a dynamic next generation sequencing data error-detection program with trimming. Bmc Bioinformatics 17(): 109-109, 2016

An error detection and recovery algorithm for compressed video signal using source level redundancy. IEEE Transactions on Image Processing 9(2): 209-219, 2008

rSW-seq: algorithm for detection of copy number alterations in deep sequencing data. Bmc Bioinformatics 11(): 432-432, 2010

An accurate algorithm for the detection of DNA fragments from dilution pool sequencing experiments. Bioinformatics, 2017