This is an overview page with metadata for this scientific paper. The full article is available from the publisher.
FLASH: fast length adjustment of short reads to improve genome assemblies
Citations: 15,526
Authors: 2
Year: 2011
Abstract
MOTIVATION: Next-generation sequencing technologies generate very large numbers of short reads. Even with very deep genome coverage, short read lengths cause problems in de novo assemblies. The use of paired-end libraries with a fragment size shorter than twice the read length provides an opportunity to generate much longer reads by overlapping and merging read pairs before assembling a genome.

RESULTS: We present FLASH, a fast computational tool to extend the length of short reads by overlapping paired-end reads from fragment libraries that are sufficiently short. We tested the correctness of the tool on one million simulated read pairs, and we then applied it as a pre-processor for genome assemblies of Illumina reads from the bacterium Staphylococcus aureus and human chromosome 14. FLASH correctly extended and merged reads >99% of the time on simulated reads with an error rate of <1%. With adequately set parameters, FLASH correctly merged reads over 90% of the time even when the reads contained up to 5% errors. When FLASH was used to extend reads prior to assembly, the resulting assemblies had substantially greater N50 lengths for both contigs and scaffolds.

AVAILABILITY AND IMPLEMENTATION: The FLASH system is implemented in C and is freely available as open-source code at http://www.cbcb.umd.edu/software/flash.

CONTACT: t.magoc@gmail.com.
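The idea of overlapping and merging a read pair can be illustrated with a minimal Python sketch. This is not FLASH's implementation (which is written in C and is not described in detail in the abstract). The function names, the `min_overlap` and `max_mismatch_ratio` parameters and their default values are hypothetical and chosen only for illustration. The sketch also ignores base quality scores and simply keeps the base from the first read wherever the two reads disagree.

```python
def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T, N only)."""
    comp = {"A": "T", "C": "G", "G": "C", "T": "A", "N": "N"}
    return "".join(comp[base] for base in reversed(seq))


def merge_pair(read1: str, read2: str,
               min_overlap: int = 10,
               max_mismatch_ratio: float = 0.25):
    """Try to merge a paired-end read pair into one longer sequence.

    Hypothetical helper, not FLASH's algorithm: it scans every overlap
    length between the end of read1 and the start of the reverse
    complement of read2, and keeps the overlap with the lowest mismatch
    ratio (ties go to the longer overlap). Returns None if no overlap
    satisfies the thresholds.
    """
    r2 = reverse_complement(read2)  # bring read2 onto read1's strand
    best = None  # (mismatch_ratio, overlap_length)
    max_overlap = min(len(read1), len(r2))

    # Scan from the longest candidate overlap down to the shortest allowed.
    for overlap in range(max_overlap, min_overlap - 1, -1):
        tail = read1[len(read1) - overlap:]
        head = r2[:overlap]
        mismatches = sum(1 for x, y in zip(tail, head) if x != y)
        ratio = mismatches / overlap
        if ratio <= max_mismatch_ratio and (best is None or ratio < best[0]):
            best = (ratio, overlap)

    if best is None:
        return None

    # Simplification: keep read1's bases inside the overlap region.
    return read1 + r2[best[1]:]


# Usage: a 30 bp fragment sequenced as two 20 bp reads overlapping by 10 bp.
fragment = "ACGTACGTTAGCCGATAGGCTAACGTTAGC"
read1 = fragment[:20]
read2 = reverse_complement(fragment[10:])
merged = merge_pair(read1, read2)
```

In this example the fragment (30 bp) is shorter than twice the read length (40 bp), which is the condition the abstract names for merging to be possible. A pair with no acceptable overlap yields `None` and would be passed to the assembler unmerged.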
Similar works
Cleavage of Structural Proteins during the Assembly of the Head of Bacteriophage T4
1970 · 251,381 citations
Basic local alignment search tool
1990 · 93,923 citations
Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
1997 · 74,271 citations
Fiji: an open-source platform for biological-image analysis
2012 · 69,594 citations
Trimmomatic: a flexible trimmer for Illumina sequence data
2014 · 68,549 citations