Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
fastp: an ultra-fast all-in-one FASTQ preprocessor
28.855
Zitationen
4
Autoren
2018
Jahr
Abstract
Abstract Motivation Quality control and preprocessing of FASTQ files are essential to providing clean data for downstream analysis. Traditionally, a different tool is used for each operation, such as quality control, adapter trimming and quality filtering. These tools are often insufficiently fast as most are developed using high-level programming languages (e.g. Python and Java) and provide limited multi-threading support. Reading and loading data multiple times also renders preprocessing slow and I/O inefficient. Results We developed fastp as an ultra-fast FASTQ preprocessor with useful quality control and data-filtering features. It can perform quality control, adapter trimming, quality filtering, per-read quality pruning and many other operations with a single scan of the FASTQ data. This tool is developed in C++ and has multi-threading support. Based on our evaluation, fastp is 2–5 times faster than other FASTQ preprocessing tools such as Trimmomatic or Cutadapt despite performing far more operations than similar tools. Availability and implementation The open-source code and corresponding instructions are available at https://github.com/OpenGene/fastp.
Einordnung (Deutsch)
Diese Studie untersucht open-source code and corresponding instructions are available at https://github.com/opengene/fastp.. Die Arbeit trägt den Originaltitel „fastp: an ultra-fast all-in-one FASTQ preprocessor" und ist relevant für die aktuelle Gesundheits- und MedTech-Forschung. Die Ergebnisse können für Kliniker, Forscher und Fachleute im Gesundheitswesen von Bedeutung sein.
Diese Kurzfassung dient der thematischen Einordnung und ersetzt nicht den Originaltext.
Ähnliche Arbeiten
Fast Parallel Algorithms for Short-Range Molecular Dynamics
1995 · 44.257 Zit.
MapReduce
2008 · 18.508 Zit.
LINCS: A linear constraint solver for molecular simulations
1997 · 16.975 Zit.
PyTorch: An Imperative Style, High-Performance Deep Learning Library
2019 · 16.187 Zit.
Suspending OpenMP Tasks on Asynchronous Events: Extending the Taskwait Construct
2023 · 12.934 Zit.