TagDust - A program to eliminate artifacts from next generation sequencing data

Timo Lassmann, Yoshihide Hayashizaki, Carsten O. Daub

Research output: Contribution to journal › Article › peer-review

188 Citations (Scopus)

Abstract

Motivation: Next-generation parallel sequencing technologies produce large quantities of short sequence reads. Due to experimental procedures various types of artifacts are commonly sequenced alongside the targeted RNA or DNA sequences. Identification of such artifacts is important during the development of novel sequencing assays and for the downstream analysis of the sequenced libraries.

Results: Here we present TagDust, a program identifying artifactual sequences in large sequencing runs. Given a user-defined cutoff for the false discovery rate, TagDust identifies all reads explainable by combinations and partial matches to known sequences used during library preparation. We demonstrate the quality of our method on sequencing runs performed on Illumina's Genome Analyzer platform.
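The idea in the abstract, flagging reads that can be explained by pieces of known library sequences while controlling the false discovery rate, can be illustrated with a small Python sketch. This is not the published TagDust algorithm: the k-mer coverage score, the random-sequence null model, the Benjamini-Hochberg step, every function name and parameter, and the two library sequences are assumptions chosen only for illustration.

```python
import random


def kmers(seq, k):
    """Return the set of all length-k substrings of seq."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def covered_fraction(read, library_kmers, k):
    """Fraction of read bases inside at least one k-mer shared with the library."""
    if not read:
        return 0.0
    covered = [False] * len(read)
    for i in range(len(read) - k + 1):
        if read[i:i + k] in library_kmers:
            for j in range(i, i + k):
                covered[j] = True
    return sum(covered) / len(read)


def benjamini_hochberg(p_values, fdr):
    """Return one boolean per p-value: True if rejected at the given FDR."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    max_rank = 0
    for rank, idx in enumerate(order, start=1):
        if p_values[idx] <= fdr * rank / m:
            max_rank = rank
    flags = [False] * m
    for idx in order[:max_rank]:
        flags[idx] = True
    return flags


def flag_artifacts(reads, library, k=6, fdr=0.05, n_null=2000, seed=0):
    """Flag reads whose library coverage is unlikely under a random-read null.

    Assumption: the null is uniform random ACGT sequences of the longest
    read length; a real tool would use a more careful background model.
    """
    rng = random.Random(seed)
    library_kmers = set()
    for seq in library:
        library_kmers |= kmers(seq, k)
    read_len = max(len(r) for r in reads)
    null_scores = [
        covered_fraction(
            "".join(rng.choice("ACGT") for _ in range(read_len)), library_kmers, k
        )
        for _ in range(n_null)
    ]
    p_values = []
    for read in reads:
        score = covered_fraction(read, library_kmers, k)
        exceed = sum(1 for s in null_scores if s >= score)
        p_values.append((exceed + 1) / (n_null + 1))  # empirical p-value
    return benjamini_hochberg(p_values, fdr)


# Placeholder library sequences for illustration only (not from the paper).
library = [
    "ACGTTGCAAGCTTGGCACTGGCCGTCGTTTTAC",
    "CAAGCAGAAGACGGCATACGAGCTCTTCCGATCT",
]
reads = [
    library[0][-15:] + library[1][:15],  # combination of two partial matches
    library[0][:30],                     # partial match to one library sequence
    "T" * 30,                            # shares no 6-mer with the library
    "AT" * 15,                           # shares no 6-mer with the library
]
flags = flag_artifacts(reads, library)
```

In this toy input the first two reads are built entirely from library fragments, while the last two share no 6-mer with the library, so only the former are candidates for flagging. The choice of `k` trades sensitivity to short partial matches against chance hits in genuine reads.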

Original language: English
Pages (from-to): 2839-2840
Number of pages: 2
Journal: Bioinformatics
Volume: 25
Issue number: 21
Publication status: Published - 2009
Externally published: Yes
