Biology  2012 

FLEXBAR—Flexible Barcode and Adapter Processing for Next-Generation Sequencing Platforms

DOI: 10.3390/biology1030895

Keywords: high-throughput sequencing, demultiplexing, trimming, clipping, quality control


Abstract:

Quantitative and systems biology approaches benefit from the unprecedented depth of next-generation sequencing. A typical experiment yields millions of short reads, which often carry particular sequence tags. These tags may: (a) be specific to the sequencing platform and library construction method (e.g., adapter sequences); (b) have been introduced by experimental design (e.g., sample barcodes); or (c) constitute some biological signal (e.g., splice leader sequences in nematodes). Our software FLEXBAR enables accurate recognition, sorting and trimming of sequence tags with maximal flexibility, based on exact overlap sequence alignment. The software supports data formats from all current sequencing platforms, including color-space reads. FLEXBAR maintains read pairings and processes separate barcode reads on demand. It allows fine-grained adjustment of sequence tag detection parameters and search regions. FLEXBAR is multi-threaded and combines speed with precision. Even complex read processing scenarios can be executed with a single command-line call. We demonstrate the utility of the software in read mapping applications, library demultiplexing and splice leader detection. FLEXBAR and additional information are available for academic use from the website: http://sourceforge.net/projects/flexbar/.
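
To make the idea of tag recognition by overlap concrete, the following Python sketch shows a simplified, ungapped version of two tasks named in the abstract: trimming a 3' adapter and assigning a read to a sample by its 5' barcode. It is not FLEXBAR's implementation, which is described as using overlap sequence alignment. The function names, the `min_overlap` and `max_error_rate` parameters, and the mismatch-only scoring are all assumptions made for illustration.

```python
def find_adapter_start(read, adapter, min_overlap=3, max_error_rate=0.1):
    """Return the leftmost index in `read` where the adapter appears to begin.

    Simplified, ungapped overlap check (an assumption of this sketch):
    the adapter prefix is compared against the read starting at each
    position, and the adapter may run off the 3' end of the read.
    Returns None if no position satisfies the error threshold.
    """
    for start in range(len(read) - min_overlap + 1):
        overlap = min(len(adapter), len(read) - start)
        mismatches = sum(
            1
            for a, b in zip(read[start:start + overlap], adapter[:overlap])
            if a != b
        )
        if mismatches <= max_error_rate * overlap:
            return start
    return None


def trim_adapter(read, adapter, **kwargs):
    """Remove the adapter and everything after it, if an adapter is found."""
    start = find_adapter_start(read, adapter, **kwargs)
    return read if start is None else read[:start]


def assign_barcode(read, barcodes, max_mismatches=1):
    """Assign a read to the sample whose barcode best matches its 5' end.

    `barcodes` maps hypothetical sample names to barcode sequences.
    Returns (sample_name, read_without_barcode), or (None, read) when
    no barcode is within `max_mismatches`.
    """
    best_name, best_mm = None, max_mismatches + 1
    for name, barcode in barcodes.items():
        if len(read) < len(barcode):
            continue
        mm = sum(1 for a, b in zip(read, barcode) if a != b)
        if mm < best_mm:
            best_name, best_mm = name, mm
    if best_name is None:
        return None, read
    return best_name, read[len(barcodes[best_name]):]


if __name__ == "__main__":
    # The read ends with the first 10 bases of the example adapter.
    print(trim_adapter("ACGTACGTAGATCGGAAG", "AGATCGGAAGAGC"))
    print(assign_barcode("TTGACCCC", {"s1": "ACGT", "s2": "TTGA"}))
```

A gapped alignment would additionally tolerate insertions and deletions inside the tag, which this mismatch-only sketch does not handle.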

