%0 Journal Article
%T Local Assemblies of Paired-End Reduced Representation Libraries Sequenced with the Illumina Genome Analyzer in Maize
%A St谷phane Deschamps
%A Kishore Nannapaneni
%A Yun Zhang
%A Kevin Hayes
%J International Journal of Plant Genomics
%D 2012
%I Hindawi Publishing Corporation
%R 10.1155/2012/360598
%X The use of next-generation DNA sequencing technologies has greatly facilitated reference-guided variant detection in complex plant genomes. However, complications may arise when regions adjacent to a read of interest are used for marker assay development, or when reference sequences are incomplete, as short reads alone may not be long enough to ascertain their uniqueness. Here, the possibility of generating longer sequences in discrete regions of the large and complex genome of maize is demonstrated, using a modified version of a paired-end RAD library construction strategy. Reads are generated from DNA fragments first digested with a methylation-sensitive restriction endonuclease, sheared, enriched with biotin and a selective PCR amplification step, and then sequenced at both ends. Sequences are locally assembled into contigs by subgrouping pairs based on the identity of the read anchored by the restriction site. This strategy applied to two maize inbred lines (B14 and B73) generated 183,609 and 129,018 contigs, respectively, out of which at least 76% were >200ˋbps in length. A subset of putative single nucleotide polymorphisms from contigs aligning to the B73 reference genome with at least one mismatch was resequenced, and 90% of those in B14 were confirmed, indicating that this method is a potent approach for variant detection and marker development in species with complex genomes or lacking extensive reference sequences. 1. Introduction DNA-based genetic markers are pivotal tools for applications as diverse as QTL mapping, marker assisted selection, association mapping, and fine mapping for the detection of genes linked to a particular phenotype [1]. Among the variety of genetic markers that have been developed, those derived from single nucleotide polymorphisms (SNPs) have become the marker of choice for many mapping applications because of their abundance and the availability of high-throughput and cost-effective technologies for detection and diagnostics [2每4]. One popular tool for SNP identification and detection has been the construction of reduced-representation libraries (RRL) and their sequencing with massively parallel sequencing platforms, in species as varied as cattle, worm, soybean, rice, maize, or common bean [5每10]. However, one major limitation of such platforms is the relatively short length of individual sequencing reads. While the availability of a high quality reference sequence may render short reads sufficient for alignment and subsequent SNP detection, this limitation may be further compounded in crop species due to (1) the
%U http://www.hindawi.com/journals/ijpg/2012/360598/