%0 Journal Article %T Exploring Diversification and Genome Size Evolution in Extant Gymnosperms through Phylogenetic Synthesis %A J. Gordon Burleigh %A W. Brad Barbazuk %A John M. Davis %A Alison M. Morse %A Pamela S. Soltis %J Journal of Botany %D 2012 %I Hindawi Publishing Corporation %R 10.1155/2012/292857 %X Gymnosperms, comprising cycads, Ginkgo, Gnetales, and conifers, represent one of the major groups of extant seed plants. Yet compared to angiosperms, little is known about the patterns of diversification and genome evolution in gymnosperms. We assembled a phylogenetic supermatrix containing over 4.5 million nucleotides from 739 gymnosperm taxa. Although 93.6% of the cells in the supermatrix are empty, the data reveal many strongly supported nodes that are generally consistent with previous phylogenetic analyses, including weak support for Gnetales sister to Pinaceae. A lineage through time plot suggests elevated rates of diversification within the last 100 million years, and there is evidence of shifts in diversification rates in several clades within cycads and conifers. A likelihood-based analysis of the evolution of genome size in 165 gymnosperms finds evidence for heterogeneous rates of genome size evolution due to an elevated rate in Pinus. 1. Introduction Recent advances in sequencing technology offer the possibility of identifying the genetic mechanisms that influence evolutionarily important characters and ultimately drive diversification. Within angiosperms, large-scale phylogenetic analyses have identified complex patterns of diversification (e.g., [1–3]), and numerous genomes are at least partially sequenced. Yet the other major clade of seed plants, the gymnosperms, has received far less attention, with few comprehensive studies of diversification and no sequenced genomes. Note that throughout this paper “gymnosperms” specifies only the approximately 1000 extant species within cycads, Ginkgo, Gnetales, and conifers. 
These comprise the Acrogymnospermae clade described by Cantino et al. [4]. Many gymnosperms have exceptionally large genomes (e.g., [5–7]), and this has hindered whole-genome sequencing projects, especially among economically important Pinus species. This large genome size is interesting because one suggested mechanism for rapid increases in genome size, polyploidy, is rare among gymnosperms [8]. Recent sequencing efforts have elucidated some of the genomic characteristics associated with the large genome size in Pinus. Morse et al. [9] identified a large retrotransposon family in Pinus that, with other retrotransposon families, accounts for much of the genomic complexity. Similarly, recent sequencing of 10 BAC (bacterial artificial chromosome) clones from Pinus taeda identified many conifer-specific LTR (long terminal repeat) retroelements [10]. These studies suggest that the large genome size may be caused by rapid expansion of %U http://www.hindawi.com/journals/jb/2012/292857/