The complete chloroplast genome sequence of Pelargonium x hortorum: organization and evolution of the largest and most highly rearranged chloroplast genome of land plants
Chumley, T.W.; Palmer, J.D.; Mower, J.P.; Fourcade, H.M.; Calie, P.J.; Boore, J.L.; Jansen, R.K.
Molecular Biology and Evolution 23(11): 2175-2190
ISSN/ISBN: 0737-4038 PMID: 16916942 DOI: 10.1093/molbev/msl089
The chloroplast genome of Pelargonium x hortorum has been completely sequenced. It maps as a circular molecule of 217,942 bp and is both the largest and most rearranged land plant chloroplast genome yet sequenced. It features 2 copies of a greatly expanded inverted repeat (IR) of 75,741 bp each and, consequently, diminished single-copy regions of 59,710 and 6,750 bp. Despite the increase in size and complexity of the genome, the gene content is similar to that of other angiosperms, with the exceptions of a large number of pseudogenes, the recognition of 2 open reading frames (ORF56 and ORF42) in the trnA intron with similarities to previously identified mitochondrial products (ACRS and pvs-trnA), the losses of accD and trnT-ggu and, in particular, the presence of a highly divergent set of rpoA-like ORFs rather than a single, easily recognized gene for rpoA. The 3-fold expansion of the IR (relative to most angiosperms) accounts for most of the size increase of the genome, but an additional 10% of the size increase is related to the large number of repeats found. The Pelargonium genome contains 35 times as many repeats of 31 bp or larger as the unrearranged genome of Spinacia. Most of these repeats occur near the rearrangement hotspots, and 2 different associations of repeats are localized in these regions. These associations are characterized by full or partial duplications of several genes, most of which appear to be nonfunctional copies or pseudogenes. These duplications may also be linked to the disruption of at least 1 but possibly 2 or 3 operons. We propose simple models that account for the major rearrangements with a minimum of 8 IR boundary changes and 12 inversions in addition to several insertions of duplicated sequence.
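The region lengths reported in the abstract can be checked against the stated genome size with a short calculation. The sketch below is illustrative only: the function and the variable names are hypothetical, and labelling the two single-copy regions as "large" (LSC) and "small" (SSC) is an assumption based on the usual convention, since the abstract gives only their lengths.

```python
# Region lengths in bp, as reported in the abstract.
IR_LENGTH = 75_741        # one copy of the inverted repeat
LSC_LENGTH = 59_710       # assumed to be the large single-copy region
SSC_LENGTH = 6_750        # assumed to be the small single-copy region
REPORTED_TOTAL = 217_942  # genome size stated in the abstract


def genome_size(ir: int, lsc: int, ssc: int) -> int:
    """Length of a genome made of two IR copies plus two single-copy regions."""
    return 2 * ir + lsc + ssc


total = genome_size(IR_LENGTH, LSC_LENGTH, SSC_LENGTH)

# Share of the genome occupied by the two IR copies together.
ir_fraction = 2 * IR_LENGTH / total

print(f"computed total: {total} bp (matches reported: {total == REPORTED_TOTAL})")
print(f"IR share of genome: {ir_fraction:.1%}")
```

Under these assumptions the four regions sum to the reported 217,942 bp, and the two IR copies together make up roughly 70% of the genome, consistent with the statement that IR expansion accounts for most of the size increase.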