Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 19 de 19
Filter
Add more filters










Publication year range
1.
Genome Biol Evol ; 15(1)2023 01 04.
Article in English | MEDLINE | ID: mdl-36582124

ABSTRACT

Mycoheterotrophy is an alternative nutritional strategy whereby plants obtain sugars and other nutrients from soil fungi. Mycoheterotrophy and associated loss of photosynthesis have evolved repeatedly in plants, particularly in monocots. Although reductive evolution of plastomes in mycoheterotrophs is well documented, the dynamics of nuclear genome evolution remains largely unknown. Transcriptome datasets were generated from four mycoheterotrophs in three families (Orchidaceae, Burmanniaceae, Triuridaceae) and related green plants and used for phylogenomic analyses to resolve relationships among the mycoheterotrophs, their relatives, and representatives across the monocots. Phylogenetic trees based on 602 genes were mostly congruent with plastome phylogenies, except for an Asparagales + Liliales clade inferred in the nuclear trees. Reduction and loss of chlorophyll synthesis and photosynthetic gene expression and relaxation of purifying selection on retained genes were progressive, with greater loss in older nonphotosynthetic lineages. One hundred seventy-four of 1375 plant benchmark universally conserved orthologous genes were undetected in any mycoheterotroph transcriptome or the genome of the mycoheterotrophic orchid Gastrodia but were expressed in green relatives, providing evidence for massively convergent gene loss in nonphotosynthetic lineages. We designate this set of deleted or undetected genes Missing in Mycoheterotrophs (MIM). MIM genes encode not only mainly photosynthetic or plastid membrane proteins but also a diverse set of plastid processes, genes of unknown function, mitochondrial, and cellular processes. Transcription of a photosystem II gene (psb29) in all lineages implies a nonphotosynthetic function for this and other genes retained in mycoheterotrophs. Nonphotosynthetic plants enable novel insights into gene function as well as gene expression shifts, gene loss, and convergence in nuclear genomes.


Subject(s)
Genome, Plastid , Orchidaceae , Humans , Aged , Phylogeny , Genes, Plant , Plant Proteins/genetics , Orchidaceae/genetics
2.
Front Plant Sci ; 13: 876779, 2022.
Article in English | MEDLINE | ID: mdl-36483967

ABSTRACT

We assess relationships among 192 species in all 12 monocot orders and 72 of 77 families, using 602 conserved single-copy (CSC) genes and 1375 benchmarking single-copy ortholog (BUSCO) genes extracted from genomic and transcriptomic datasets. Phylogenomic inferences based on these data, using both coalescent-based and supermatrix analyses, are largely congruent with the most comprehensive plastome-based analysis, and nuclear-gene phylogenomic analyses with less comprehensive taxon sampling. The strongest discordance between the plastome and nuclear gene analyses is the monophyly of a clade comprising Asparagales and Liliales in our nuclear gene analyses, versus the placement of Asparagales and Liliales as successive sister clades to the commelinids in the plastome tree. Within orders, around six of 72 families shifted positions relative to the recent plastome analysis, but four of these involve poorly supported inferred relationships in the plastome-based tree. In Poales, the nuclear data place a clade comprising Ecdeiocoleaceae+Joinvilleaceae as sister to the grasses (Poaceae); Typhaceae, (rather than Bromeliaceae) are resolved as sister to all other Poales. In Commelinales, nuclear data place Philydraceae sister to all other families rather than to a clade comprising Haemodoraceae+Pontederiaceae as seen in the plastome tree. In Liliales, nuclear data place Liliaceae sister to Smilacaceae, and Melanthiaceae are placed sister to all other Liliales except Campynemataceae. Finally, in Alismatales, nuclear data strongly place Tofieldiaceae, rather than Araceae, as sister to all the other families, providing an alternative resolution of what has been the most problematic node to resolve using plastid data, outside of those involving achlorophyllous mycoheterotrophs. As seen in numerous prior studies, the placement of orders Acorales and Alismatales as successive sister lineages to all other extant monocots. Only 21.2% of BUSCO genes were demonstrably single-copy, yet phylogenomic inferences based on BUSCO and CSC genes did not differ, and overall functional annotations of the two sets were very similar. Our analyses also reveal significant gene tree-species tree discordance despite high support values, as expected given incomplete lineage sorting (ILS) related to rapid diversification. Our study advances understanding of monocot relationships and the robustness of phylogenetic inferences based on large numbers of nuclear single-copy genes that can be obtained from transcriptomes and genomes.

3.
Gigascience ; 8(10)2019 10 01.
Article in English | MEDLINE | ID: mdl-31644802

ABSTRACT

BACKGROUND: The 1000 Plant transcriptomes initiative (1KP) explored genetic diversity by sequencing RNA from 1,342 samples representing 1,173 species of green plants (Viridiplantae). FINDINGS: This data release accompanies the initiative's final/capstone publication on a set of 3 analyses inferring species trees, whole genome duplications, and gene family expansions. These and previous analyses are based on de novo transcriptome assemblies and related gene predictions. Here, we assess their data and assembly qualities and explain how we detected potential contaminations. CONCLUSIONS: These data will be useful to plant and/or evolutionary scientists with interests in particular gene families, either across the green plant tree of life or in more focused lineages.


Subject(s)
Genes, Plant , Viridiplantae/genetics , Plant Proteins/genetics , Sequence Analysis, RNA , Transcriptome
4.
Am J Bot ; 105(3): 587-601, 2018 03.
Article in English | MEDLINE | ID: mdl-29746718

ABSTRACT

PREMISE OF THE STUDY: The relative ease of high throughput sequencing is facilitating comprehensive phylogenomic and gene expression studies, even for nonmodel groups. To date, however, these two approaches have not been merged; while phylogenomic methods might use transcriptome sequences to resolve relationships, assessment of gene expression patterns in a phylogenetic context is less common. Here we analyzed both carbon assimilation and gene expression patterns of closely related species within the Agavoideae (Asparagaceae) to elucidate changes in gene expression across weak and strong phenotypes for Crassulacean acid metabolism (CAM). METHODS: Gene expression patterns were compared across four genera: Agave (CAM), which is paraphyletic with Polianthes (weak CAM) and Manfreda (CAM), and Beschorneria (weak CAM). RNA-sequencing was paired with measures of gas exchange and titratable acidity. Climate niche space was compared across the four lineages to examine abiotic factors and their correlation to CAM. KEY RESULTS: Expression of homologous genes showed both shared and variable patterns in weak and strong CAM species. Network analysis highlights that despite shared expression patterns, highly connected genes differ between weak and strong CAM, implicating shifts in regulatory gene function as key for the evolution of CAM. Variation in carbohydrate metabolism between weak and strong CAM supports the importance of sugar turnovers for CAM physiology. CONCLUSIONS: Integration of phylogenetics and RNA-sequencing provides a powerful tool to study the evolution of CAM photosynthesis across closely related but photosynthetically variable species. Our findings regarding shared or shifted gene expression and regulation of CAM via carbohydrate metabolism have important implications for efforts to engineer the CAM pathway into C3 food and biofuel crops.


Subject(s)
Asparagaceae/genetics , Biological Evolution , Carbohydrate Metabolism/genetics , Climate , Phenotype , Photosynthesis/genetics , Transcriptome , Adaptation, Biological , Agave , Asparagaceae/metabolism , Carbon/metabolism , Gene Expression , Genes, Plant , Genome, Plant , Genomics/methods , High-Throughput Nucleotide Sequencing , Phylogeny , RNA, Plant/analysis , Sequence Analysis, RNA , Species Specificity , Sugars/metabolism
5.
Nat Commun ; 8(1): 1279, 2017 11 02.
Article in English | MEDLINE | ID: mdl-29093472

ABSTRACT

Sex chromosomes evolved from autosomes many times across the eukaryote phylogeny. Several models have been proposed to explain this transition, some involving male and female sterility mutations linked in a region of suppressed recombination between X and Y (or Z/W, U/V) chromosomes. Comparative and experimental analysis of a reference genome assembly for a double haploid YY male garden asparagus (Asparagus officinalis L.) individual implicates separate but linked genes as responsible for sex determination. Dioecy has evolved recently within Asparagus and sex chromosomes are cytogenetically identical with the Y, harboring a megabase segment that is missing from the X. We show that deletion of this entire region results in a male-to-female conversion, whereas loss of a single suppressor of female development drives male-to-hermaphrodite conversion. A single copy anther-specific gene with a male sterile Arabidopsis knockout phenotype is also in the Y-specific region, supporting a two-gene model for sex chromosome evolution.


Subject(s)
Arabidopsis/genetics , Asparagus Plant/genetics , Chromosomes, Plant/genetics , Sex Chromosomes/genetics , Sex Determination Processes/genetics , Evolution, Molecular , Genome, Plant , Hermaphroditic Organisms/genetics , Plant Infertility/genetics
6.
J Clin Microbiol ; 55(1): 244-252, 2017 01.
Article in English | MEDLINE | ID: mdl-27847370

ABSTRACT

Despite attempts to control avian mycoplasmosis through management, vaccination, and surveillance, Mycoplasma gallisepticum continues to cause significant morbidity, mortality, and economic losses in poultry production. Live attenuated vaccines are commonly used in the poultry industry to control avian mycoplasmosis; unfortunately, some vaccines may revert to virulence and vaccine strains are generally difficult to distinguish from natural field isolates. In order to identify genome differences among vaccine revertants, vaccine strains, and field isolates, whole-genome sequencing of the M. gallisepticum vaccine strain ts-11 and several "ts-11-like" strains isolated from commercial flocks was performed using Illumina and 454 pyrosequencing and the sequenced genomes compared to the M. gallisepticum Rlow reference genome. The collective contigs for each strain were annotated using the fully annotated Mycoplasma reference genome. The analysis revealed genetic differences among vlhA alleles, as well as among genes annotated as coding for a cell wall surface anchor protein (mg0377) and a hypothetical protein gene, mg0359, unique to M. gallisepticum ts-11 vaccine strain. PCR protocols were designed to target 5 sequences unique to the M. gallisepticum ts-11 strain: vlhA3.04a, vlhA3.04b, vlhA3.05, mg0377, and mg0359 All ts-11 isolates were positive for the five gene alleles tested by PCR; however, 5 to 36% of field isolates were also positive for at least one of the alleles tested. A combination of PCR tests for vlhA3.04a, vlhA3.05, and mg0359 was able to distinguish the M. gallisepticum ts-11 vaccine strain from field isolates. This method will further supplement current approaches to quickly distinguish M. gallisepticum vaccine strains from field isolates.


Subject(s)
Bacterial Vaccines/genetics , DNA, Bacterial/genetics , Genetic Variation , Mycoplasma Infections/veterinary , Mycoplasma gallisepticum/genetics , Poultry Diseases/microbiology , Sequence Analysis, DNA , Animals , Bacterial Vaccines/adverse effects , Bacteriological Techniques/methods , Diagnosis, Differential , Genes, Bacterial , Genome, Bacterial , Mycoplasma Infections/microbiology , Polymerase Chain Reaction/methods , Poultry , Vaccines, Attenuated/adverse effects , Vaccines, Attenuated/genetics , Veterinary Medicine/methods
7.
Genome Biol Evol ; 8(4): 1150-64, 2016 Apr 21.
Article in English | MEDLINE | ID: mdl-26988252

ABSTRACT

Comparisons of flowering plant genomes reveal multiple rounds of ancient polyploidy characterized by large intragenomic syntenic blocks. Three such whole-genome duplication (WGD) events, designated as rho (ρ), sigma (σ), and tau (τ), have been identified in the genomes of cereal grasses. Precise dating of these WGD events is necessary to investigate how they have influenced diversification rates, evolutionary innovations, and genomic characteristics such as the GC profile of protein-coding sequences. The timing of these events has remained uncertain due to the paucity of monocot genome sequence data outside the grass family (Poaceae). Phylogenomic analysis of protein-coding genes from sequenced genomes and transcriptome assemblies from 35 species, including representatives of all families within the Poales, has resolved the timing of rho and sigma relative to speciation events and placed tau prior to divergence of Asparagales and the commelinids but after divergence with eudicots. Examination of gene family phylogenies indicates that rho occurred just prior to the diversification of Poaceae and sigma occurred before early diversification of Poales lineages but after the Poales-commelinid split. Additional lineage-specific WGD events were identified on the basis of the transcriptome data. Gene families exhibiting high GC content are underrepresented among those with duplicate genes that persisted following these genome duplications. However, genome duplications had little overall influence on lineage-specific changes in the GC content of coding genes. Improved resolution of the timing of WGD events in monocot history provides evidence for the influence of polyploidization on functional evolution and species diversification.


Subject(s)
Evolution, Molecular , Genome, Plant , Magnoliopsida/genetics , Transcriptome , Gene Duplication , Oryza/genetics , Phylogeny , Poaceae/genetics , Polyploidy , RNA, Plant/genetics , Sorghum/genetics
8.
Proc Natl Acad Sci U S A ; 111(45): E4859-68, 2014 Nov 11.
Article in English | MEDLINE | ID: mdl-25355905

ABSTRACT

Reconstructing the origin and evolution of land plants and their algal relatives is a fundamental problem in plant phylogenetics, and is essential for understanding how critical adaptations arose, including the embryo, vascular tissue, seeds, and flowers. Despite advances in molecular systematics, some hypotheses of relationships remain weakly resolved. Inferring deep phylogenies with bouts of rapid diversification can be problematic; however, genome-scale data should significantly increase the number of informative characters for analyses. Recent phylogenomic reconstructions focused on the major divergences of plants have resulted in promising but inconsistent results. One limitation is sparse taxon sampling, likely resulting from the difficulty and cost of data generation. To address this limitation, transcriptome data for 92 streptophyte taxa were generated and analyzed along with 11 published plant genome sequences. Phylogenetic reconstructions were conducted using up to 852 nuclear genes and 1,701,170 aligned sites. Sixty-nine analyses were performed to test the robustness of phylogenetic inferences to permutations of the data matrix or to phylogenetic method, including supermatrix, supertree, and coalescent-based approaches, maximum-likelihood and Bayesian methods, partitioned and unpartitioned analyses, and amino acid versus DNA alignments. Among other results, we find robust support for a sister-group relationship between land plants and one group of streptophyte green algae, the Zygnematophyceae. Strong and robust support for a clade comprising liverworts and mosses is inconsistent with a widely accepted view of early land plant evolution, and suggests that phylogenetic hypotheses used to understand the evolution of fundamental plant traits should be reevaluated.


Subject(s)
Evolution, Molecular , Genome, Plant/physiology , Phylogeny , Quantitative Trait, Heritable , Streptophyta/physiology , Transcriptome/physiology , DNA, Plant/genetics , DNA, Plant/metabolism , Gene Expression Profiling , Sequence Alignment , Streptophyta/classification
9.
Genome Announc ; 2(3)2014 Jun 05.
Article in English | MEDLINE | ID: mdl-24903880

ABSTRACT

Geobacillus thermopakistaniensis strain MAS1 was isolated from a hot spring located in the Northern Areas of Pakistan. The draft genome sequence was 3.5 Mb and identified a number of genes of potential industrial importance, including genes encoding glycoside hydrolases, pullulanase, amylopullulanase, glycosidase, and alcohol dehydrogenases.

10.
Genome Announc ; 2(1)2014 Jan 30.
Article in English | MEDLINE | ID: mdl-24482525

ABSTRACT

Bacillus sp. strain CPSM8 is an efficient solubilizer and mineralizer of phosphorus. Here, we present the 4.39-Mb draft genome sequence of the strain, providing insight into the phosphorus-releasing genes related to productivity in aquatic habitats.

11.
Gigascience ; 3: 17, 2014.
Article in English | MEDLINE | ID: mdl-25625010

ABSTRACT

The 1,000 plants (1KP) project is an international multi-disciplinary consortium that has generated transcriptome data from over 1,000 plant species, with exemplars for all of the major lineages across the Viridiplantae (green plants) clade. Here, we describe how to access the data used in a phylogenomics analysis of the first 85 species, and how to visualize our gene and species trees. Users can develop computational pipelines to analyse these data, in conjunction with data of their own that they can upload. Computationally estimated protein-protein interactions and biochemical pathways can be visualized at another site. Finally, we comment on our future plans and how they fit within this scalable system for the dissemination, visualization, and analysis of large multi-species data sets.

12.
BMC Plant Biol ; 13: 13, 2013 Jan 24.
Article in English | MEDLINE | ID: mdl-23347749

ABSTRACT

BACKGROUND: Previous studies in basal angiosperms have provided insight into the diversity within the angiosperm lineage and helped to polarize analyses of flowering plant evolution. However, there is still not an experimental system for genetic studies among basal angiosperms to facilitate comparative studies and functional investigation. It would be desirable to identify a basal angiosperm experimental system that possesses many of the features found in existing plant model systems (e.g., Arabidopsis and Oryza). RESULTS: We have considered all basal angiosperm families for general characteristics important for experimental systems, including availability to the scientific community, growth habit, and membership in a large basal angiosperm group that displays a wide spectrum of phenotypic diversity. Most basal angiosperms are woody or aquatic, thus are not well-suited for large scale cultivation, and were excluded. We further investigated members of Aristolochiaceae for ease of culture, life cycle, genome size, and chromosome number. We demonstrated self-compatibility for Aristolochia elegans and A. fimbriata, and transformation with a GFP reporter construct for Saruma henryi and A. fimbriata. Furthermore, A. fimbriata was easily cultivated with a life cycle of just three months, could be regenerated in a tissue culture system, and had one of the smallest genomes among basal angiosperms. An extensive multi-tissue EST dataset was produced for A. fimbriata that includes over 3.8 million 454 sequence reads. CONCLUSIONS: Aristolochia fimbriata has numerous features that facilitate genetic studies and is suggested as a potential model system for use with a wide variety of technologies. Emerging genetic and genomic tools for A. fimbriata and closely related species can aid the investigation of floral biology, developmental genetics, biochemical pathways important in plant-insect interactions as well as human health, and various other features present in early angiosperms.


Subject(s)
Aristolochia/genetics , Aristolochia/physiology , Genome, Plant/genetics
13.
Nature ; 488(7410): 213-7, 2012 Aug 09.
Article in English | MEDLINE | ID: mdl-22801500

ABSTRACT

Bananas (Musa spp.), including dessert and cooking types, are giant perennial monocotyledonous herbs of the order Zingiberales, a sister group to the well-studied Poales, which include cereals. Bananas are vital for food security in many tropical and subtropical countries and the most popular fruit in industrialized countries. The Musa domestication process started some 7,000 years ago in Southeast Asia. It involved hybridizations between diverse species and subspecies, fostered by human migrations, and selection of diploid and triploid seedless, parthenocarpic hybrids thereafter widely dispersed by vegetative propagation. Half of the current production relies on somaclones derived from a single triploid genotype (Cavendish). Pests and diseases have gradually become adapted, representing an imminent danger for global banana production. Here we describe the draft sequence of the 523-megabase genome of a Musa acuminata doubled-haploid genotype, providing a crucial stepping-stone for genetic improvement of banana. We detected three rounds of whole-genome duplications in the Musa lineage, independently of those previously described in the Poales lineage and the one we detected in the Arecales lineage. This first monocotyledon high-continuity whole-genome sequence reported outside Poales represents an essential bridge for comparative genome analysis in plants. As such, it clarifies commelinid-monocotyledon phylogenetic relationships, reveals Poaceae-specific features and has led to the discovery of conserved non-coding sequences predating monocotyledon-eudicotyledon divergence.


Subject(s)
Evolution, Molecular , Genome, Plant/genetics , Musa/genetics , Conserved Sequence/genetics , DNA Transposable Elements/genetics , Gene Duplication/genetics , Genes, Plant/genetics , Genotype , Haploidy , Molecular Sequence Data , Musa/classification , Phylogeny
14.
Am J Bot ; 99(2): 397-406, 2012 Feb.
Article in English | MEDLINE | ID: mdl-22301890

ABSTRACT

PREMISE OF THE STUDY: The stability of the bimodal karyotype found in Agave and closely related species has long interested botanists. The origin of the bimodal karyotype has been attributed to allopolyploidy, but this hypothesis has not been tested. Next-generation transcriptome sequence data were used to test whether a paleopolyploid event occurred on the same branch of the Agavoideae phylogenetic tree as the origin of the Yucca-Agave bimodal karyotype. METHODS: Illumina RNA-seq data were generated for phylogenetically strategic species in Agavoideae. Paleopolyploidy was inferred in analyses of frequency plots for synonymous substitutions per synonymous site (K(s)) between Hosta, Agave, and Chlorophytum paralogous and orthologous gene pairs. Phylogenies of gene families including paralogous genes for these species and outgroup species were estimated to place inferred paleopolyploid events on a species tree. KEY RESULTS: K(s) frequency plots suggested paleopolyploid events in the history of the genera Agave, Hosta, and Chlorophytum. Phylogenetic analyses of gene families estimated from transcriptome data revealed two polyploid events: one predating the last common ancestor of Agave and Hosta and one within the lineage leading to Chlorophytum. CONCLUSIONS: We found that polyploidy and the origin of the Yucca-Agave bimodal karyotype co-occur on the same lineage consistent with the hypothesis that the bimodal karyotype is a consequence of allopolyploidy. We discuss this and alternative mechanisms for the formation of the Yucca-Agave bimodal karyotype. More generally, we illustrate how the use of next-generation sequencing technology is a cost-efficient means for assessing genome evolution in nonmodel species.


Subject(s)
Karyotyping , Liliaceae/genetics , Phylogeny , Polyploidy , Transcriptome , Base Sequence , Chromosomes, Plant/genetics , Evolution, Molecular , Gene Duplication , Genes, Plant , Liliaceae/classification , Multigene Family , RNA, Plant/genetics , Sequence Analysis, RNA , Species Specificity
15.
Genome Biol ; 13(1): R3, 2012 Jan 26.
Article in English | MEDLINE | ID: mdl-22280555

ABSTRACT

BACKGROUND: Although it is agreed that a major polyploidy event, gamma, occurred within the eudicots, the phylogenetic placement of the event remains unclear. RESULTS: To determine when this polyploidization occurred relative to speciation events in angiosperm history, we employed a phylogenomic approach to investigate the timing of gene set duplications located on syntenic gamma blocks. We populated 769 putative gene families with large sets of homologs obtained from public transcriptomes of basal angiosperms, magnoliids, asterids, and more than 91.8 gigabases of new next-generation transcriptome sequences of non-grass monocots and basal eudicots. The overwhelming majority (95%) of well-resolved gamma duplications was placed before the separation of rosids and asterids and after the split of monocots and eudicots, providing strong evidence that the gamma polyploidy event occurred early in eudicot evolution. Further, the majority of gene duplications was placed after the divergence of the Ranunculales and core eudicots, indicating that the gamma appears to be restricted to core eudicots. Molecular dating estimates indicate that the duplication events were intensely concentrated around 117 million years ago. CONCLUSIONS: The rapid radiation of core eudicot lineages that gave rise to nearly 75% of angiosperm species appears to have occurred coincidentally or shortly following the gamma triplication event. Reconciliation of gene trees with a species phylogeny can elucidate the timing of major events in genome evolution, even when genome sequences are only available for a subset of species represented in the gene trees. Comprehensive transcriptome datasets are valuable complements to genome sequences for high-resolution phylogenomic analysis.


Subject(s)
Gene Duplication , Magnoliopsida/genetics , Plant Proteins/genetics , Polyploidy , Evolution, Molecular , Gene Expression Profiling , Genetic Speciation , Genome, Plant , Phylogeny
16.
Genome Biol ; 12(5): R48, 2011.
Article in English | MEDLINE | ID: mdl-21619600

ABSTRACT

BACKGROUND: Recent phylogenetic analyses have identified Amborella trichopoda, an understory tree species endemic to the forests of New Caledonia, as sister to a clade including all other known flowering plant species. The Amborella genome is a unique reference for understanding the evolution of angiosperm genomes because it can serve as an outgroup to root comparative analyses. A physical map, BAC end sequences and sample shotgun sequences provide a first view of the 870 Mbp Amborella genome. RESULTS: Analysis of Amborella BAC ends sequenced from each contig suggests that the density of long terminal repeat retrotransposons is negatively correlated with that of protein coding genes. Syntenic, presumably ancestral, gene blocks were identified in comparisons of the Amborella BAC contigs and the sequenced Arabidopsis thaliana, Populus trichocarpa, Vitis vinifera and Oryza sativa genomes. Parsimony mapping of the loss of synteny corroborates previous analyses suggesting that the rate of structural change has been more rapid on lineages leading to Arabidopsis and Oryza compared with lineages leading to Populus and Vitis. The gamma paleohexiploidy event identified in the Arabidopsis, Populus and Vitis genomes is shown to have occurred after the divergence of all other known angiosperms from the lineage leading to Amborella. CONCLUSIONS: When placed in the context of a physical map, BAC end sequences representing just 5.4% of the Amborella genome have facilitated reconstruction of gene blocks that existed in the last common ancestor of all flowering plants. The Amborella genome is an invaluable reference for inferences concerning the ancestral angiosperm and subsequent genome evolution.


Subject(s)
Contig Mapping/methods , Evolution, Molecular , Genome, Plant , Genomics/methods , Magnoliopsida/genetics , Databases, Genetic , Magnoliopsida/classification , New Caledonia , Open Reading Frames/genetics , Phylogeny , Phylogeography , Ploidies , Retroelements , Sequence Analysis, DNA , Synteny
17.
Nature ; 473(7345): 97-100, 2011 May 05.
Article in English | MEDLINE | ID: mdl-21478875

ABSTRACT

Whole-genome duplication (WGD), or polyploidy, followed by gene loss and diploidization has long been recognized as an important evolutionary force in animals, fungi and other organisms, especially plants. The success of angiosperms has been attributed, in part, to innovations associated with gene or whole-genome duplications, but evidence for proposed ancient genome duplications pre-dating the divergence of monocots and eudicots remains equivocal in analyses of conserved gene order. Here we use comprehensive phylogenomic analyses of sequenced plant genomes and more than 12.6 million new expressed-sequence-tag sequences from phylogenetically pivotal lineages to elucidate two groups of ancient gene duplications-one in the common ancestor of extant seed plants and the other in the common ancestor of extant angiosperms. Gene duplication events were intensely concentrated around 319 and 192 million years ago, implicating two WGDs in ancestral lineages shortly before the diversification of extant seed plants and extant angiosperms, respectively. Significantly, these ancestral WGDs resulted in the diversification of regulatory genes important to seed and flower development, suggesting that they were involved in major innovations that ultimately contributed to the rise and eventual dominance of seed plants and angiosperms.


Subject(s)
Evolution, Molecular , Genome, Plant/genetics , Magnoliopsida/classification , Magnoliopsida/genetics , Polyploidy , Genomics , Phylogeny
18.
Environ Sci Technol ; 42(24): 9231-6, 2008 Dec 15.
Article in English | MEDLINE | ID: mdl-19174897

ABSTRACT

The SPARC vapor pressure and activity coefficient models were coupled to successfully estimate Henry's Law Constant (HLC) in water and in hexadecane for a wide range of organic compounds without modification to, or additional parametrization of, either SPARC model. The vapor pressure model quantifies the solute-solute intermolecular interactions in the pure liquid phase, whereas the activity coefficient model quantifies the solute-solvent and solvent-solvent (in addition to the solute-solute) interactions upon placing solute, i, in solvent, j. These intermolecular interactions are factored into dispersion, induction, dipole-dipole, and H-bonding components upon moving a solute molecule from the gas to the liquid phase. The SPARC HLC calculator so produced was tested and validated on the largest experimental HLC data set to date: 1356 organic solutes, spanning a wide range of functional groups, dipolarities and H-bonding capabilities, such as PAHs, PCBs,VOCs, amides, pesticides, and pharmaceuticals. The rms deviation errors for the calculated versus experimental log HLCs for 1222 compounds in water and 563 in hexadecane were 0.456 and 0.192 log [(mol/L)/(mol/L)] units, respectively, spanning a range of more than 13 and 20 log HLC dimensionless units for the compounds in water and hexadecane, respectively. The SPARC calculator web version is available for public use, free of charge, and can be accessed at http://sparc.chem.uga.edu.


Subject(s)
Air , Alkanes/chemistry , Models, Chemical , Organic Chemicals/chemistry , Water/chemistry , Pesticides/chemistry , Polychlorinated Biphenyls/chemistry , Polycyclic Aromatic Hydrocarbons/chemistry , Reproducibility of Results , Temperature , Vapor Pressure
19.
Mol Pharm ; 4(4): 498-512, 2007.
Article in English | MEDLINE | ID: mdl-17629304

ABSTRACT

Most pharmacologically active molecules contain one or more ionizing groups, and it is well-known that knowledge of the ionization state of a drug, indicated by the pKa value, is critical for understanding many properties important to the drug discovery and development process. The ionization state of a compound directly influences such important pharmaceutical characteristics as aqueous solubility, permeability, crystal structure, etc. Tremendous advances have been made in the field of experimental determination of pKa, in terms of both quantity/speed and quality/accuracy. However, there still remains a need for accurate in silico predictions of pKa both to estimate this parameter for virtual compounds and to focus screening efforts of real compounds. The computer program SPARC (SPARC Performs Automated Reasoning in Chemistry) was used to predict the ionization state of a drug. This program has been developed based on the solid physical chemistry of reactivity models and applied to successfully predict numerous physical properties as well as chemical reactivity parameters. SPARC predicts both macroscopic and microscopic pKa values strictly from molecular structure. In this paper, we describe the details of the SPARC reactivity computational methods and its performance on predicting the pKa values of known drugs as well as Pfizer internal discovery/development compounds. A high correlation (r2=0.92) between experimental and the SPARC calculated pKa values was obtained with root-mean-square error (RMSE) of 0.78 log unit for a set of 123 compounds including many known drugs. For a set of 537 compounds from the Pfizer internal dataset, correlation coefficient r2=0.80 and RMSE=1.05 were obtained.


Subject(s)
Computer Simulation , Ions/chemistry , Pharmaceutical Preparations/chemistry , Software , Electrophoresis, Capillary , Hydrogen Bonding , Models, Chemical , Molecular Structure , Static Electricity , Stereoisomerism , Thermodynamics
SELECTION OF CITATIONS
SEARCH DETAIL
...