Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 12 de 12
Filter
Add more filters










Publication year range
1.
PLoS Comput Biol ; 20(2): e1011270, 2024 Feb.
Article in English | MEDLINE | ID: mdl-38324613

ABSTRACT

CyVerse, the largest publicly-funded open-source research cyberinfrastructure for life sciences, has played a crucial role in advancing data-driven research since the 2010s. As the technology landscape evolved with the emergence of cloud computing platforms, machine learning and artificial intelligence (AI) applications, CyVerse has enabled access by providing interfaces, Software as a Service (SaaS), and cloud-native Infrastructure as Code (IaC) to leverage new technologies. CyVerse services enable researchers to integrate institutional and private computational resources, custom software, perform analyses, and publish data in accordance with open science principles. Over the past 13 years, CyVerse has registered more than 124,000 verified accounts from 160 countries and was used for over 1,600 peer-reviewed publications. Since 2011, 45,000 students and researchers have been trained to use CyVerse. The platform has been replicated and deployed in three countries outside the US, with additional private deployments on commercial clouds for US government agencies and multinational corporations. In this manuscript, we present a strategic blueprint for creating and managing SaaS cyberinfrastructure and IaC as free and open-source software.


Subject(s)
Artificial Intelligence , Software , Humans , Cloud Computing , Publishing
2.
Genome Biol Evol ; 15(11)2023 11 01.
Article in English | MEDLINE | ID: mdl-37931037

ABSTRACT

The evolution and diversification of proteins capable of remodeling domains has been critical for transcriptional reprogramming during cell fate determination in multicellular eukaryotes. Chromatin remodeling proteins of the CHD3 family have been shown to have important and antagonistic impacts on seed development in the model plant, Arabidopsis thaliana, yet the basis of this functional divergence remains unknown. In this study, we demonstrate that genes encoding the CHD3 proteins PICKLE (PKL) and PICKLE-RELATED 2 (PKR2) originated from a duplication event during the diversification of crown Brassicaceae, and that these homologs have undergone distinct evolutionary trajectories since this duplication, with PKR2 fast evolving under positive selection, while PKL is subject to purifying selection. We find that the rapid evolution of PKR2 under positive selection reduces the encoded protein's intrinsic disorder, possibly suggesting a tertiary structure configuration which differs from that of PKL. Our whole genome transcriptome analysis in seeds of pkr2 and pkl mutants reveals that they act antagonistically on the expression of specific sets of genes, providing a basis for their differing roles in seed development. Our results provide insights into how gene duplication and neofunctionalization can lead to differing and antagonistic selective pressures on transcriptomes during plant reproduction, as well as on the evolutionary diversification of the CHD3 family within seed plants.


Subject(s)
Arabidopsis Proteins , Arabidopsis , Arabidopsis/genetics , Arabidopsis Proteins/genetics , Gene Expression Regulation, Plant , Seeds/genetics , Transcription Factors/genetics , Transcriptome , Gene Duplication
4.
Mol Biol Evol ; 36(6): 1239-1253, 2019 06 01.
Article in English | MEDLINE | ID: mdl-30913563

ABSTRACT

Genomic imprinting is an epigenetic phenomenon where autosomal genes display uniparental expression depending on whether they are maternally or paternally inherited. Genomic imprinting can arise from parental conflicts over resource allocation to the offspring, which could drive imprinted loci to evolve by positive selection. We investigate whether positive selection is associated with genomic imprinting in the inbreeding species Arabidopsis thaliana. Our analysis of 140 genes regulated by genomic imprinting in the A. thaliana seed endosperm demonstrates they are evolving more rapidly than expected. To investigate whether positive selection drives this evolutionary acceleration, we identified orthologs of each imprinted gene across 34 plant species and elucidated their evolutionary trajectories. Increased positive selection was sought by comparing its incidence among imprinted genes with nonimprinted controls. Strikingly, we find a statistically significant enrichment of imprinted paternally expressed genes (iPEGs) evolving under positive selection, 50.6% of the total, but no such enrichment for positive selection among imprinted maternally expressed genes (iMEGs). This suggests that maternally- and paternally expressed imprinted genes are subject to different selective pressures. Almost all positively selected amino acids were fixed across 80 sequenced A. thaliana accessions, suggestive of selective sweeps in the A. thaliana lineage. The imprinted genes under positive selection are involved in processes important for seed development including auxin biosynthesis and epigenetic regulation. Our findings support a genomic imprinting model for plants where positive selection can affect paternally expressed genes due to continued conflict with maternal sporophyte tissues, even when parental conflict is reduced in predominantly inbreeding species.


Subject(s)
Arabidopsis/genetics , Evolution, Molecular , Genomic Imprinting , Selection, Genetic , Arabidopsis Proteins/genetics , DNA-Directed RNA Polymerases/genetics
5.
Plant J ; 92(6): 1044-1058, 2017 Dec.
Article in English | MEDLINE | ID: mdl-29024088

ABSTRACT

Genomic imprinting in the seed endosperm could be due to unequal parental-genome contribution effects in triploid endosperm tissue that trigger parent-of-origin specific activation and/or silencing of loci prone to genomic imprinting. To determine whether genomic imprinting is triggered by unequal parental-genome contribution effects, we generated a whole-genome transcriptome dataset of F1 hybrid triploid embryos (as mimics of F1 hybrid triploid endosperm). For the vast majority of genes, the parental contributions to their expression levels in the F1 triploid hybrid embryos follow a biallelic and linear expression pattern. While allele-specific expression (ASE) bias was detected, such effects were predominantly parent-of-origin independent. We demonstrate that genomic imprinting is largely absent from F1 triploid embryos, strongly suggesting that neither triploidy nor unequal parental-genome contribution are key triggers of genomic imprinting in plants. However, extensive parental-genome dosage effects on gene expression were observed between the reciprocal F1 hybrid embryos, particularly for genes involved in defence response and nutrient reservoir activity, potentially leading to the seed size differences between reciprocal triploids. We further determined that unequal parental-genome contribution in F1 triploids can lead to overexpression effects that are parent-of-origin dependent, and which are not observed in diploid or tetraploid embryos in which the parental-genome dosage is balanced. Overall, our study demonstrates that neither triploidy nor unequal parental-genome contribution is sufficient to trigger imprinting in plant tissues, suggesting that genomic imprinting is an intrinsic and unique feature of the triploid seed endosperm.


Subject(s)
Arabidopsis/genetics , Genome, Plant/genetics , Genomic Imprinting , Transcriptome , Alleles , Arabidopsis/growth & development , Diploidy , Endosperm/genetics , Endosperm/growth & development , Epigenomics , Seeds/genetics , Seeds/growth & development , Sequence Analysis, RNA , Triploidy
6.
Nat Commun ; 5: 3706, 2014 Apr 23.
Article in English | MEDLINE | ID: mdl-24759634

ABSTRACT

Camelina sativa is an oilseed with desirable agronomic and oil-quality attributes for a viable industrial oil platform crop. Here we generate the first chromosome-scale high-quality reference genome sequence for C. sativa and annotated 89,418 protein-coding genes, representing a whole-genome triplication event relative to the crucifer model Arabidopsis thaliana. C. sativa represents the first crop species to be sequenced from lineage I of the Brassicaceae. The well-preserved hexaploid genome structure of C. sativa surprisingly mirrors those of economically important amphidiploid Brassica crop species from lineage II as well as wheat and cotton. The three genomes of C. sativa show no evidence of fractionation bias and limited expression-level bias, both characteristics commonly associated with polyploid evolution. The highly undifferentiated polyploid genome of C. sativa presents significant consequences for breeding and genetic manipulation of this industrial oil crop.


Subject(s)
Biofuels , Brassicaceae/genetics , Genome, Plant , Polyploidy , Karyotyping
7.
DNA Res ; 20(5): 485-95, 2013 Oct.
Article in English | MEDLINE | ID: mdl-23792890

ABSTRACT

The hybrid pigeonpea (Cajanus cajan) breeding technology based on cytoplasmic male sterility (CMS) is currently unique among legumes and displays major potential for yield increase. CMS is defined as a condition in which a plant is unable to produce functional pollen grains. The novel chimeric open reading frames (ORFs) produced as a results of mitochondrial genome rearrangements are considered to be the main cause of CMS. To identify these CMS-related ORFs in pigeonpea, we sequenced the mitochondrial genomes of three C. cajan lines (the male-sterile line ICPA 2039, the maintainer line ICPB 2039, and the hybrid line ICPH 2433) and of the wild relative (Cajanus cajanifolius ICPW 29). A single, circular-mapping molecule of length 545.7 kb was assembled and annotated for the ICPA 2039 line. Sequence annotation predicted 51 genes, including 34 protein-coding and 17 RNA genes. Comparison of the mitochondrial genomes from different Cajanus genotypes identified 31 ORFs, which differ between lines within which CMS is present or absent. Among these chimeric ORFs, 13 were identified by comparison of the related male-sterile and maintainer lines. These ORFs display features that are known to trigger CMS in other plant species and to represent the most promising candidates for CMS-related mitochondrial rearrangements in pigeonpea.


Subject(s)
Chimera , Cytoplasm/metabolism , DNA, Mitochondrial/genetics , Open Reading Frames , Phaseolus/genetics , Pollen , Genotype , Species Specificity
8.
BMC Res Notes ; 5: 359, 2012 Jul 17.
Article in English | MEDLINE | ID: mdl-22805311

ABSTRACT

BACKGROUND: Synonymous codon usage bias has typically been correlated with, and attributed to translational efficiency. However, there are other pressures on genomic sequence composition that can affect codon usage patterns such as mutational biases. This study provides an analysis of the codon usage patterns in Arabidopsis thaliana in relation to gene expression levels, codon volatility, mutational biases and selective pressures. RESULTS: We have performed synonymous codon usage and codon volatility analyses for all genes in the A. thaliana genome. In contrast to reports for species from other kingdoms, we find that neither codon usage nor volatility are correlated with selection pressure (as measured by dN/dS), nor with gene expression levels on a genome wide level. Our results show that codon volatility and usage are not synonymous, rather that they are correlated with the abundance of G and C at the third codon position (GC3). CONCLUSIONS: Our results indicate that while the A. thaliana genome shows evidence for synonymous codon usage bias, this is not related to the expression levels of its constituent genes. Neither codon volatility nor codon usage are correlated with expression levels or selective pressures but, because they are directly related to the composition of G and C at the third codon position, they are the result of mutational bias. Therefore, in A. thaliana codon volatility and usage do not result from selection for translation efficiency or protein functional shift as measured by positive selection.


Subject(s)
Arabidopsis/genetics , Codon , Base Sequence , Brassica/genetics , Cloning, Molecular , DNA Primers , DNA, Complementary , Genes, Plant , Polymerase Chain Reaction , Protein Biosynthesis
9.
Mol Plant ; 5(5): 1020-8, 2012 Sep.
Article in English | MEDLINE | ID: mdl-22241453

ABSTRACT

A comprehensive transcriptome assembly for pigeonpea has been developed by analyzing 128.9 million short Illumina GA IIx single end reads, 2.19 million single end FLX/454 reads, and 18 353 Sanger expressed sequenced tags from more than 16 genotypes. The resultant transcriptome assembly, referred to as CcTA v2, comprised 21 434 transcript assembly contigs (TACs) with an N50 of 1510 bp, the largest one being ~8 kb. Of the 21 434 TACs, 16 622 (77.5%) could be mapped on to the soybean genome build 1.0.9 under fairly stringent alignment parameters. Based on knowledge of intron junctions, 10 009 primer pairs were designed from 5033 TACs for amplifying intron spanning regions (ISRs). By using in silico mapping of BAC-end-derived SSR loci of pigeonpea on the soybean genome as a reference, putative mapping positions at the chromosome level were predicted for 6284 ISR markers, covering all 11 pigeonpea chromosomes. A subset of 128 ISR markers were analyzed on a set of eight genotypes. While 116 markers were validated, 70 markers showed one to three alleles, with an average of 0.16 polymorphism information content (PIC) value. In summary, the CcTA v2 transcript assembly and ISR markers will serve as a useful resource to accelerate genetic research and breeding applications in pigeonpea.


Subject(s)
Cajanus/genetics , Sequence Analysis, DNA/methods , Transcriptome , Cajanus/classification , Chromosome Mapping , Fabaceae/classification , Fabaceae/genetics , Genotype , Phylogeny , Polymorphism, Single Nucleotide
10.
Nat Biotechnol ; 30(1): 83-9, 2011 Nov 06.
Article in English | MEDLINE | ID: mdl-22057054

ABSTRACT

Pigeonpea is an important legume food crop grown primarily by smallholder farmers in many semi-arid tropical regions of the world. We used the Illumina next-generation sequencing platform to generate 237.2 Gb of sequence, which along with Sanger-based bacterial artificial chromosome end sequences and a genetic map, we assembled into scaffolds representing 72.7% (605.78 Mb) of the 833.07 Mb pigeonpea genome. Genome analysis predicted 48,680 genes for pigeonpea and also showed the potential role that certain gene families, for example, drought tolerance-related genes, have played throughout the domestication of pigeonpea and the evolution of its ancestors. Although we found a few segmental duplication events, we did not observe the recent genome-wide duplication events observed in soybean. This reference genome sequence will facilitate the identification of the genetic basis of agronomically important traits, and accelerate the development of improved pigeonpea varieties that could improve food security in many developing countries.


Subject(s)
Cajanus/genetics , Genes, Plant , Genome, Plant , Sequence Analysis, DNA/methods , Chromosome Mapping , Chromosomes, Artificial, Bacterial/genetics , Genetic Markers , Molecular Sequence Annotation , Repetitive Sequences, Nucleic Acid/genetics , Segmental Duplications, Genomic , Glycine max/genetics , Synteny/genetics
11.
Plant Biotechnol J ; 9(8): 922-31, 2011 Oct.
Article in English | MEDLINE | ID: mdl-21615673

ABSTRACT

Chickpea (Cicer arietinum L.) is an important legume crop in the semi-arid regions of Asia and Africa. Gains in crop productivity have been low however, particularly because of biotic and abiotic stresses. To help enhance crop productivity using molecular breeding techniques, next generation sequencing technologies such as Roche/454 and Illumina/Solexa were used to determine the sequence of most gene transcripts and to identify drought-responsive genes and gene-based molecular markers. A total of 103,215 tentative unique sequences (TUSs) have been produced from 435,018 Roche/454 reads and 21,491 Sanger expressed sequence tags (ESTs). Putative functions were determined for 49,437 (47.8%) of the TUSs, and gene ontology assignments were determined for 20,634 (41.7%) of the TUSs. Comparison of the chickpea TUSs with the Medicago truncatula genome assembly (Mt 3.5.1 build) resulted in 42,141 aligned TUSs with putative gene structures (including 39,281 predicted intron/splice junctions). Alignment of ∼37 million Illumina/Solexa tags generated from drought-challenged root tissues of two chickpea genotypes against the TUSs identified 44,639 differentially expressed TUSs. The TUSs were also used to identify a diverse set of markers, including 728 simple sequence repeats (SSRs), 495 single nucleotide polymorphisms (SNPs), 387 conserved orthologous sequence (COS) markers, and 2088 intron-spanning region (ISR) markers. This resource will be useful for basic and applied research for genome analysis and crop improvement in chickpea.


Subject(s)
Chromosome Mapping/methods , Cicer/genetics , Gene Expression Profiling/methods , Genome, Plant , Africa , Asia , Cicer/metabolism , Cicer/physiology , Droughts , Energy Metabolism , Expressed Sequence Tags , Gene Expression Regulation, Plant , Gene Library , Genetic Markers , Genotype , Introns , Medicago truncatula/genetics , Microsatellite Repeats , Plant Roots/genetics , Polymorphism, Single Nucleotide , Sequence Alignment/methods , Stress, Physiological , Transcription Factors/genetics
12.
DNA Res ; 18(3): 153-64, 2011 Jun.
Article in English | MEDLINE | ID: mdl-21565938

ABSTRACT

This study reports generation of large-scale genomic resources for pigeonpea, a so-called 'orphan crop species' of the semi-arid tropic regions. FLX/454 sequencing carried out on a normalized cDNA pool prepared from 31 tissues produced 494 353 short transcript reads (STRs). Cluster analysis of these STRs, together with 10 817 Sanger ESTs, resulted in a pigeonpea trancriptome assembly (CcTA) comprising of 127 754 tentative unique sequences (TUSs). Functional analysis of these TUSs highlights several active pathways and processes in the sampled tissues. Comparison of the CcTA with the soybean genome showed similarity to 10 857 and 16 367 soybean gene models (depending on alignment methods). Additionally, Illumina 1G sequencing was performed on Fusarium wilt (FW)- and sterility mosaic disease (SMD)-challenged root tissues of 10 resistant and susceptible genotypes. More than 160 million sequence tags were used to identify FW- and SMD-responsive genes. Sequence analysis of CcTA and the Illumina tags identified a large new set of markers for use in genetics and breeding, including 8137 simple sequence repeats, 12 141 single-nucleotide polymorphisms and 5845 intron-spanning regions. Genomic resources developed in this study should be useful for basic and applied research, not only for pigeonpea improvement but also for other related, agronomically important legumes.


Subject(s)
Cajanus/genetics , Genome, Plant , Gene Expression Profiling , Genetic Markers , Introns , Microsatellite Repeats , Multigene Family , Polymorphism, Single Nucleotide , Transcription, Genetic
SELECTION OF CITATIONS
SEARCH DETAIL
...