Your browser doesn't support javascript.
loading
: 20 | 50 | 100
1 - 13 de 13
1.
Genetics ; 226(1)2024 Jan 03.
Article En | MEDLINE | ID: mdl-37865119

The discovery that experimental delivery of dsRNA can induce gene silencing at target genes revolutionized genetics research, by both uncovering essential biological processes and creating new tools for developmental geneticists. However, the efficacy of exogenous RNA interference (RNAi) varies dramatically within the Caenorhabditis elegans natural population, raising questions about our understanding of RNAi in the lab relative to its activity and significance in nature. Here, we investigate why some wild strains fail to mount a robust RNAi response to germline targets. We observe diversity in mechanism: in some strains, the response is stochastic, either on or off among individuals, while in others, the response is consistent but delayed. Increased activity of the Argonaute PPW-1, which is required for germline RNAi in the laboratory strain N2, rescues the response in some strains but dampens it further in others. Among wild strains, genes known to mediate RNAi exhibited very high expression variation relative to other genes in the genome as well as allelic divergence and strain-specific instances of pseudogenization at the sequence level. Our results demonstrate functional diversification in the small RNA pathways in C. elegans and suggest that RNAi processes are evolving rapidly and dynamically in nature.


Caenorhabditis elegans Proteins , Caenorhabditis elegans , Humans , Animals , RNA Interference , Caenorhabditis elegans/genetics , Caenorhabditis elegans/metabolism , Caenorhabditis elegans Proteins/metabolism , RNA, Small Interfering/genetics , RNA, Small Interfering/metabolism , RNA, Double-Stranded/metabolism , Germ Cells/metabolism
2.
Genetics ; 224(4)2023 08 09.
Article En | MEDLINE | ID: mdl-37310925

Codon bias and mRNA folding strength (mF) are hypothesized molecular mechanisms by which polymorphisms in genes modify protein expression. Natural patterns of codon bias and mF across genes as well as effects of altering codon bias and mF suggest that the influence of these 2 mechanisms may vary depending on the specific location of polymorphisms within a transcript. Despite the central role codon bias and mF may play in natural trait variation within populations, systematic studies of how polymorphic codon bias and mF relate to protein expression variation are lacking. To address this need, we analyzed genomic, transcriptomic, and proteomic data for 22 Saccharomyces cerevisiae isolates, estimated protein accumulation for each allele of 1,620 genes as the log of protein molecules per RNA molecule (logPPR), and built linear mixed-effects models associating allelic variation in codon bias and mF with allelic variation in logPPR. We found that codon bias and mF interact synergistically in a positive association with logPPR, and this interaction explains almost all the effects of codon bias and mF. We examined how the locations of polymorphisms within transcripts influence their effects and found that codon bias primarily acts through polymorphisms in domain-encoding and 3' coding sequences, while mF acts most significantly through coding sequences with weaker effects from untranslated regions. Our results present the most comprehensive characterization to date of how polymorphisms in transcripts influence protein expression.


Codon Usage , Saccharomyces cerevisiae , RNA, Messenger/genetics , RNA, Messenger/metabolism , Saccharomyces cerevisiae/genetics , Saccharomyces cerevisiae/metabolism , Proteomics , RNA Folding , Codon/genetics
3.
Mol Biol Cell ; 32(15): 1335-1346, 2021 07 15.
Article En | MEDLINE | ID: mdl-34010017

Endogenous RNA interference (RNAi) pathways regulate a wide range of cellular processes in diverse eukaryotes, yet in the ciliated eukaryote, Tetrahymena thermophila, the cellular purpose of RNAi pathways that generate ∼23-24 nucleotide (nt) small (s)RNAs has remained unknown. Here, we investigated the phenotypic and gene expression impacts on vegetatively growing cells when genes involved in ∼23-24 nt sRNA biogenesis are disrupted. We observed slower proliferation and increased expression of genes involved in DNA metabolism and chromosome organization and maintenance in sRNA biogenesis mutants RSP1Δ, RDN2Δ, and RDF2Δ. In addition, RSP1Δ and RDN2Δ cells frequently exhibited enlarged chromatin extrusion bodies, which are nonnuclear, DNA-containing structures that may be akin to mammalian micronuclei. Expression of homologous recombination factor Rad51 was specifically elevated in RSP1Δ and RDN2Δ strains, with Rad51 and double-stranded DNA break marker γ-H2A.X localized to discrete macronuclear foci. In addition, an increase in Rad51 and γ-H2A.X foci was also found in knockouts of TWI8, a macronucleus-localized PIWI protein. Together, our findings suggest that an evolutionarily conserved role for RNAi pathways in maintaining genome integrity may be extended even to the early branching eukaryotic lineage that gave rise to Tetrahymena thermophila.


DNA Repair , RNA, Small Interfering/metabolism , Tetrahymena thermophila/genetics , Tetrahymena thermophila/metabolism , DNA/metabolism , DNA Breaks, Double-Stranded , Evolution, Molecular , Gene Expression Profiling , Gene Expression Regulation , Protozoan Proteins , Rad51 Recombinase/genetics , Recombinational DNA Repair , Sequence Analysis, RNA
4.
Mol Biol Cell ; 30(12): 1359-1368, 2019 06 01.
Article En | MEDLINE | ID: mdl-31145670

We provide guidelines for using statistical methods to analyze the types of experiments reported in cellular and molecular biology journals such as Molecular Biology of the Cell. Our aim is to help experimentalists use these methods skillfully, avoid mistakes, and extract the maximum amount of information from their laboratory work. We focus on comparing the average values of control and experimental samples. A Supplemental Tutorial provides examples of how to analyze experimental data using R software.


Cell Biology , Molecular Biology , Statistics as Topic , Animals , Caenorhabditis elegans/embryology , Decision Trees
5.
G3 (Bethesda) ; 3(6): 941-7, 2013 Jun 21.
Article En | MEDLINE | ID: mdl-23589516

Resolving the genetic complexity of heritable phenotypic variation is fundamental to understanding the mechanisms of evolution and the etiology of human disease. Trait variation among isolates from genetically efficient model organisms offers the opportunity to dissect genetic architectures and identify the molecular mechanisms of causation. Here we present a genetic analysis of loss of sensitivity to gene knockdown via exogenous RNA interference in the germline of a wild isolate of the roundworm Caenorhabditis elegans. We find that the loss of RNA interference sensitivity in the wild isolate CB4856 is recessive to the sensitivity of the lab strain N2. A cross of the strains produced F2 with intermediate sensitivities, and the segregation of the trait among F2s strongly deviated from a single locus recessive allele expectation. Linkage analysis in recombinant inbred lines derived from CB4856 and N2 identified a single significant locus on chromosome I that includes the argonaute gene ppw-1. The alleles for ppw-1 were unable to explain the sensitivity of 18 (12.1%) of the recombinant inbred lines. Complementation tests and F2 segregation analysis of these recombinant inbred lines revealed cases of complex epistatic suppression and enhancement of the effects of ppw-1. We conclude that the variation in RNA interference sensitivity between CB4856 and N2 likely involves the nonadditive interactions of eight or more genes in addition to ppw-1.


Caenorhabditis elegans/genetics , Caenorhabditis elegans/isolation & purification , Germ Cells/metabolism , RNA Interference , Animals , Caenorhabditis elegans Proteins/metabolism , Chromosome Mapping , Chromosome Segregation/genetics , Crosses, Genetic , Epistasis, Genetic , Female , Genes, Dominant , Humans , Inbreeding , Male , Phenotype , Protein Serine-Threonine Kinases/metabolism , Quantitative Trait Loci/genetics , Recombination, Genetic/genetics , Repressor Proteins/metabolism
6.
Methods Mol Biol ; 871: 31-9, 2012.
Article En | MEDLINE | ID: mdl-22565831

Recombinant inbred lines (RILs) are a collection of strains that can be used to map quantitative trait loci. Parent strains are crossed to create recombinants that are then inbred to isogenicity, resulting in a permanent resource for trait mapping and analysis. Here I describe the process of designing and constructing RILs. This consists of the following steps. Parent strains are selected based on phenotype, marker availability, and compatibility, and they may be genetically engineered to remove unwanted variation or to introduce reporters. A construction design scheme is determined, including the target population size, if and how advanced intercrossing will be done, and the number of generations of inbreeding. Parent crosses and F1 crosses are performed to create an F2 population. Depending on design, advanced intercrossing may be implemented to increase mapping resolution through the accumulation of additional meiotic crossover events. Finally, lines are inbred to create genetically stable recombinant lines. I discuss tips and techniques for maximizing mapping power and resolution and minimizing resource investment for each stage of the process.


Crosses, Genetic , Quantitative Trait Loci/genetics , Chromosome Mapping , Genetic Linkage/genetics , Inbreeding , Recombination, Genetic/genetics
7.
PLoS Biol ; 6(2): e27, 2008 Feb.
Article En | MEDLINE | ID: mdl-18271625

Identifying the genomic regions bound by sequence-specific regulatory factors is central both to deciphering the complex DNA cis-regulatory code that controls transcription in metazoans and to determining the range of genes that shape animal morphogenesis. We used whole-genome tiling arrays to map sequences bound in Drosophila melanogaster embryos by the six maternal and gap transcription factors that initiate anterior-posterior patterning. We find that these sequence-specific DNA binding proteins bind with quantitatively different specificities to highly overlapping sets of several thousand genomic regions in blastoderm embryos. Specific high- and moderate-affinity in vitro recognition sequences for each factor are enriched in bound regions. This enrichment, however, is not sufficient to explain the pattern of binding in vivo and varies in a context-dependent manner, demonstrating that higher-order rules must govern targeting of transcription factors. The more highly bound regions include all of the over 40 well-characterized enhancers known to respond to these factors as well as several hundred putative new cis-regulatory modules clustered near developmental regulators and other genes with patterned expression at this stage of embryogenesis. The new targets include most of the microRNAs (miRNAs) transcribed in the blastoderm, as well as all major zygotically transcribed dorsal-ventral patterning genes, whose expression we show to be quantitatively modulated by anterior-posterior factors. In addition to these highly bound regions, there are several thousand regions that are reproducibly bound at lower levels. However, these poorly bound regions are, collectively, far more distant from genes transcribed in the blastoderm than highly bound regions; are preferentially found in protein-coding sequences; and are less conserved than highly bound regions. Together these observations suggest that many of these poorly bound regions are not involved in early-embryonic transcriptional regulation, and a significant proportion may be nonfunctional. Surprisingly, for five of the six factors, their recognition sites are not unambiguously more constrained evolutionarily than the immediate flanking DNA, even in more highly bound and presumably functional regions, indicating that comparative DNA sequence analysis is limited in its ability to identify functional transcription factor targets.


Blastoderm/metabolism , Drosophila melanogaster/embryology , Transcription Factors/metabolism , Animals , Binding Sites , DNA/metabolism , Evolution, Molecular , MicroRNAs/metabolism
8.
Nature ; 450(7167): 203-18, 2007 Nov 08.
Article En | MEDLINE | ID: mdl-17994087

Comparative analysis of multiple genomes in a phylogenetic framework dramatically improves the precision and sensitivity of evolutionary inference, producing more robust results than single-genome analyses can provide. The genomes of 12 Drosophila species, ten of which are presented here for the first time (sechellia, simulans, yakuba, erecta, ananassae, persimilis, willistoni, mojavensis, virilis and grimshawi), illustrate how rates and patterns of sequence divergence across taxa can illuminate evolutionary processes on a genomic scale. These genome sequences augment the formidable genetic tools that have made Drosophila melanogaster a pre-eminent model for animal genetics, and will further catalyse fundamental research on mechanisms of development, cell biology, genetics, disease, neurobiology, behaviour, physiology and evolution. Despite remarkable similarities among these Drosophila species, we identified many putatively non-neutral changes in protein-coding genes, non-coding RNA genes, and cis-regulatory regions. These may prove to underlie differences in the ecology and behaviour of these diverse species.


Drosophila/classification , Drosophila/genetics , Evolution, Molecular , Genes, Insect/genetics , Genome, Insect/genetics , Genomics , Phylogeny , Animals , Codon/genetics , DNA Transposable Elements/genetics , Drosophila/immunology , Drosophila/metabolism , Drosophila Proteins/genetics , Gene Order/genetics , Genome, Mitochondrial/genetics , Immunity/genetics , Multigene Family/genetics , RNA, Untranslated/genetics , Reproduction/genetics , Sequence Alignment , Sequence Analysis, DNA , Synteny/genetics
9.
PLoS Genet ; 2(10): e173, 2006 Oct 27.
Article En | MEDLINE | ID: mdl-17132051

The phylogenetic relationship of the now fully sequenced species Drosophila erecta and D. yakuba with respect to the D. melanogaster species complex has been a subject of controversy. All three possible groupings of the species have been reported in the past, though recent multi-gene studies suggest that D. erecta and D. yakuba are sister species. Using the whole genomes of each of these species as well as the four other fully sequenced species in the subgenus Sophophora, we set out to investigate the placement of D. erecta and D. yakuba in the D. melanogaster species group and to understand the cause of the past incongruence. Though we find that the phylogeny grouping D. erecta and D. yakuba together is the best supported, we also find widespread incongruence in nucleotide and amino acid substitutions, insertions and deletions, and gene trees. The time inferred to span the two key speciation events is short enough that under the coalescent model, the incongruence could be the result of incomplete lineage sorting. Consistent with the lineage-sorting hypothesis, substitutions supporting the same tree were spatially clustered. Support for the different trees was found to be linked to recombination such that adjacent genes support the same tree most often in regions of low recombination and substitutions supporting the same tree are most enriched roughly on the same scale as linkage disequilibrium, also consistent with lineage sorting. The incongruence was found to be statistically significant and robust to model and species choice. No systematic biases were found. We conclude that phylogenetic incongruence in the D. melanogaster species complex is the result, at least in part, of incomplete lineage sorting. Incomplete lineage sorting will likely cause phylogenetic incongruence in many comparative genomics datasets. Methods to infer the correct species tree, the history of every base in the genome, and comparative methods that control for and/or utilize this information will be valuable advancements for the field of comparative genomics.


Drosophila/classification , Drosophila/genetics , Genes, Insect/genetics , Genetic Speciation , Phylogeny , Animals , Base Sequence , Cluster Analysis , Drosophila melanogaster/genetics , Evolution, Molecular , Likelihood Functions , Models, Genetic
10.
PLoS Comput Biol ; 2(10): e130, 2006 Oct.
Article En | MEDLINE | ID: mdl-17040121

The gain and loss of functional transcription factor binding sites has been proposed as a major source of evolutionary change in cis-regulatory DNA and gene expression. We have developed an evolutionary model to study binding-site turnover that uses multiple sequence alignments to assess the evolutionary constraint on individual binding sites, and to map gain and loss events along a phylogenetic tree. We apply this model to study the evolutionary dynamics of binding sites of the Drosophila melanogaster transcription factor Zeste, using genome-wide in vivo (ChIP-chip) binding data to identify functional Zeste binding sites, and the genome sequences of D. melanogaster, D. simulans, D. erecta, and D. yakuba to study their evolution. We estimate that more than 5% of functional Zeste binding sites in D. melanogaster were gained along the D. melanogaster lineage or lost along one of the other lineages. We find that Zeste-bound regions have a reduced rate of binding-site loss and an increased rate of binding-site gain relative to flanking sequences. Finally, we show that binding-site gains and losses are asymmetrically distributed with respect to D. melanogaster, consistent with lineage-specific acquisition and loss of Zeste-responsive regulatory elements.


Drosophila melanogaster/genetics , Drosophila melanogaster/metabolism , Response Elements/genetics , Transcription Factors/metabolism , Animals , Base Sequence , Binding Sites , Computational Biology , Conserved Sequence , DNA, Intergenic/genetics , DNA-Binding Proteins/genetics , Drosophila Proteins/genetics , Evolution, Molecular , Models, Genetic , Molecular Sequence Data , Promoter Regions, Genetic/genetics , Selection, Genetic , Sequence Alignment
11.
BMC Bioinformatics ; 7: 376, 2006 Aug 14.
Article En | MEDLINE | ID: mdl-16904011

BACKGROUND: Molecular evolutionary studies of noncoding sequences rely on multiple alignments. Yet how multiple alignment accuracy varies across sequence types, tree topologies, divergences and tools, and further how this variation impacts specific inferences, remains unclear. RESULTS: Here we develop a molecular evolution simulation platform, CisEvolver, with models of background noncoding and transcription factor binding site evolution, and use simulated alignments to systematically examine multiple alignment accuracy and its impact on two key molecular evolutionary inferences: transcription factor binding site conservation and divergence estimation. We find that the accuracy of multiple alignments is determined almost exclusively by the pairwise divergence distance of the two most diverged species and that additional species have a negligible influence on alignment accuracy. Conserved transcription factor binding sites align better than surrounding noncoding DNA yet are often found to be misaligned at relatively short divergence distances, such that studies of binding site gain and loss could easily be confounded by alignment error. Divergence estimates from multiple alignments tend to be overestimated at short divergence distances but reach a tool specific divergence at which they cease to increase, leading to underestimation at long divergences. Our most striking finding was that overall alignment accuracy, binding site alignment accuracy and divergence estimation accuracy vary greatly across branches in a tree and are most accurate for terminal branches connecting sister taxa and least accurate for internal branches connecting sub-alignments. CONCLUSION: Our results suggest that variation in alignment accuracy can lead to errors in molecular evolutionary inferences that could be construed as biological variation. These findings have implications for which species to choose for analyses, what kind of errors would be expected for a given set of species and how multiple alignment tools and phylogenetic inference methods might be improved to minimize or control for alignment errors.


Evolution, Molecular , Models, Genetic , Regulatory Elements, Transcriptional/genetics , Sequence Alignment/methods , Binding Sites/genetics , Sequence Analysis, DNA/methods , Species Specificity
12.
Genome Biol ; 5(12): R98, 2004.
Article En | MEDLINE | ID: mdl-15575972

We introduce a method (MONKEY) to identify conserved transcription-factor binding sites in multispecies alignments. MONKEY employs probabilistic models of factor specificity and binding-site evolution, on which basis we compute the likelihood that putative sites are conserved and assign statistical significance to each hit. Using genomes from the genus Saccharomyces, we illustrate how the significance of real sites increases with evolutionary distance and explore the relationship between conservation and function.


Biological Evolution , Conserved Sequence , DNA/metabolism , Sequence Alignment , Software , Transcription Factors/metabolism , Binding Sites , DNA, Fungal/metabolism , DNA-Binding Proteins/metabolism , Gene Expression Regulation , Models, Genetic , Saccharomyces cerevisiae/genetics , Saccharomyces cerevisiae Proteins/metabolism , Species Specificity
13.
BMC Bioinformatics ; 5: 6, 2004 01 21.
Article En | MEDLINE | ID: mdl-14736341

BACKGROUND: Numerous tools have been developed to align genomic sequences. However, their relative performance in specific applications remains poorly characterized. Alignments of protein-coding sequences typically have been benchmarked against "correct" alignments inferred from structural data. For noncoding sequences, where such independent validation is lacking, simulation provides an effective means to generate "correct" alignments with which to benchmark alignment tools. RESULTS: Using rates of noncoding sequence evolution estimated from the genus Drosophila, we simulated alignments over a range of divergence times under varying models incorporating point substitution, insertion/deletion events, and short blocks of constrained sequences such as those found in cis-regulatory regions. We then compared "correct" alignments generated by a modified version of the ROSE simulation platform to alignments of the simulated derived sequences produced by eight pairwise alignment tools (Avid, BlastZ, Chaos, ClustalW, DiAlign, Lagan, Needle, and WABA) to determine the off-the-shelf performance of each tool. As expected, the ability to align noncoding sequences accurately decreases with increasing divergence for all tools, and declines faster in the presence of insertion/deletion evolution. Global alignment tools (Avid, ClustalW, Lagan, and Needle) typically have higher sensitivity over entire noncoding sequences as well as in constrained sequences. Local tools (BlastZ, Chaos, and WABA) have lower overall sensitivity as a consequence of incomplete coverage, but have high specificity to detect constrained sequences as well as high sensitivity within the subset of sequences they align. Tools such as DiAlign, which generate both local and global outputs, produce alignments of constrained sequences with both high sensitivity and specificity for divergence distances in the range of 1.25-3.0 substitutions per site. CONCLUSION: For species with genomic properties similar to Drosophila, we conclude that a single pair of optimally diverged species analyzed with a high performance alignment tool can yield accurate and specific alignments of functionally constrained noncoding sequences. Further algorithm development, optimization of alignment parameters, and benchmarking studies will be necessary to extract the maximal biological information from alignments of functional noncoding DNA.


Benchmarking/methods , DNA, Intergenic/genetics , Sequence Alignment/methods , Sequence Alignment/standards , Animals , Caenorhabditis/genetics , Caenorhabditis elegans/genetics , Computer Simulation , DNA, Helminth/genetics , DNA, Helminth/physiology , DNA, Intergenic/physiology , Drosophila/genetics , Drosophila melanogaster/genetics , Evolution, Molecular , Genomics/methods , Genomics/standards , Humans , Mice , Sensitivity and Specificity , Sequence Homology, Nucleic Acid
...