RESUMEN
Advances in affordable transcriptome sequencing combined with better exon and gene prediction has motivated many to compare transcription across the tree of life. We develop a mathematical framework to calculate complexity and compare transcript models. Structural features, i.e. intron retention (IR), donor/acceptor site variation, alternative exon cassettes, alternative 5'/3' UTRs, are compared and the distance between transcript models is calculated with nucleotide level precision. All metrics are implemented in a PyPi package, TranD and output can be used to summarize splicing patterns for a transcriptome (1GTF) and between transcriptomes (2GTF). TranD output enables quantitative comparisons between: annotations augmented by empirical RNA-seq data and the original transcript models; transcript model prediction tools for longread RNA-seq (e.g. FLAIR versus Isoseq3); alternate annotations for a species (e.g. RefSeq vs Ensembl); and between closely related species. In C. elegans, Z. mays, D. melanogaster, D. simulans and H. sapiens, alternative exons were observed more frequently in combination with an alternative donor/acceptor than alone. Transcript models in RefSeq and Ensembl are linked and both have unique transcript models with empirical support. D. melanogaster and D. simulans, share many transcript models and long-read RNAseq data suggests that both species are under-annotated. We recommend combined references.
Asunto(s)
Empalme Alternativo , Transcriptoma , Animales , Caenorhabditis elegans/genética , Drosophila melanogaster/genética , Perfilación de la Expresión Génica , Nucleótidos , Empalme del ARN , Análisis de Secuencia de ARN , Especificidad de la Especie , Transcriptoma/genética , Programas InformáticosRESUMEN
In Drosophila melanogaster and D. simulans head tissue, 60% of orthologous genes show evidence of sex-biased expression in at least one species. Of these, â¼39% (2,192) are conserved in direction. We hypothesize enrichment of open chromatin in the sex where we see expression bias and closed chromatin in the opposite sex. Male-biased orthologs are significantly enriched for H3K4me3 marks in males of both species (â¼89% of male-biased orthologs vs. â¼76% of unbiased orthologs). Similarly, female-biased orthologs are significantly enriched for H3K4me3 marks in females of both species (â¼90% of female-biased orthologs vs. â¼73% of unbiased orthologs). The sex-bias ratio in female-biased orthologs was similar in magnitude between the two species, regardless of the closed chromatin (H3K27me2me3) marks in males. However, in male-biased orthologs, the presence of H3K27me2me3 in both species significantly reduced the correlation between D. melanogaster sex-bias ratio and the D. simulans sex-bias ratio. Male-biased orthologs are enriched for evidence of positive selection in the D. melanogaster group. There are more male-biased genes than female-biased genes in both species. For orthologs with gains/losses of sex-bias between the two species, there is an excess of male-bias compared to female-bias, but there is no consistent pattern in the relationship between H3K4me3 or H3K27me2me3 chromatin marks and expression. These data suggest chromatin state is a component of the maintenance of sex-biased expression and divergence of sex-bias between species is reflected in the complexity of the chromatin status.
Asunto(s)
Cromatina , Drosophila melanogaster , Animales , Femenino , Masculino , Drosophila melanogaster/genética , Cromatina/genética , Drosophila simulans/genética , Evolución Molecular , Drosophila/genéticaRESUMEN
We propose a new model for the association of chromatin state and sex-bias in expression. We hypothesize enrichment of open chromatin in the sex where we see expression bias (OS) and closed chromatin in the opposite sex (CO). In this study of D. melanogaster and D. simulans head tissue, sex-bias in expression is associated with H3K4me3 (open mark) in males for male-biased genes and in females for female-biased genes in both species. Sex-bias in expression is also largely conserved in direction and magnitude between the two species on the X and autosomes. In male-biased orthologs, the sex-bias ratio is more divergent between species if both species have H3K27me2me3 marks in females compared to when either or neither species has H3K27me2me3 in females. H3K27me2me3 marks in females are associated with male-bias in expression on the autosomes in both species, but on the X only in D. melanogaster . In female-biased orthologs the relationship between the species for the sex-bias ratio is similar regardless of the H3K27me2me3 marks in males. Female-biased orthologs are more similar in the ratio of sex-bias than male-biased orthologs and there is an excess of male-bias in expression in orthologs that gain/lose sex-bias. There is an excess of male-bias in sex-limited expression in both species suggesting excess male-bias is due to rapid evolution between the species. The X chromosome has an enrichment in male-limited H3K4me3 in both species and an enrichment of sex-bias in expression compared to the autosomes.
RESUMEN
Gene regulatory networks control the complex programs that drive development. Deciphering the connections between transcription factors (TFs) and target genes is challenging, in part because TFs bind to thousands of places in the genome but control expression through a subset of these binding events. We hypothesize that we can combine natural variation of expression levels and predictions of TF binding sites to identify TF targets. We gather RNA-seq data from 71 genetically distinct F1 Drosophila melanogaster embryos and calculate the correlations between TF and potential target genes' expression levels, which we call "regulatory strength." To separate direct and indirect TF targets, we hypothesize that direct TF targets will have a preponderance of binding sites in their upstream regions. Using 14 TFs active during embryogenesis, we find that 12 TFs showed a significant correlation between their binding strength and regulatory strength on downstream targets, and 10 TFs showed a significant correlation between the number of binding sites and the regulatory effect on target genes. The general roles, e.g. bicoid's role as an activator, and the particular interactions we observed between our TFs, e.g. twist's role as a repressor of sloppy paired and odd paired, generally coincide with the literature.