Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 12 de 12
Filtrar
Mais filtros








Base de dados
Intervalo de ano de publicação
1.
Genome Biol Evol ; 16(8)2024 Aug 05.
Artigo em Inglês | MEDLINE | ID: mdl-39212967

RESUMO

The discovery of de novo emerged genes, originating from previously noncoding DNA regions, challenges traditional views of species evolution. Indeed, the hypothesis of neutrally evolving sequences giving rise to functional proteins is highly unlikely. This conundrum has sparked numerous studies to quantify and characterize these genes, aiming to understand their functional roles and contributions to genome evolution. Yet, no fully automated pipeline for their identification is available. Therefore, we introduce DENSE (DE Novo emerged gene SEarch), an automated Nextflow pipeline based on two distinct steps: detection of taxonomically restricted genes (TRGs) through phylostratigraphy, and filtering of TRGs for de novo emerged genes via genome comparisons and synteny search. DENSE is available as a user-friendly command-line tool, while the second step is accessible through a web server upon providing a list of TRGs. Highly flexible, DENSE provides various strategy and parameter combinations, enabling users to adapt to specific configurations or define their own strategy through a rational framework, facilitating protocol communication, and study interoperability. We apply DENSE to seven model organisms, exploring the impact of its strategies and parameters on de novo gene predictions. This thorough analysis across species with different evolutionary rates reveals useful metrics for users to define input datasets, identify favorable/unfavorable conditions for de novo gene detection, and control potential biases in genome annotations. Additionally, predictions made for the seven model organisms are compiled into a requestable database, which we hope will serve as a reference for de novo emerged gene lists generated with specific criteria combinations.


Assuntos
Eucariotos , Evolução Molecular , Eucariotos/genética , Software , Animais , Filogenia , Genoma
2.
Nat Commun ; 15(1): 6187, 2024 Jul 23.
Artigo em Inglês | MEDLINE | ID: mdl-39043684

RESUMO

Protein coding features can emerge de novo in non coding transcripts, resulting in emergence of new protein coding genes. Studies across many species show that a large fraction of evolutionarily novel non-coding RNAs have an antisense overlap with protein coding genes. The open reading frames (ORFs) in these antisense RNAs could also overlap with existing ORFs. In this study, we investigate how the evolution an ORF could be constrained by its overlap with an existing ORF in three different reading frames. Using a combination of mathematical modeling and genome/transcriptome data analysis in two different model organisms, we show that antisense overlap can increase the likelihood of ORF emergence and reduce the likelihood of ORF loss, especially in one of the three reading frames. In addition to rationalising the repeatedly reported prevalence of de novo emerged genes in antisense transcripts, our work also provides a generic modeling and an analytical framework that can be used to understand evolution of antisense genes.


Assuntos
Evolução Molecular , Fases de Leitura Aberta , RNA Antissenso , RNA Antissenso/genética , RNA Antissenso/metabolismo , Fases de Leitura Aberta/genética , Animais , Modelos Genéticos , Transcriptoma
3.
Genome Biol Evol ; 16(7)2024 Jul 03.
Artigo em Inglês | MEDLINE | ID: mdl-38934893

RESUMO

De novo genes emerge from noncoding regions of genomes via succession of mutations. Among others, such mutations activate transcription and create a new open reading frame (ORF). Although the mechanisms underlying ORF emergence are well documented, relatively little is known about the mechanisms enabling new transcription events. Yet, in many species a continuum between absent and very prominent transcription has been reported for essentially all regions of the genome. In this study, we searched for de novo transcripts by using newly assembled genomes and transcriptomes of seven inbred lines of Drosophila melanogaster, originating from six European and one African population. This setup allowed us to detect sample specific de novo transcripts, and compare them to their homologous nontranscribed regions in other samples, as well as genic and intergenic control sequences. We studied the association with transposable elements (TEs) and the enrichment of transcription factor motifs upstream of de novo emerged transcripts and compared them with regulatory elements. We found that de novo transcripts overlap with TEs more often than expected by chance. The emergence of new transcripts correlates with regions of high guanine-cytosine content and TE expression. Moreover, upstream regions of de novo transcripts are highly enriched with regulatory motifs. Such motifs are more enriched in new transcripts overlapping with TEs, particularly DNA TEs, and are more conserved upstream de novo transcripts than upstream their 'nontranscribed homologs'. Overall, our study demonstrates that TE insertion is important for transcript emergence, partly by introducing new regulatory motifs from DNA TE families.


Assuntos
Elementos de DNA Transponíveis , Drosophila melanogaster , Fatores de Transcrição , Animais , Drosophila melanogaster/genética , Fatores de Transcrição/genética , Fatores de Transcrição/metabolismo , Transcriptoma , Transcrição Gênica , Motivos de Nucleotídeos , Sítios de Ligação , Fases de Leitura Aberta , Genoma de Inseto , Evolução Molecular
4.
Nucleic Acids Res ; 52(1): 274-287, 2024 Jan 11.
Artigo em Inglês | MEDLINE | ID: mdl-38000384

RESUMO

Most of the transcribed eukaryotic genomes are composed of non-coding transcripts. Among these transcripts, some are newly transcribed when compared to outgroups and are referred to as de novo transcripts. De novo transcripts have been shown to play a major role in genomic innovations. However, little is known about the rates at which de novo transcripts are gained and lost in individuals of the same species. Here, we address this gap and estimate the de novo transcript turnover rate with an evolutionary model. We use DNA long reads and RNA short reads from seven geographically remote samples of inbred individuals of Drosophila melanogaster to detect de novo transcripts that are gained on a short evolutionary time scale. Overall, each sampled individual contains around 2500 unspliced de novo transcripts, with most of them being sample specific. We estimate that around 0.15 transcripts are gained per year, and that each gained transcript is lost at a rate around 5× 10-5 per year. This high turnover of transcripts suggests frequent exploration of new genomic sequences within species. These rate estimates are essential to comprehend the process and timescale of de novo gene birth.


Assuntos
Drosophila melanogaster , Evolução Molecular , RNA não Traduzido , Transcrição Gênica , Animais , Humanos , Evolução Biológica , Drosophila melanogaster/genética , Genoma , Genômica , RNA , RNA não Traduzido/química , RNA não Traduzido/genética , RNA não Traduzido/metabolismo , Geografia
5.
Genome Res ; 33(6): 872-890, 2023 06.
Artigo em Inglês | MEDLINE | ID: mdl-37442576

RESUMO

Novel genes are essential for evolutionary innovations and differ substantially even between closely related species. Recently, multiple studies across many taxa showed that some novel genes arise de novo, that is, from previously noncoding DNA. To characterize the underlying mutations that allowed de novo gene emergence and their order of occurrence, homologous regions must be detected within noncoding sequences in closely related sister genomes. So far, most studies do not detect noncoding homologs of de novo genes because of incomplete assemblies and annotations, and long evolutionary distances separating genomes. Here, we overcome these issues by searching for de novo expressed open reading frames (neORFs), the not-yet fixed precursors of de novo genes that emerged within a single species. We sequenced and assembled genomes with long-read technology and the corresponding transcriptomes from inbred lines of Drosophila melanogaster, derived from seven geographically diverse populations. We found line-specific neORFs in abundance but few neORFs shared by lines, suggesting a rapid turnover. Gain and loss of transcription is more frequent than the creation of ORFs, for example, by forming new start and stop codons. Consequently, the gain of ORFs becomes rate limiting and is frequently the initial step in neORFs emergence. Furthermore, transposable elements (TEs) are major drivers for intragenomic duplications of neORFs, yet TE insertions are less important for the emergence of neORFs. However, highly mutable genomic regions around TEs provide new features that enable gene birth. In conclusion, neORFs have a high birth-death rate, are rapidly purged, but surviving neORFs spread neutrally through populations and within genomes.


Assuntos
Drosophila melanogaster , Metagenômica , Animais , Drosophila melanogaster/genética , Fases de Leitura Aberta , Elementos de DNA Transponíveis/genética , Evolução Biológica , Evolução Molecular
6.
Genes (Basel) ; 13(2)2022 01 31.
Artigo em Inglês | MEDLINE | ID: mdl-35205330

RESUMO

De novo genes are novel genes which emerge from non-coding DNA. Until now, little is known about de novo genes' properties, correlated to their age and mechanisms of emergence. In this study, we investigate four related properties: introns, upstream regulatory motifs, 5' Untranslated regions (UTRs) and protein domains, in 23,135 human proto-genes. We found that proto-genes contain introns, whose number and position correlates with the genomic position of proto-gene emergence. The origin of these introns is debated, as our results suggest that 41% of proto-genes might have captured existing introns, and 13.7% of them do not splice the ORF. We show that proto-genes which emerged via overprinting tend to be more enriched in core promotor motifs, while intergenic and intronic genes are more enriched in enhancers, even if the TATA motif is most commonly found upstream in these genes. Intergenic and intronic 5' UTRs of proto-genes have a lower potential to stabilise mRNA structures than exonic proto-genes and established human genes. Finally, we confirm that proteins expressed by proto-genes gain new putative domains with age. Overall, we find that regulatory motifs inducing transcription and translation of previously non-coding sequences may facilitate proto-gene emergence. Our study demonstrates that introns, 5' UTRs, and domains have specific properties in proto-genes. We also emphasize that the genomic positions of de novo genes strongly impacts these properties.


Assuntos
Genômica , Regiões 5' não Traduzidas , Éxons/genética , Humanos , Íntrons/genética , Regiões Promotoras Genéticas
7.
PLoS Genet ; 17(9): e1009787, 2021 09.
Artigo em Inglês | MEDLINE | ID: mdl-34478447

RESUMO

Comparative genomics has enabled the identification of genes that potentially evolved de novo from non-coding sequences. Many such genes are expressed in male reproductive tissues, but their functions remain poorly understood. To address this, we conducted a functional genetic screen of over 40 putative de novo genes with testis-enriched expression in Drosophila melanogaster and identified one gene, atlas, required for male fertility. Detailed genetic and cytological analyses showed that atlas is required for proper chromatin condensation during the final stages of spermatogenesis. Atlas protein is expressed in spermatid nuclei and facilitates the transition from histone- to protamine-based chromatin packaging. Complementary evolutionary analyses revealed the complex evolutionary history of atlas. The protein-coding portion of the gene likely arose at the base of the Drosophila genus on the X chromosome but was unlikely to be essential, as it was then lost in several independent lineages. Within the last ~15 million years, however, the gene moved to an autosome, where it fused with a conserved non-coding RNA and evolved a non-redundant role in male fertility. Altogether, this study provides insight into the integration of novel genes into biological processes, the links between genomic innovation and functional evolution, and the genetic control of a fundamental developmental process, gametogenesis.


Assuntos
Cromatina/metabolismo , Drosophila melanogaster/genética , Evolução Molecular , Espermátides/metabolismo , Animais , Núcleo Celular/metabolismo , Repetições Palindrômicas Curtas Agrupadas e Regularmente Espaçadas , Fertilidade/genética , Masculino , Interferência de RNA , Espermatogênese/genética
8.
Int J Genomics ; 2021: 9028667, 2021.
Artigo em Inglês | MEDLINE | ID: mdl-34368340

RESUMO

Gene dosage is an important issue both in cell and evolutionary biology. Most genes are present in two copies or alleles in diploid eukariotic cells. The most outstanding exception is monoallelic gene expression (MA) that concerns genes localized on the X chromosome or in regions undergoing parental imprinting in eutherians, and many other genes scattered throughout the genome. In diploids, haploinsufficiency (HI) implies that a single functional copy of a gene in a diploid organism is insufficient to ensure a normal biological function. One of the most important mechanisms ensuring functional innovation during evolution is whole genome duplication (WGD). In addition to the two WGDs that have occurred in vertebrate genomes, the teleost genomes underwent an additional WGD, after their divergence from tetrapods. In the present work, we have studied on 57 teleost species whether the orthologs of human MA or HI genes remain more frequently in duplicates or returned more frequently in singleton than the rest of the genome. Our results show that the teleost orthologs of HI human genes remained more frequently in duplicate than the rest of the genome in all of the teleost species studied. No signal was observed for the orthologs of genes mapping to the human X chromosome or subjected to parental imprinting. Surprisingly, the teleost orthologs of the other human MA genes remained in duplicate more frequently than the rest of the genome for most teleost species. These results suggest that the teleost orthologs of MA and HI human genes also undergo selective pressures either related to absolute protein amounts and/or of dosage balance issues. However, these constraints seem to be different for MA genes in teleost in comparison with human genomes.

9.
PLoS One ; 15(5): e0231813, 2020.
Artigo em Inglês | MEDLINE | ID: mdl-32442167

RESUMO

The interactions between membrane receptors and their endogenous ligands are key interactions in organisms. Recently, we have shown that a high number of genes encoding human receptors appeared at the same moment as their ligand in the animal tree of life. However, a set of receptors appeared before their present ligand. Different scenarios have been proposed to explain how a receptor can be conserved if its ligand is not yet appeared. However, these scenarios have been proposed individually and have never been studied in a global way. In this study, we investigated 30 mammalian pairs of ligand/receptor for which the first ligand appeared after its receptor in the tree of life, by using common indexes of selection, and proposed different scenarios explaining the earlier appearance of a receptor relative to its ligand. Based on 3D structural studies, our indexes allowed us to classify the evolution of these partners into different scenarios: 1) a scenario where the binding interface of the receptor is already present and under purifying selection before the appearance of the ligand; 2) a scenario where the binding interface seems to have appeared progressively, and 3) a scenario where the binding site seems to have been reshuffled since its appearance. As some scenarios were confirmed by the literature, we concluded that simple indexes can give a good highlight of the evolutive history of two partners that did not appear at the same time. Based on these scenarios, we also hypothesize that the replacement of a ligand by another is a frequent phenomenon during evolution.


Assuntos
Sítios de Ligação/genética , Evolução Molecular , Ligantes , Ligação Proteica/genética , Receptores de Superfície Celular , Sequência de Aminoácidos/genética , Animais , Simulação por Computador , Humanos , Modelos Moleculares , Conformação Molecular , Receptores de Superfície Celular/genética , Receptores de Superfície Celular/metabolismo
10.
BMC Evol Biol ; 19(1): 215, 2019 11 26.
Artigo em Inglês | MEDLINE | ID: mdl-31771521

RESUMO

BACKGROUND: In mammals, the natriuretic system contains three natriuretic peptides, NPPA, NPPB and NPPC, that bind to three transmembrane receptors, NPR1, NPR2 and NPR3. The natriuretic peptides are known only in vertebrates. In contrast, the receptors have orthologs in all the animal taxa and in plants. However, in non-vertebrates, these receptors do not have natriuretic properties, and most of their ligands are unknown. How was the interaction of the NP receptors and the NP established in vertebrates? Do natriuretic peptides have orthologs in non-vertebrates? If so, what was the function of the interaction? How did that function change? If not, are the NP homologous to ancestral NPR ligands? Or did the receptor's binding pocket completely change during evolution? METHODS: In the present study, we tried to determine if the pairs of natriuretic receptors and their ligands come from an ancestral pair, or if the interaction only appeared in vertebrates. Alignments, modeling, docking, research of positive selection, and motif research were performed in order to answer this question. RESULTS: We discovered that the binding pocket of the natriuretic peptide receptors was completely remodeled in mammals. We found several peptides in non vertebrates that could be related to human natriuretic peptides, but a set of clues, as well as modeling and docking analysis, suggest that the natriuretic peptides undoubtedly appeared later than their receptors during animal evolution. We suggest here that natriuretic peptide receptors in non vertebrates bind to other ligands. CONCLUSIONS: The present study further support that vertebrate natriuretic peptides appeared after their receptors in the tree of life. We suggest the existence of peptides that resemble natriuretic peptides in non-vertebrate species, that might be the result of convergent evolution.


Assuntos
Peptídeos Natriuréticos/genética , Vertebrados/genética , Sequência de Aminoácidos , Animais , Humanos , Ligantes , Modelos Moleculares , Peptídeos Natriuréticos/química , Peptídeos Natriuréticos/metabolismo , Filogenia , Ligação Proteica , Receptores de Peptídeos/genética , Seleção Genética , Vertebrados/metabolismo
11.
Genome Biol Evol ; 11(5): 1451-1462, 2019 05 01.
Artigo em Inglês | MEDLINE | ID: mdl-31087101

RESUMO

Signaling through ligand/receptor interactions is a widespread mechanism across all living taxa. During evolution, however, there has been a diversification in multigene families and changes in their interaction patterns. Among the events that led to the creation of new genes is the whole-genome duplication, which made possible some major innovations. Teleost fishes descended from a common ancestor which underwent one such whole-genome duplication. In our study, we investigated the effect of complete genome duplication on the evolution of ligand-receptor pairs in teleosts. We selected ten teleost species and used bioinformatics programs and phylogenetic tools in order to study the evolution of the human ligands and receptors that have orthologous genes in fishes, as well as the rest of the fish genomes. We established that since the complete duplication of the fish genomes, the conservation in duplicate copy of ligand and receptor genes is higher than expected. However, the ligand/receptor pair partners did not necessarily evolve in the same way, and a lot of situations occurred in which one of the partners returned in singleton copy when the other one was maintained in duplicate. This suggests that changes in interaction partners may have taken place during the evolution of teleosts. Moreover, the fate of the ligands and receptor coding genes is partly congruent with the phylogeny of teleosts. However, some incongruences can be observed. We suggest that these incongruences are correlated to the environment.


Assuntos
Evolução Molecular , Peixes/genética , Receptores de Superfície Celular/genética , Animais , Duplicação Gênica , Genoma , Humanos , Ligantes
12.
BMC Genomics ; 19(1): 611, 2018 Aug 14.
Artigo em Inglês | MEDLINE | ID: mdl-30107779

RESUMO

BACKGROUND: Interactions between proteins are key components in the chemical and physical processes of living organisms. Among these interactions, membrane receptors and their ligands are particularly important because they are at the interface between extracellular and intracellular environments. Many studies have investigated how binding partners have co-evolved in genomes during the evolution. However, little is known about the establishment of the interaction on a phylogenetic scale. In this study, we systematically studied the time of birth of genes encoding human membrane receptors and their ligands in the animal tree of life. We examined a total of 553 pairs of ligands/receptors, representing non-redundant interactions. RESULTS: We found that 41% of the receptors and their respective first ligands appeared in the same branch, representing 2.5-fold more than expected by chance, thus suggesting an evolutionary dynamic of interdependence and conservation between these partners. In contrast, 21% of the receptors appeared after their ligand, i.e. three-fold less often than expected by chance. Most surprisingly, 38% of the receptors appeared before their first ligand, as much as expected by chance. CONCLUSIONS: According to these results, we propose that a selective pressure is exerted on ligands and receptors once they appear, that would remove molecules whose partner does not appear quickly.


Assuntos
Evolução Molecular , Receptores Citoplasmáticos e Nucleares/genética , Animais , Humanos , Ligantes , Modelos Biológicos , Filogenia , Ligação Proteica , Receptores Citoplasmáticos e Nucleares/química
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA