Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 47
Filtrar
1.
Commun Biol ; 6(1): 1241, 2023 12 08.
Artigo em Inglês | MEDLINE | ID: mdl-38066075

RESUMO

Soil invertebrates are among the least understood metazoans on Earth. Thus far, the lack of taxonomically broad and dense genomic resources has made it hard to thoroughly investigate their evolution and ecology. With MetaInvert we provide draft genome assemblies for 232 soil invertebrate species, representing 14 common groups and 94 families. We show that this data substantially extends the taxonomic scope of DNA- or RNA-based taxonomic identification. Moreover, we confirm that theories of genome evolution cannot be generalised across evolutionarily distinct invertebrate groups. The soil invertebrate genomes presented here will support the management of soil biodiversity through molecular monitoring of community composition and function, and the discovery of evolutionary adaptations to the challenges of soil conditions.


Assuntos
Invertebrados , Solo , Humanos , Animais , Invertebrados/genética , Biodiversidade , Ecologia , Genômica
2.
Genes (Basel) ; 14(8)2023 08 15.
Artigo em Inglês | MEDLINE | ID: mdl-37628678

RESUMO

Repetitive elements are a major component of DNA sequences due to their ability to propagate through the genome. Characterization of Metazoan repetitive profiles is improving; however, current pipelines fail to identify a significant proportion of divergent repeats in non-model organisms. The Decapoda order, for which repeat content analyses are largely lacking, is characterized by extremely variable genome sizes that suggest an important presence of repetitive elements. Here, we developed a new standardized pipeline to annotate repetitive elements in non-model organisms, which we applied to twenty Decapoda and six other Crustacea genomes. Using this new tool, we identified 10% more repetitive elements than standard pipelines. Repetitive elements were more abundant in Decapoda species than in other Crustacea, with a very large number of highly repeated satellite DNA families. Moreover, we demonstrated a high correlation between assembly size and transposable elements and different repeat dynamics between Dendrobranchiata and Reptantia. The patterns of repetitive elements largely reflect the phylogenetic relationships of Decapoda and the distinct evolutionary trajectories within Crustacea. In summary, our results highlight the impact of repetitive elements on genome evolution in Decapoda and the value of our novel annotation pipeline, which will provide a baseline for future comparative analyses.


Assuntos
Elementos de DNA Transponíveis , Decápodes , Animais , Filogenia , Elementos de DNA Transponíveis/genética , DNA Satélite
3.
Genome Biol ; 24(1): 135, 2023 06 08.
Artigo em Inglês | MEDLINE | ID: mdl-37291671

RESUMO

BACKGROUND: In every living species, the function of a protein depends on its organization of structural domains, and the length of a protein is a direct reflection of this. Because every species evolved under different evolutionary pressures, the protein length distribution, much like other genomic features, is expected to vary across species but has so far been scarcely studied. RESULTS: Here we evaluate this diversity by comparing protein length distribution across 2326 species (1688 bacteria, 153 archaea, and 485 eukaryotes). We find that proteins tend to be on average slightly longer in eukaryotes than in bacteria or archaea, but that the variation of length distribution across species is low, especially compared to the variation of other genomic features (genome size, number of proteins, gene length, GC content, isoelectric points of proteins). Moreover, most cases of atypical protein length distribution appear to be due to artifactual gene annotation, suggesting the actual variation of protein length distribution across species is even smaller. CONCLUSIONS: These results open the way for developing a genome annotation quality metric based on protein length distribution to complement conventional quality measures. Overall, our findings show that protein length distribution between living species is more uniform than previously thought. Furthermore, we also provide evidence for a universal selection on protein length, yet its mechanism and fitness effect remain intriguing open questions.


Assuntos
Anotação de Sequência Molecular , Proteínas , Análise de Sequência de Proteína , Sequência de Aminoácidos , Anotação de Sequência Molecular/métodos , Proteínas/química , Proteínas/classificação , Proteoma , Análise de Sequência de Proteína/métodos , Eucariotos , Bactérias , Archaea
4.
Front Bioinform ; 3: 1178926, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-37151482

RESUMO

Protein annotation errors can have significant consequences in a wide range of fields, ranging from protein structure and function prediction to biomedical research, drug discovery, and biotechnology. By comparing the domains of different proteins, scientists can identify common domains, classify proteins based on their domain architecture, and highlight proteins that have evolved differently in one or more species or clades. However, genome-wide identification of different protein domain architectures involves a complex error-prone pipeline that includes genome sequencing, prediction of gene exon/intron structures, and inference of protein sequences and domain annotations. Here we developed an automated fact-checking approach to distinguish true domain loss/gain events from false events caused by errors that occur during the annotation process. Using genome-wide ortholog sets and taking advantage of the high-quality human and Saccharomyces cerevisiae genome annotations, we analyzed the domain gain and loss events in the predicted proteomes of 9 non-human primates (NHP) and 20 non-S. cerevisiae fungi (NSF) as annotated in the Uniprot and Interpro databases. Our approach allowed us to quantify the impact of errors on estimates of protein domain gains and losses, and we show that domain losses are over-estimated ten-fold and three-fold in the NHP and NSF proteins respectively. This is in line with previous studies of gene-level losses, where issues with genome sequencing or gene annotation led to genes being falsely inferred as absent. In addition, we show that insistent protein domain annotations are a major factor contributing to the false events. For the first time, to our knowledge, we show that domain gains are also over-estimated by three-fold and two-fold respectively in NHP and NSF proteins. Based on our more accurate estimates, we infer that true domain losses and gains in NHP with respect to humans are observed at similar rates, while domain gains in the more divergent NSF are observed twice as frequently as domain losses with respect to S. cerevisiae. This study highlights the need to critically examine the scientific validity of protein annotations, and represents a significant step toward scalable computational fact-checking methods that may 1 day mitigate the propagation of wrong information in protein databases.

5.
BMC Res Notes ; 15(1): 281, 2022 Aug 22.
Artigo em Inglês | MEDLINE | ID: mdl-35989321

RESUMO

OBJECTIVES: Crayfish plague disease, caused by the oomycete pathogen Aphanomyces astaci represents one of the greatest risks for the biodiversity of the freshwater crayfish. This data article covers the de novo transcriptome assembly and annotation data of the noble crayfish and the marbled crayfish challenged with Ap. astaci. Following the controlled infection experiment (Francesconi et al. in Front Ecol Evol, 2021, https://doi.org/10.3389/fevo.2021.647037 ), we conducted a differential gene expression analysis described in (Bostjancic et al. in BMC Genom, 2022, https://doi.org/10.1186/s12864-022-08571-z ) DATA DESCRIPTION: In total, 25 noble crayfish and 30 marbled crayfish were selected. Hepatopancreas tissue was isolated, followed by RNA sequencing using the Illumina NovaSeq 6000 platform. Raw data was checked for quality with FastQC, adapter and quality trimming were conducted using Trimmomatic followed by de novo assembly with Trinity. Assembly quality was assessed with BUSCO, at 93.30% and 93.98% completeness for the noble crayfish and the marbled crayfish, respectively. Transcripts were annotated using the Dammit! pipeline and assigned to KEGG pathways. Respective transcriptome and raw datasets may be reused as the reference transcriptome assemblies for future expression studies.


Assuntos
Aphanomyces , Astacoidea , Animais , Aphanomyces/genética , Astacoidea/genética , Hepatopâncreas , Análise de Sequência de RNA , Transcriptoma/genética
6.
BMC Genomics ; 23(1): 600, 2022 Aug 22.
Artigo em Inglês | MEDLINE | ID: mdl-35989333

RESUMO

BACKGROUND: For over a century, scientists have studied host-pathogen interactions between the crayfish plague disease agent Aphanomyces astaci and freshwater crayfish. It has been hypothesised that North American crayfish hosts are disease-resistant due to the long-lasting coevolution with the pathogen. Similarly, the increasing number of latent infections reported in the historically sensitive European crayfish hosts seems to indicate that similar coevolutionary processes are occurring between European crayfish and A. astaci. Our current understanding of these host-pathogen interactions is largely focused on the innate immunity processes in the crayfish haemolymph and cuticle, but the molecular basis of the observed disease-resistance and susceptibility remain unclear. To understand how coevolution is shaping the host's molecular response to the pathogen, susceptible native European noble crayfish and invasive disease-resistant marbled crayfish were challenged with two A. astaci strains of different origin: a haplogroup A strain (introduced to Europe at least 50 years ago, low virulence) and a haplogroup B strain (signal crayfish in lake Tahoe, USA, high virulence). Here, we compare the gene expression profiles of the hepatopancreas, an integrated organ of crayfish immunity and metabolism. RESULTS: We characterised several novel innate immune-related gene groups in both crayfish species. Across all challenge groups, we detected 412 differentially expressed genes (DEGs) in the noble crayfish, and 257 DEGs in the marbled crayfish. In the noble crayfish, a clear immune response was detected to the haplogroup B strain, but not to the haplogroup A strain. In contrast, in the marbled crayfish we detected an immune response to the haplogroup A strain, but not to the haplogroup B strain. CONCLUSIONS: We highlight the hepatopancreas as an important hub for the synthesis of immune molecules in the response to A. astaci. A clear distinction between the innate immune response in the marbled crayfish and the noble crayfish is the capability of the marbled crayfish to mobilise a higher variety of innate immune response effectors. With this study we outline that the type and strength of the host immune response to the pathogen is strongly influenced by the coevolutionary history of the crayfish with specific A. astaci strains.


Assuntos
Aphanomyces , Animais , Aphanomyces/genética , Astacoidea/genética , Resistência à Doença , Lagos , Transcriptoma
7.
Nucleic Acids Res ; 50(W1): W623-W632, 2022 07 05.
Artigo em Inglês | MEDLINE | ID: mdl-35552456

RESUMO

The Orthology Benchmark Service (https://orthology.benchmarkservice.org) is the gold standard for orthology inference evaluation, supported and maintained by the Quest for Orthologs consortium. It is an essential resource to compare existing and new methods of orthology inference (the bedrock for many comparative genomics and phylogenetic analysis) over a standard dataset and through common procedures. The Quest for Orthologs Consortium is dedicated to maintaining the resource up to date, through regular updates of the Reference Proteomes and increasingly accessible data through the OpenEBench platform. For this update, we have added a new benchmark based on curated orthology assertion from the Vertebrate Gene Nomenclature Committee, and provided an example meta-analysis of the public predictions present on the platform.


Assuntos
Benchmarking , Genômica , Filogenia , Genômica/métodos , Proteoma
8.
Genes (Basel) ; 12(9)2021 09 21.
Artigo em Inglês | MEDLINE | ID: mdl-34573434

RESUMO

Multiciliogenesis is a complex process that allows the generation of hundreds of motile cilia on the surface of specialized cells, to create fluid flow across epithelial surfaces. Dysfunction of human multiciliated cells is associated with diseases of the brain, airway and reproductive tracts. Despite recent efforts to characterize the transcriptional events responsible for the differentiation of multiciliated cells, a lot of actors remain to be identified. In this work, we capitalize on the ever-growing quantity of high-throughput data to search for new candidate genes involved in multiciliation. After performing a large-scale screening using 10 transcriptomics datasets dedicated to multiciliation, we established a specific evolutionary signature involving Otomorpha fish to use as a criterion to select the most likely targets. Combining both approaches highlighted a list of 114 potential multiciliated candidates. We characterized these genes first by generating protein interaction networks, which showed various clusters of ciliated and multiciliated genes, and then by computing phylogenetic profiles. In the end, we selected 11 poorly characterized genes that seem like particularly promising multiciliated candidates. By combining functional and comparative genomics methods, we developed a novel type of approach to study biological processes and identify new promising candidates linked to that process.


Assuntos
Cílios/fisiologia , Proteínas de Peixes/genética , Peixes , Genômica/métodos , Animais , Evolução Biológica , Diferenciação Celular/genética , Cílios/genética , Bases de Dados Genéticas , Proteínas de Peixes/metabolismo , Expressão Gênica , Humanos , Filogenia , Transcriptoma
9.
Mol Biol Evol ; 38(8): 3033-3045, 2021 07 29.
Artigo em Inglês | MEDLINE | ID: mdl-33822172

RESUMO

Accurate determination of the evolutionary relationships between genes is a foundational challenge in biology. Homology-evolutionary relatedness-is in many cases readily determined based on sequence similarity analysis. By contrast, whether or not two genes directly descended from a common ancestor by a speciation event (orthologs) or duplication event (paralogs) is more challenging, yet provides critical information on the history of a gene. Since 2009, this task has been the focus of the Quest for Orthologs (QFO) Consortium. The sixth QFO meeting took place in Okazaki, Japan in conjunction with the 67th National Institute for Basic Biology conference. Here, we report recent advances, applications, and oncoming challenges that were discussed during the conference. Steady progress has been made toward standardization and scalability of new and existing tools. A feature of the conference was the presentation of a panel of accessible tools for phylogenetic profiling and several developments to bring orthology beyond the gene unit-from domains to networks. This meeting brought into light several challenges to come: leveraging orthology computations to get the most of the incoming avalanche of genomic data, integrating orthology from domain to biological network levels, building better gene models, and adapting orthology approaches to the broad evolutionary and genomic diversity recognized in different forms of life and viruses.


Assuntos
Especiação Genética , Genômica/tendências , Filogenia , Genoma Viral , Genômica/métodos
10.
Genome Biol Evol ; 13(1)2021 01 07.
Artigo em Inglês | MEDLINE | ID: mdl-33211099

RESUMO

In the multiomics era, comparative genomics studies based on gene repertoire comparison are increasingly used to investigate evolutionary histories of species, to study genotype-phenotype relations, species adaptation to various environments, or to predict gene function using phylogenetic profiling. However, comparisons of orthologs have highlighted the prevalence of sequence plasticity among species, showing the benefits of combining protein and subprotein levels of analysis to allow for a more comprehensive study of genotype/phenotype correlations. In this article, we introduce a new approach called BLUR (BLAST Unexpected Ranking), capable of detecting genotype divergence or specialization between two related clades at different levels: gain/loss of proteins but also of subprotein regions. These regions can correspond to known domains, uncharacterized regions, or even small motifs. Our method was created to allow two types of research strategies: 1) the comparison of two groups of species with no previous knowledge, with the aim of predicting phenotype differences or specializations between close species or 2) the study of specific phenotypes by comparing species that present the phenotype of interest with species that do not. We designed a website to facilitate the use of BLUR with a possibility of in-depth analysis of the results with various tools, such as functional enrichments, protein-protein interaction networks, and multiple sequence alignments. We applied our method to the study of two different biological pathways and to the comparison of several groups of close species, all with very promising results. BLUR is freely available at http://lbgi.fr/blur/.


Assuntos
Evolução Molecular , Genômica/métodos , Proteínas/genética , Proteoma/genética , Proteoma/metabolismo , Animais , Proteínas do Domínio Armadillo , Bactérias , Sequência Conservada/genética , Fungos , Genótipo , Humanos , Fenótipo , Filogenia , Alinhamento de Sequência , Análise de Sequência , Software
11.
Nucleic Acids Res ; 48(W1): W538-W545, 2020 07 02.
Artigo em Inglês | MEDLINE | ID: mdl-32374845

RESUMO

The identification of orthologs-genes in different species which descended from the same gene in their last common ancestor-is a prerequisite for many analyses in comparative genomics and molecular evolution. Numerous algorithms and resources have been conceived to address this problem, but benchmarking and interpreting them is fraught with difficulties (need to compare them on a common input dataset, absence of ground truth, computational cost of calling orthologs). To address this, the Quest for Orthologs consortium maintains a reference set of proteomes and provides a web server for continuous orthology benchmarking (http://orthology.benchmarkservice.org). Furthermore, consensus ortholog calls derived from public benchmark submissions are provided on the Alliance of Genome Resources website, the joint portal of NIH-funded model organism databases.


Assuntos
Família Multigênica , Proteoma , Software , Animais , Benchmarking , Consenso , Genômica , Humanos , Camundongos , Filogenia , Ratos
12.
Nucleic Acids Res ; 47(D1): D411-D418, 2019 01 08.
Artigo em Inglês | MEDLINE | ID: mdl-30380106

RESUMO

OrthoInspector is one of the leading software suites for orthology relations inference. In this paper, we describe a major redesign of the OrthoInspector online resource along with a significant increase in the number of species: 4753 organisms are now covered across the three domains of life, making OrthoInspector the most exhaustive orthology resource to date in terms of covered species (excluding viruses). The new website integrates original data exploration and visualization tools in an ergonomic interface. Distributions of protein orthologs are represented by heatmaps summarizing their evolutionary histories, and proteins with similar profiles can be directly accessed. Two novel tools have been implemented for comparative genomics: a phylogenetic profile search that can be used to find proteins with a specific presence-absence profile and investigate their functions and, inversely, a GO profiling tool aimed at deciphering evolutionary histories of molecular functions, processes or cell components. In addition to the re-designed website, the OrthoInspector resource now provides a REST interface for programmatic access. OrthoInspector 3.0 is available at http://lbgi.fr/orthoinspectorv3.


Assuntos
Bases de Dados Genéticas , Genômica , Algoritmos , Bactérias/genética , Classificação , Eucariotos/genética , Evolução Molecular , Previsões , Ontologia Genética , Internet , Filogenia , Proteoma , Homologia de Sequência do Ácido Nucleico , Software , Especificidade da Espécie
13.
Bioinformatics ; 34(19): 3390-3392, 2018 10 01.
Artigo em Inglês | MEDLINE | ID: mdl-29741582

RESUMO

Summary: Comparative studies of protein sequences are widely used in evolutionary and comparative genomics studies, but there is a lack of efficient tools to identify conserved regions ab initio within a protein multiple alignment. PROBE provides a fully automatic analysis of protein family conservation, to identify conserved regions, or 'blocks', that may correspond to structural/functional domains or motifs. Conserved blocks are identified at two different levels: (i) family level blocks indicate sites that are probably of central importance to the protein's structure or function, and (ii) sub-family level blocks highlight regions that may signify functional specialization, such as binding partners, etc. All conserved blocks are mapped onto a phylogenetic tree and can also be visualized in the context of the multiple sequence alignment. PROBE thus facilitates in-depth studies of sequence-structure-function-evolution relationships, and opens the way to block-level phylogenetic profiling. Availability and implementation: Freely available on the web at http://www.lbgi.fr/∼julie/probe/web.


Assuntos
Evolução Molecular , Proteínas/genética , Software , Sequência de Aminoácidos , Biologia Computacional , Sequência Conservada , Filogenia , Alinhamento de Sequência
14.
J Med Internet Res ; 19(6): e212, 2017 06 16.
Artigo em Inglês | MEDLINE | ID: mdl-28623182

RESUMO

BACKGROUND: The constant and massive increase of biological data offers unprecedented opportunities to decipher the function and evolution of genes and their roles in human diseases. However, the multiplicity of sources and flow of data mean that efficient access to useful information and knowledge production has become a major challenge. This challenge can be addressed by taking inspiration from Web 2.0 and particularly social networks, which are at the forefront of big data exploration and human-data interaction. OBJECTIVE: MyGeneFriends is a Web platform inspired by social networks, devoted to genetic disease analysis, and organized around three types of proactive agents: genes, humans, and genetic diseases. The aim of this study was to improve exploration and exploitation of biological, postgenomic era big data. METHODS: MyGeneFriends leverages conventions popularized by top social networks (Facebook, LinkedIn, etc), such as networks of friends, profile pages, friendship recommendations, affinity scores, news feeds, content recommendation, and data visualization. RESULTS: MyGeneFriends provides simple and intuitive interactions with data through evaluation and visualization of connections (friendships) between genes, humans, and diseases. The platform suggests new friends and publications and allows agents to follow the activity of their friends. It dynamically personalizes information depending on the user's specific interests and provides an efficient way to share information with collaborators. Furthermore, the user's behavior itself generates new information that constitutes an added value integrated in the network, which can be used to discover new connections between biological agents. CONCLUSIONS: We have developed MyGeneFriends, a Web platform leveraging conventions from popular social networks to redefine the relationship between humans and biological big data and improve human processing of biomedical data. MyGeneFriends is available at lbgi.fr/mygenefriends.


Assuntos
Doenças Genéticas Inatas/genética , Testes Genéticos/métodos , Rede Social , Telemedicina/estatística & dados numéricos , Amigos , Humanos , Pesquisadores
15.
Mol Biol Evol ; 34(8): 2016-2034, 2017 08 01.
Artigo em Inglês | MEDLINE | ID: mdl-28460059

RESUMO

Cilia (flagella) are important eukaryotic organelles, present in the Last Eukaryotic Common Ancestor, and are involved in cell motility and integration of extracellular signals. Ciliary dysfunction causes a class of genetic diseases, known as ciliopathies, however current knowledge of the underlying mechanisms is still limited and a better characterization of genes is needed. As cilia have been lost independently several times during evolution and they are subject to important functional variation between species, ciliary genes can be investigated through comparative genomics. We performed phylogenetic profiling by predicting orthologs of human protein-coding genes in 100 eukaryotic species. The analysis integrated three independent methods to predict a consensus set of 274 ciliary genes, including 87 new promising candidates. A fine-grained analysis of the phylogenetic profiles allowed a partitioning of ciliary genes into modules with distinct evolutionary histories and ciliary functions (assembly, movement, centriole, etc.) and thus propagation of potential annotations to previously undocumented genes. The cilia/basal body localization was experimentally confirmed for five of these previously unannotated proteins (LRRC23, LRRC34, TEX9, WDR27, and BIVM), validating the relevance of our approach. Furthermore, our multi-level analysis sheds light on the core gene sets retained in gamete-only flagellates or Ecdysozoa for instance. By combining gene-centric and species-oriented analyses, this work reveals new ciliary and ciliopathy gene candidates and provides clues about the evolution of ciliary processes in the eukaryotic domain. Additionally, the positive and negative reference gene sets and the phylogenetic profile of human genes constructed during this study can be exploited in future work.


Assuntos
Cílios/genética , Ciliopatias/genética , Animais , Movimento Celular/genética , Cílios/metabolismo , Ciliopatias/metabolismo , Bases de Dados de Ácidos Nucleicos , Eucariotos , Células Eucarióticas , Evolução Molecular , Flagelos/genética , Flagelos/metabolismo , Genômica , Humanos , Filogenia , Análise de Sequência de DNA/métodos
16.
Genome Biol Evol ; 9(2): 279-296, 2017 02 01.
Artigo em Inglês | MEDLINE | ID: mdl-28082607

RESUMO

Temperature, perhaps more than any other environmental factor, is likely to influence the evolution of all organisms. It is also a very interesting factor to understand how genomes are shaped by selection over evolutionary timescales, as it potentially affects the whole genome. Among thermophilic prokaryotes, temperature affects both codon usage and protein composition to increase the stability of the transcriptional/translational machinery, and the resulting proteins need to be functional at high temperatures. Among eukaryotes less is known about genome evolution, and the tube-dwelling worms of the family Alvinellidae represent an excellent opportunity to test hypotheses about the emergence of thermophily in ectothermic metazoans. The Alvinellidae are a group of worms that experience varying thermal regimes, presumably having evolved into these niches over evolutionary times. Here we analyzed 423 putative orthologous loci derived from 6 alvinellid species including the thermophilic Alvinella pompejana and Paralvinella sulfincola. This comparative approach allowed us to assess amino acid composition, codon usage, divergence, direction of residue changes and the strength of selection along the alvinellid phylogeny, and to design a new eukaryotic thermophilic criterion based on significant differences in the residue composition of proteins. Contrary to expectations, the alvinellid ancestor of all present-day species seems to have been thermophilic, a trait subsequently maintained by purifying selection in lineages that still inhabit higher temperature environments. In contrast, lineages currently living in colder habitats likely evolved under selective relaxation, with some degree of positive selection for low-temperature adaptation at the protein level.


Assuntos
Aclimatação , Evolução Molecular , Poliquetos/genética , Proteoma/genética , Animais , Temperatura Baixa , Loci Gênicos , Fontes Hidrotermais , Filogenia , Seleção Genética
17.
Nat Methods ; 13(5): 425-30, 2016 05.
Artigo em Inglês | MEDLINE | ID: mdl-27043882

RESUMO

Achieving high accuracy in orthology inference is essential for many comparative, evolutionary and functional genomic analyses, yet the true evolutionary history of genes is generally unknown and orthologs are used for very different applications across phyla, requiring different precision-recall trade-offs. As a result, it is difficult to assess the performance of orthology inference methods. Here, we present a community effort to establish standards and an automated web-based service to facilitate orthology benchmarking. Using this service, we characterize 15 well-established inference methods and resources on a battery of 20 different benchmarks. Standardized benchmarking provides a way for users to identify the most effective methods for the problem at hand, sets a minimum requirement for new tools and resources, and guides the development of more accurate orthology inference methods.


Assuntos
Biologia Computacional/normas , Genômica/normas , Filogenia , Proteômica/normas , Archaea/classificação , Archaea/genética , Bactérias/classificação , Bactérias/genética , Biologia Computacional/métodos , Bases de Dados Genéticas , Eucariotos/classificação , Eucariotos/genética , Ontologia Genética , Genômica/métodos , Modelos Genéticos , Proteômica/métodos , Análise de Sequência de Proteína , Homologia de Sequência , Especificidade da Espécie
18.
BMC Evol Biol ; 15: 222, 2015 Oct 12.
Artigo em Inglês | MEDLINE | ID: mdl-26459560

RESUMO

BACKGROUND: Transposable elements (TE) have attracted much attention since they shape the genome and contribute to species evolution. Organisms have evolved mechanisms to control TE activity. Testis expressed 19 (Tex19) represses TE expression in mouse testis and placenta. In the human and mouse genomes, Tex19 and Secreted and transmembrane 1 (Sectm1) are neighbors but are not homologs. Sectm1 is involved in immunity and its molecular phylogeny is unknown. METHODS: Using multiple alignments of complete protein sequences (MACS), we inferred Tex19 and Sectm1 molecular phylogenies. Protein conserved regions were identified and folds were predicted. Finally, expression patterns were studied across tissues and species using RNA-seq public data and RT-PCR. RESULTS: We present 2 high quality alignments of 58 Tex19 and 58 Sectm1 protein sequences from 48 organisms. First, both genes are eutherian-specific, i.e., exclusively present in mammals except monotremes (platypus) and marsupials. Second, Tex19 and Sectm1 have both duplicated in Sciurognathi and Bovidae while they have remained as single copy genes in all further placental mammals. Phylogenetic concordance between both genes was significant (p-value < 0.05) and supported co-evolution and functional relationship. At the protein level, Tex19 exhibits 3 conserved regions and 4 invariant cysteines. In particular, a CXXC motif is present in the N-terminal conserved region. Sectm1 exhibits 2 invariant cysteines and an Ig-like domain. Strikingly, Tex19 C-terminal conserved region was lost in Haplorrhini primates while a Sectm1 C-terminal extra domain was acquired. Finally, we have determined that Tex19 and Sectm1 expression levels anti-correlate across the testis of several primates (ρ = -0.72) which supports anti-regulation. CONCLUSIONS: Tex19 and Sectm1 co-evolution and anti-regulated expressions support a strong functional relationship between both genes. Since Tex19 operates a control on TE and Sectm1 plays a role in immunity, Tex19 might suppress an immune response directed against cells that show TE activity in eutherian reproductive tissues.


Assuntos
Evolução Molecular , Mamíferos/genética , Proteínas de Membrana/genética , Proteínas Nucleares/genética , Sequência de Aminoácidos , Animais , Feminino , Expressão Gênica , Humanos , Masculino , Mamíferos/classificação , Mamíferos/metabolismo , Proteínas de Membrana/química , Proteínas de Membrana/metabolismo , Camundongos , Dados de Sequência Molecular , Proteínas Nucleares/química , Proteínas Nucleares/metabolismo , Filogenia , Placenta/metabolismo , Gravidez , Proteínas de Ligação a RNA , Ratos , Retroelementos , Testículo/metabolismo
19.
Bioinformatics ; 31(3): 447-8, 2015 Feb 01.
Artigo em Inglês | MEDLINE | ID: mdl-25273105

RESUMO

SUMMARY: We previously developed OrthoInspector, a package incorporating an original algorithm for the detection of orthology and inparalogy relations between different species. We have added new functionalities to the package. While its original algorithm was not modified, performing similar orthology predictions, we facilitated the prediction of very large databases (thousands of proteomes), refurbished its graphical interface, added new visualization tools for comparative genomics/protein family analysis and facilitated its deployment in a network environment. Finally, we have released three online databases of precomputed orthology relationships. AVAILABILITY: Package and databases are freely available at http://lbgi.fr/orthoinspector with all major browsers supported. CONTACT: odile.lecompte@unistra.fr SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Algoritmos , Gráficos por Computador , Bases de Dados Factuais , Proteômica/métodos , Análise de Sequência de Proteína/métodos , Software , Humanos , Anotação de Sequência Molecular , Filogenia
20.
Mol Endocrinol ; 28(2): 260-72, 2014 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-24422634

RESUMO

Retinoic acid (RA) controls many aspects of embryonic development by binding to specific receptors (retinoic acid receptors [RARs]) that regulate complex transcriptional networks. Three different RAR subtypes are present in vertebrates and play both common and specific roles in transducing RA signaling. Specific activities of each receptor subtype can be correlated with its exclusive expression pattern, whereas shared activities between different subtypes are generally assimilated to functional redundancy. However, the question remains whether some subtype-specific activity still exists in regions or organs coexpressing multiple RAR subtypes. We tackled this issue at the transcriptional level using early zebrafish embryo as a model. Using morpholino knockdown, we specifically invalidated the zebrafish endogenous RAR subtypes in an in vivo context. After building up a list of RA-responsive genes in the zebrafish gastrula through a whole-transcriptome analysis, we compared this panel of genes with those that still respond to RA in embryos lacking one or another RAR subtype. Our work reveals that RAR subtypes do not have fully redundant functions at the transcriptional level but can transduce RA signal in a subtype-specific fashion. As a result, we define RAR subtype-specific transcriptotypes that correspond to repertoires of genes activated by different RAR subtypes. Finally, we found genes of the RA pathway (cyp26a1, raraa) the regulation of which by RA is highly robust and can even resist the knockdown of all RARs. This suggests that RA-responsive genes are differentially sensitive to alterations in the RA pathway and, in particular, cyp26a1 and raraa are under a high pressure to maintain signaling integrity.


Assuntos
Gástrula/metabolismo , Receptores do Ácido Retinoico/metabolismo , Peixe-Zebra/genética , Animais , Sequência de Bases , Sistema Enzimático do Citocromo P-450/genética , Sistema Enzimático do Citocromo P-450/metabolismo , Regulação da Expressão Gênica no Desenvolvimento , Técnicas de Silenciamento de Genes , Isoformas de Proteínas/genética , Isoformas de Proteínas/metabolismo , Receptores do Ácido Retinoico/antagonistas & inibidores , Receptores do Ácido Retinoico/genética , Ácido Retinoico 4 Hidroxilase , Transdução de Sinais , Transcrição Gênica , Peixe-Zebra/embriologia , Peixe-Zebra/metabolismo , Proteínas de Peixe-Zebra
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA