RESUMO
In mammals, a single Dicer participates in biogenesis of small RNAs in microRNA (miRNA) and RNAi pathways. In mice, endogenous RNAi is highly active in oocytes, but not in somatic cells, which we ascribe here to an oocyte-specific Dicer isoform (Dicer(O)). Dicer(O) lacks the N-terminal DExD helicase domain and has higher cleavage activity than the full-length Dicer in somatic cells (Dicer(S)). Unlike Dicer(S), Dicer(O) efficiently produces small RNAs from long double-stranded (dsRNA) substrates. Expression of the Dicer(O) isoform is driven by an intronic MT-C retrotransposon promoter, deletion of which causes loss of Dicer(O) and female sterility. Oocytes from females lacking the MT-C element show meiotic spindle defects and increased levels of endogenous small interfering RNA (endo-siRNA) targets, phenocopying the maternal Dicer null phenotype. The alternative Dicer isoform, whose phylogenetic origin demonstrates evolutionary plasticity of RNA-silencing pathways, is the main determinant of endogenous RNAi activity in the mouse female germline.
Assuntos
RNA Helicases DEAD-box/genética , RNA Helicases DEAD-box/metabolismo , Oócitos/metabolismo , RNA Interferente Pequeno/metabolismo , Retroelementos , Ribonuclease III/genética , Ribonuclease III/metabolismo , Animais , Sequência de Bases , RNA Helicases DEAD-box/química , Feminino , Expressão Gênica , Infertilidade Feminina , Camundongos , Dados de Sequência Molecular , Filogenia , Regiões Promotoras Genéticas , Isoformas de Proteínas/química , Isoformas de Proteínas/genética , Isoformas de Proteínas/metabolismo , RNA Interferente Pequeno/química , Ribonuclease III/químicaRESUMO
BACKGROUND: Human mitochondrial heteroplasmy is an extensively investigated phenomenon in the context of medical diagnostics, forensic identification and molecular evolution. However, technical limitations of high-throughput sequencing hinder reliable determination of point heteroplasmies (PHPs) with minor allele frequencies (MAFs) within the noise threshold. RESULTS: To investigate the PHP landscape at an MAF threshold down to 0.1%, we sequenced whole mitochondrial genomes at approximately 7.700x coverage, in multiple technical and biological replicates of longitudinal blood and buccal swab samples from 11 human donors (159 libraries in total). The results obtained by two independent sequencing platforms and bioinformatics pipelines indicate distinctive PHP patterns below and above the 1% MAF cut-off. We found a high inter-individual prevalence of low-level PHPs (MAF < 1%) at polymorphic positions of the mitochondrial DNA control region (CR), their tissue preference, and a tissue-specific minor allele linkage. We also established the position-dependent potential of minor allele expansion in PHPs, and short-term PHP instability in a mitotically active tissue. We demonstrate that the increase in sensitivity of PHP detection to minor allele frequencies below 1% within a robust experimental and analytical pipeline, provides new information with potential applicative value. CONCLUSIONS: Our findings reliably show different mutational loads between tissues at sub-1% allele frequencies, which may serve as an informative medical biomarker of time-dependent, tissue-specific mutational burden, or help discriminate forensically relevant tissues in a single person, close maternal relatives or unrelated individuals of similar phylogenetic background.
Assuntos
Heteroplasmia , Mitocôndrias , Humanos , Filogenia , Mitocôndrias/genética , Sequenciamento de Nucleotídeos em Larga Escala , DNA Mitocondrial/genéticaRESUMO
The determination of the protein's intracellular localization is essential for understanding its biological function. Protein localization studies are mainly performed on primary and secondary vertebrate cell lines for which most protocols have been optimized. In spite of experimental difficulties, studies on invertebrate cells, including basal Metazoa, have greatly advanced. In recent years, the interest in studying human diseases from an evolutionary perspective has significantly increased. Sponges, placed at the base of the animal tree, are simple animals without true tissues and organs but with a complex genome containing many genes whose human homologs have been implicated in human diseases, including cancer. Therefore, sponges are an innovative model for elucidating the fundamental role of the proteins involved in cancer. In this study, we overexpressed human cancer-related proteins and their sponge homologs in human cancer cells, human fibroblasts, and sponge cells. We demonstrated that human and sponge MYC proteins localize in the nucleus, the RRAS2 in the plasma membrane, the membranes of the endolysosomal vesicles, and the DRG1 in the cell's cytosol. Despite the very low transfection efficiency of sponge cells, we observed an identical localization of human proteins and their sponge homologs, indicating their similar cellular functions.
Assuntos
Proteínas Monoméricas de Ligação ao GTP , Neoplasias , Poríferos , Animais , Humanos , Genoma , Evolução Biológica , Linhagem Celular , Transfecção , Proteínas de MembranaRESUMO
Correspondence between evolution and development has been discussed for more than two centuries. Recent work reveals that phylogeny-ontogeny correlations are indeed present in developmental transcriptomes of eukaryotic clades with complex multicellularity. Nevertheless, it has been largely ignored that the pervasive presence of phylogeny-ontogeny correlations is a hallmark of development in eukaryotes. This perspective opens a possibility to look for similar parallelisms in biological settings where developmental logic and multicellular complexity are more obscure. For instance, it has been increasingly recognized that multicellular behavior underlies biofilm formation in bacteria. However, it remains unclear whether bacterial biofilm growth shares some basic principles with development in complex eukaryotes. Here we show that the ontogeny of growing Bacillus subtilis biofilms recapitulates phylogeny at the expression level. Using time-resolved transcriptome and proteome profiles, we found that biofilm ontogeny correlates with the evolutionary measures, in a way that evolutionary younger and more diverged genes were increasingly expressed toward later timepoints of biofilm growth. Molecular and morphological signatures also revealed that biofilm growth is highly regulated and organized into discrete ontogenetic stages, analogous to those of eukaryotic embryos. Together, this suggests that biofilm formation in Bacillus is a bona fide developmental process comparable to organismal development in animals, plants, and fungi. Given that most cells on Earth reside in the form of biofilms and that biofilms represent the oldest known fossils, we anticipate that the widely adopted vision of the first life as a single-cell and free-living organism needs rethinking.
Assuntos
Bacillus subtilis/fisiologia , Biofilmes , Evolução Biológica , Bacillus subtilis/citologiaRESUMO
Tens of thousands of rapidly evolving long non-coding RNA (lncRNA) genes have been identified, but functions were assigned to relatively few of them. The lncRNA contribution to the mouse oocyte physiology remains unknown. We report the evolutionary history and functional analysis of Sirena1, the most expressed lncRNA and the 10th most abundant poly(A) transcript in mouse oocytes. Sirena1 appeared in the common ancestor of mouse and rat and became engaged in two different post-transcriptional regulations. First, antisense oriented Elob pseudogene insertion into Sirena1 exon 1 is a source of small RNAs targeting Elob mRNA via RNA interference. Second, Sirena1 evolved functional cytoplasmic polyadenylation elements, an unexpected feature borrowed from translation control of specific maternal mRNAs. Sirena1 knock-out does not affect fertility, but causes minor dysregulation of the maternal transcriptome. This includes increased levels of Elob and mitochondrial mRNAs. Mitochondria in Sirena1-/- oocytes disperse from the perinuclear compartment, but do not change in number or ultrastructure. Taken together, Sirena1 contributes to RNA interference and mitochondrial aggregation in mouse oocytes. Sirena1 exemplifies how lncRNAs stochastically engage or even repurpose molecular mechanisms during evolution. Simultaneously, Sirena1 expression levels and unique functional features contrast with the lack of functional importance assessed under laboratory conditions.
Assuntos
Mitocôndrias/genética , Oócitos/metabolismo , RNA Longo não Codificante/genética , RNA Mensageiro/genética , RNA Mitocondrial/genética , Animais , Técnicas de Inativação de Genes , Camundongos , Mitocôndrias/ultraestrutura , Oócitos/crescimento & desenvolvimento , Oócitos/ultraestrutura , Poliadenilação/genética , Ratos , Transcriptoma/genéticaRESUMO
Bacilli can form dormant, highly resistant, and metabolically inactive spores to cope with extreme environmental challenges. In this study, we examined the evolutionary age of Bacillus subtilis sporulation genes using the approach known as genomic phylostratigraphy. We found that B. subtilis sporulation genes cluster in several groups that emerged at distant evolutionary time-points, suggesting that the sporulation process underwent several stages of expansion. Next, we asked whether such evolutionary stratification of the genome could be used to predict involvement in sporulation of presently uncharacterized genes (y-genes). We individually inactivated a representative sample of uncharacterized genes that arose during the same evolutionary periods as the known sporulation genes and tested the resulting strains for sporulation phenotypes. Sporulation was significantly affected in 16 out of 37 (43%) tested strains. In addition to expanding the knowledge base on B. subtilis sporulation, our findings suggest that evolutionary age could be used to help with genome mining.
Assuntos
Bacillus subtilis/fisiologia , Evolução Molecular , Genoma Bacteriano , Esporos Bacterianos , FenótipoRESUMO
Cancer is a disease potentiated by mutations in somatic cells. Cancer mutations are not distributed uniformly along the human genome. Instead, different human genomic regions vary by up to fivefold in the local density of cancer somatic mutations, posing a fundamental problem for statistical methods used in cancer genomics. Epigenomic organization has been proposed as a major determinant of the cancer mutational landscape. However, both somatic mutagenesis and epigenomic features are highly cell-type-specific. We investigated the distribution of mutations in multiple independent samples of diverse cancer types and compared them to cell-type-specific epigenomic features. Here we show that chromatin accessibility and modification, together with replication timing, explain up to 86% of the variance in mutation rates along cancer genomes. The best predictors of local somatic mutation density are epigenomic features derived from the most likely cell type of origin of the corresponding malignancy. Moreover, we find that cell-of-origin chromatin features are much stronger determinants of cancer mutation profiles than chromatin features of matched cancer cell lines. Furthermore, we show that the cell type of origin of a cancer can be accurately determined based on the distribution of mutations along its genome. Thus, the DNA sequence of a cancer genome encompasses a wealth of information about the identity and epigenomic features of its cell of origin.
Assuntos
Cromatina/genética , Cromatina/metabolismo , Epigênese Genética/genética , Mutação/genética , Neoplasias/genética , Neoplasias/patologia , Linhagem Celular Tumoral , Cromatina/química , Período de Replicação do DNA , Epigenômica , Genoma Humano/genética , Humanos , Melanócitos/metabolismo , Melanócitos/patologia , Melanoma/genética , Melanoma/patologia , Especificidade de Órgãos/genéticaRESUMO
Retrotransposons are "copy-and-paste" insertional mutagens that substantially contribute to mammalian genome content. Retrotransposons often carry long terminal repeats (LTRs) for retrovirus-like reverse transcription and integration into the genome. We report an extraordinary impact of a group of LTRs from the mammalian endogenous retrovirus-related ERVL retrotransposon class on gene expression in the germline and beyond. In mouse, we identified more than 800 LTRs from ORR1, MT, MT2, and MLT families, which resemble mobile gene-remodeling platforms that supply promoters and first exons. The LTR-mediated gene remodeling also extends to hamster, human, and bovine oocytes. The LTRs function in a stage-specific manner during the oocyte-to-embryo transition by activating transcription, altering protein-coding sequences, producing noncoding RNAs, and even supporting evolution of new protein-coding genes. These functions result, for example, in recycling processed pseudogenes into mRNAs or lncRNAs with regulatory roles. The functional potential of the studied LTRs is even higher, because we show that dormant LTR promoter activity can rescue loss of an essential upstream promoter. We also report a novel protein-coding gene evolution-D6Ertd527e-in which an MT LTR provided a promoter and the 5' exon with a functional start codon while the bulk of the protein-coding sequence evolved through a CAG repeat expansion. Altogether, ERVL LTRs provide molecular mechanisms for stochastically scanning, rewiring, and recycling genetic information on an extraordinary scale. ERVL LTRs thus offer means for a comprehensive survey of the genome's expression potential, tightly intertwining with gene expression and evolution in the germline.
Assuntos
Evolução Molecular , Regulação da Expressão Gênica , Oócitos/metabolismo , Retroelementos , Sequências Repetidas Terminais , Zigoto/metabolismo , Animais , Bovinos , Cricetinae , Retrovirus Endógenos , Humanos , Camundongos , Oócitos/citologia , Regiões Promotoras Genéticas , Transcrição Gênica , Zigoto/citologiaRESUMO
Initiation of zygotic transcription in mammals is poorly understood. In mice, zygotic transcription is first detected shortly after pronucleus formation in 1-cell embryos, but the identity of the transcribed loci and mechanisms regulating their expression are not known. Using total RNA-Seq, we have found that transcription in 1-cell embryos is highly promiscuous, such that intergenic regions are extensively expressed and thousands of genes are transcribed at comparably low levels. Striking is that transcription can occur in the absence of defined core-promoter elements. Furthermore, accumulation of translatable zygotic mRNAs is minimal in 1-cell embryos because of inefficient splicing and 3' processing of nascent transcripts. These findings provide novel insights into regulation of gene expression in 1-cell mouse embryos that may confer a protective mechanism against precocious gene expression that is the product of a relaxed chromatin structure present in 1-cell embryos. The results also suggest that the first zygotic transcription itself is an active component of chromatin remodeling in 1-cell embryos.
Assuntos
Regiões 3' não Traduzidas/fisiologia , Embrião de Mamíferos/metabolismo , Regulação da Expressão Gênica no Desenvolvimento/fisiologia , Splicing de RNA/fisiologia , Transcrição Gênica/fisiologia , Zigoto/metabolismo , Animais , Cromatina/metabolismo , Montagem e Desmontagem da Cromatina/fisiologia , Embrião de Mamíferos/citologia , Camundongos , Zigoto/citologiaRESUMO
UNLABELLED: Biological insights can be obtained through computational integration of genomics data sets consisting of diverse types of information. The integration is often hampered by a large variety of existing file formats, often containing similar information, and the necessity to use complicated tools to achieve the desired results. We have built an R package, genomation, to expedite the extraction of biological information from high throughput data. The package works with a variety of genomic interval file types and enables easy summarization and annotation of high throughput data sets with given genomic annotations. AVAILABILITY AND IMPLEMENTATION: The software is currently distributed under MIT artistic license and freely available at http://bioinformatics.mdc-berlin.de/genomation, and through the Bioconductor framework.
Assuntos
Gráficos por Computador , Éxons/genética , Genômica/métodos , Íntrons/genética , Anotação de Sequência Molecular , Software , Biologia Computacional/métodos , Bases de Dados Genéticas , Genoma Humano , Sequenciamento de Nucleotídeos em Larga Escala , HumanosRESUMO
Exon splicing enhancers (ESEs) overlap with amino acid coding sequences implying a dual evolutionary selective pressure. In this study, we map ESEs in the placental alkaline phosphatase gene (ALPP), absent in the corresponding exon of the ancestral tissue-non-specific alkaline phosphatase gene (ALPL). The ESEs are associated with amino acid differences between the transcripts in an area otherwise conserved. We switched out the ALPP ESEs sequences with the sequence from the related ALPL, introducing the associated amino acid changes. The resulting enzymes, produced by cDNA expression, showed different kinetic characteristics than ALPL and ALPP. In the organism, this enzyme will never be subjected to selection because gene splicing analysis shows exon skipping due to loss of the ESE. Our data prove that ESEs restrict the evolution of enzymatic activity. Thus, suboptimal proteins may exist in scenarios when coding nucleotide changes and consequent amino acid variation cannot be reconciled with the splicing function.
Assuntos
Fosfatase Alcalina/genética , Evolução Molecular , Éxons , Isoenzimas/genética , Fosfatase Alcalina/química , Sequência de Aminoácidos , Animais , Sequência de Bases , Células COS , Domínio Catalítico , Chlorocebus aethiops , Simulação por Computador , Proteínas Ligadas por GPI/química , Proteínas Ligadas por GPI/genética , Duplicação Gênica , Genoma Humano , Células HeLa , Humanos , Isoenzimas/química , Cinética , Modelos Genéticos , Dados de Sequência Molecular , Sítios de Splice de RNA , SoftwareRESUMO
Microbial communities represent the largest portion of the Earth's biomass. Metagenomics projects use high-throughput sequencing to survey these communities and shed light on genetic capabilities that enable microbes to inhabit every corner of the biosphere. Metagenome studies are generally based on (i) classifying and ranking functions of identified genes; and (ii) estimating the phyletic distribution of constituent microbial species. To understand microbial communities at the systems level, it is necessary to extend these studies beyond the species' boundaries and capture higher levels of metabolic complexity. We evaluated 11 metagenome samples and demonstrated that microbes inhabiting the same ecological niche share common preferences for synonymous codons, regardless of their phylogeny. By exploring concepts of translational optimization through codon usage adaptation, we demonstrated that community-wide bias in codon usage can be used as a prediction tool for lifestyle-specific genes across the entire microbial community, effectively considering microbial communities as meta-genomes. These findings set up a 'functional metagenomics' platform for the identification of genes relevant for adaptations of entire microbial communities to environments. Our results provide valuable arguments in defining the concept of microbial species through the context of their interactions within the community.
Assuntos
Adaptação Biológica/genética , Códon , Metagenoma , Animais , Ecossistema , Genoma Bacteriano , Humanos , Metagenômica , Camundongos , Filogenia , ProteômicaRESUMO
Histones are frequently decorated with covalent modifications. These histone modifications are thought to be involved in various chromatin-dependent processes including transcription. To elucidate the relationship between histone modifications and transcription, we derived quantitative models to predict the expression level of genes from histone modification levels. We found that histone modification levels and gene expression are very well correlated. Moreover, we show that only a small number of histone modifications are necessary to accurately predict gene expression. We show that different sets of histone modifications are necessary to predict gene expression driven by high CpG content promoters (HCPs) or low CpG content promoters (LCPs). Quantitative models involving H3K4me3 and H3K79me1 are the most predictive of the expression levels in LCPs, whereas HCPs require H3K27ac and H4K20me1. Finally, we show that the connections between histone modifications and gene expression seem to be general, as we were able to predict gene expression levels of one cell type using a model trained on another one.
Assuntos
Linfócitos T CD4-Positivos/metabolismo , Regulação da Expressão Gênica/fisiologia , Histonas/metabolismo , Modelos Biológicos , Biologia Computacional , Ilhas de CpG/genética , Humanos , Regiões Promotoras Genéticas/genética , Análise de RegressãoRESUMO
Codon usage bias in prokaryotic genomes is largely a consequence of background substitution patterns in DNA, but highly expressed genes may show a preference towards codons that enable more efficient and/or accurate translation. We introduce a novel approach based on supervised machine learning that detects effects of translational selection on genes, while controlling for local variation in nucleotide substitution patterns represented as sequence composition of intergenic DNA. A cornerstone of our method is a Random Forest classifier that outperformed previous distance measure-based approaches, such as the codon adaptation index, in the task of discerning the (highly expressed) ribosomal protein genes by their codon frequencies. Unlike previous reports, we show evidence that translational selection in prokaryotes is practically universal: in 460 of 461 examined microbial genomes, we find that a subset of genes shows a higher codon usage similarity to the ribosomal proteins than would be expected from the local sequence composition. These genes constitute a substantial part of the genome--between 5% and 33%, depending on genome size--while also exhibiting higher experimentally measured mRNA abundances and tending toward codons that match tRNA anticodons by canonical base pairing. Certain gene functional categories are generally enriched with, or depleted of codon-optimized genes, the trends of enrichment/depletion being conserved between Archaea and Bacteria. Prominent exceptions from these trends might indicate genes with alternative physiological roles; we speculate on specific examples related to detoxication of oxygen radicals and ammonia and to possible misannotations of asparaginyl-tRNA synthetases. Since the presence of codon optimizations on genes is a valid proxy for expression levels in fully sequenced genomes, we provide an example of an "adaptome" by highlighting gene functions with expression levels elevated specifically in thermophilic Bacteria and Archaea.
Assuntos
Células Procarióticas/metabolismo , Biossíntese de Proteínas , Archaea/genética , Bactérias/genética , Códon , Regulação da Expressão Gênica , Genoma , Modelos Genéticos , Proteínas Ribossômicas/genéticaRESUMO
HIV-1 encounters the hierarchically organized host chromatin to stably integrate and persist in anatomically distinct latent reservoirs. The contribution of genome organization in HIV-1 infection has been largely understudied across different HIV-1 targets. Here, we determine HIV-1 integration sites (ISs), associate them with chromatin and expression signatures at different genomic scales in a microglia cell model, and profile them together with the primary T cell reservoir. HIV-1 insertions into introns of actively transcribed genes with IS hotspots in genic and super-enhancers, characteristic of blood cells, are maintained in the microglia cell model. Genome organization analysis reveals dynamic CCCTC-binding factor (CTCF) clusters in cells with active and repressed HIV-1 transcription, whereas CTCF removal impairs viral integration. We identify CTCF-enriched topologically associated domain (TAD) boundaries with signatures of transcriptionally active chromatin as HIV-1 integration determinants in microglia and CD4+ T cells, highlighting the importance of host genome organization in HIV-1 infection.
Assuntos
HIV-1 , HIV-1/genética , HIV-1/metabolismo , Microglia/metabolismo , Fator de Ligação a CCCTC/metabolismo , Cromatina , Genômica , Integração Viral/genéticaRESUMO
Cancer is a disease caused by errors within the multicellular system and it represents a major health issue in multicellular organisms. Although cancer research has advanced substantially, new approaches focusing on fundamental aspects of cancer origin and mechanisms of spreading are necessary. Comparative genomic studies have shown that most genes linked to human cancer emerged during the early evolution of Metazoa. Thus, basal animals without true tissues and organs, such as sponges (Porifera), might be an innovative model system for understanding the molecular mechanisms of proteins involved in cancer biology. One of these proteins is developmentally regulated GTP-binding protein 1 (DRG1), a GTPase stabilized by interaction with DRG family regulatory protein 1 (DFRP1). This study reveals a high evolutionary conservation of DRG1 gene/protein in metazoans. Our biochemical analysis and structural predictions show that both recombinant sponge and human DRG1 are predominantly monomers that form complexes with DFRP1 and bind non-specifically to RNA and DNA. We demonstrate the conservation of sponge and human DRG1 biological features, including intracellular localization and DRG1:DFRP1 binding, function of DRG1 in α-tubulin dynamics, and its role in cancer biology demonstrated by increased proliferation, migration and colonization in human cancer cells. These results suggest that the ancestor of all Metazoa already possessed DRG1 that is structurally and functionally similar to the human DRG1, even before the development of real tissues or tumors, indicating an important function of DRG1 in fundamental cellular pathways.
Assuntos
Neoplasias , Oncogenes , Animais , Proteínas de Ligação ao GTP , Genômica , Humanos , Neoplasias/genética , RNA , Fatores de TranscriçãoRESUMO
TDP-43 is a multifunctional RNA-binding protein found to be a major protein component of intracellular inclusions found in neurodegenerative disorders such as Fronto Temporal Lobar Degeneration, Amyotrophic Lateral Sclerosis, and Alzheimer Disease. PRO-MINE (PROtein Mutations In NEurodegeneration) is a database populated with manually curated data from the literature regarding all TDP-43/TDP43/TARDBP gene disease-associated mutations identified to date. A web server interface has been developed to query the database and to provide tools for the analysis of already reported or novel TDP-43 gene mutations. As is usually the case with genetic association studies, assessing the potential impact of identified mutations is of crucial importance, and in order to avoid prediction biases it is essential to compare the prediction results. However, in most cases mutations have to be submitted separately to various prediction tools and the individual results manually merged together afterwards. The implemented web server aims to overcome the problem by providing simultaneous access to several prediction tools and by displaying the results into a single output. Furthermore, the results are displayed together in a comprehensive output for a more convenient analysis and are enriched with additional information about mutations. In addition, our web server can also display the mutation(s) of interest within an alignment of annotated TDP-43 protein sequences from different vertebrate species. In this way, the degree of sequence conservation where the mutation(s) occur can be easily tracked and visualized. The web server is freely available to researchers and can be accessed at http://bioinfo.hr/pro-mine.
Assuntos
Biologia Computacional , Proteínas de Ligação a DNA/genética , Bases de Dados Genéticas , Mutação , Sequência de Aminoácidos , Biologia Computacional/instrumentação , Biologia Computacional/métodos , Humanos , Dados de Sequência Molecular , Doenças Neurodegenerativas/genética , SoftwareRESUMO
Sponges (Porifera) are among the simplest living and the earliest branching metazoans. They hold a pivotal role for studying genome evolution of the entire metazoan branch, both as an outgroup to Eumetazoa and as the closest branching phylum to the common ancestor of all multicellular animals (Urmetazoa). In order to assess the transcription inventory of sponges, we sequenced expressed sequence tag libraries of two demosponge species, Suberites domuncula and Lubomirskia baicalensis, and systematically analyzed the assembled sponge transcripts against their homologs from complete proteomes of six well-characterized metazoans--Nematostella vectensis, Caenorhabditis elegans, Drosophila melanogaster, Strongylocentrotus purpuratus, Ciona intestinalis, and Homo sapiens. We show that even the earliest metazoan species already have strikingly complex genomes in terms of gene content and functional repertoire and that the rich gene repertoire existed even before the emergence of true tissues, therefore further emphasizing the importance of gene loss and spatio-temporal changes in regulation of gene expression in shaping the metazoan genomes. Our findings further indicate that sponge and human genes generally show similarity levels higher than expected from their respective positions in metazoan phylogeny, providing direct evidence for slow rate of evolution in both "basal" and "apical" metazoan genome lineages. We propose that the ancestor of all metazoans had already had an unusually complex genome, thereby shifting the origins of genome complexity from Urbilateria to Urmetazoa.
Assuntos
Evolução Molecular , Etiquetas de Sequências Expressas , Filogenia , Poríferos/genética , Suberites/genética , Animais , Sequência de Bases , Hibridização Genômica Comparativa , Regulação da Expressão Gênica , Genoma , Dados de Sequência Molecular , Alinhamento de Sequência , Homologia de Sequência de AminoácidosRESUMO
After performing hydrophilic interaction and weak anion exchange high-performance liquid chromatography to analyze N-glycans in the plasma of 1991 people, we identified several individuals that differed significantly from the "normal" profile of N-glycans. By performing consensus scoring of pairwise distances between vectors containing measured glycan values, we formed six groups of individuals with specific glyco-phenotypes. Some aberrations from the normal plasma protein patterns were found to be associated with clinical conditions (such as renal problems in people with increased monosialylated biantennary glycans, A2G2S1), while other substantial changes in N-glycan structure, such as the near complete absence of neutral glycans or antennary fucosylated tri- and tetraantennary glycans, were not associated with any observed adverse health outcomes. These results demonstrate the existence of specific altered glyco-phenotypes in some individuals and indicate that in some cases they might represent risk factors for the development of specific diseases.