Pesquisa | Portal Regional da BVS

1.

Cohesin and CTCF do not assemble TADs in Xenopus sperm and male pronuclei.

Jessberger, Gregor; Várnai, Csilla; Stocsits, Roman R; Tang, Wen; Stary, Georg; Peters, Jan-Michael.

Genome Res ; 2023 Dec 21.

Artigo em Inglês | MEDLINE | ID: mdl-38129077

RESUMO

Paternal genomes are compacted during spermiogenesis and decompacted following fertilization. These processes are fundamental for inheritance but incompletely understood. We analyzed these processes in the frog Xenopus laevis, whose sperm can be assembled into functional pronuclei in egg extracts in vitro. In such extracts, cohesin extrudes DNA into loops, but in vivo cohesin only assembles topologically associating domains (TADs) at the mid-blastula transition (MBT). Why cohesin assembles TADs only at this stage is unknown. We first analyzed genome architecture in frog sperm and compared it to human and mouse. Our results indicate that sperm genome organization is conserved between frogs and humans and occurs without formation of TADs. TADs can be detected in mouse sperm samples, as reported, but these structures might originate from somatic chromatin contaminations. We therefore discuss the possibility that the absence of TADs might be a general feature of vertebrate sperm. To analyze sperm genome remodeling upon fertilization, we reconstituted male pronuclei in Xenopus egg extracts. In pronuclei, chromatin compartmentalization increases, but cohesin does not accumulate at CTCF sites and assemble TADs. However, if pronuclei are formed in the presence of exogenous CTCF, CTCF binds to its consensus sites, and cohesin accumulates at these and forms short-range chromatin loops, which are preferentially anchored at CTCF's N terminus. These results indicate that TADs are only assembled at MBT because before this stage CTCF sites are not occupied and cohesin only forms short-range chromatin loops.

2.

Cohesin mediates DNA loop extrusion and sister chromatid cohesion by distinct mechanisms.

Nagasaka, Kota; Davidson, Iain F; Stocsits, Roman R; Tang, Wen; Wutz, Gordana; Batty, Paul; Panarotto, Melanie; Litos, Gabriele; Schleiffer, Alexander; Gerlich, Daniel W; Peters, Jan-Michael.

Mol Cell ; 83(17): 3049-3063.e6, 2023 09 07.

Artigo em Inglês | MEDLINE | ID: mdl-37591243

RESUMO

Cohesin connects CTCF-binding sites and other genomic loci in cis to form chromatin loops and replicated DNA molecules in trans to mediate sister chromatid cohesion. Whether cohesin uses distinct or related mechanisms to perform these functions is unknown. Here, we describe a cohesin hinge mutant that can extrude DNA into loops but is unable to mediate cohesion in human cells. Our results suggest that the latter defect arises during cohesion establishment. The observation that cohesin's cohesion and loop extrusion activities can be partially separated indicates that cohesin uses distinct mechanisms to perform these two functions. Unexpectedly, the same hinge mutant can also not be stopped by CTCF boundaries as well as wild-type cohesin. This suggests that cohesion establishment and cohesin's interaction with CTCF boundaries depend on related mechanisms and raises the possibility that both require transient hinge opening to entrap DNA inside the cohesin ring.

Assuntos

Proteínas de Ciclo Celular , Cromátides , Humanos , Cromátides/genética , Sítios de Ligação , Proteínas de Ciclo Celular/genética , Proteínas Cromossômicas não Histona/genética , Coesinas

3.

HiCognition: a visual exploration and hypothesis testing tool for 3D genomics.

Langer, Christoph C H; Mitter, Michael; Stocsits, Roman R; Gerlich, Daniel W.

Genome Biol ; 24(1): 158, 2023 07 05.

Artigo em Inglês | MEDLINE | ID: mdl-37408019

RESUMO

Genome browsers facilitate integrated analysis of multiple genomics datasets yet visualize only a few regions at a time and lack statistical functions for extracting meaningful information. We present HiCognition, a visual exploration and machine-learning tool based on a new genomic region set concept, enabling detection of patterns and associations between 3D chromosome conformation and collections of 1D genomics profiles of any type. By revealing how transcription and cohesion subunit isoforms contribute to chromosome conformation, we showcase how the flexible user interface and machine learning tools of HiCognition help to understand the relationship between the structure and function of the genome.

Assuntos

Genoma Humano , Genômica , Software , Humanos , Genômica/métodos , Cromossomos Humanos , Aprendizado de Máquina

4.

Transcription shapes 3D chromatin organization by interacting with loop extrusion.

Banigan, Edward J; Tang, Wen; van den Berg, Aafke A; Stocsits, Roman R; Wutz, Gordana; Brandão, Hugo B; Busslinger, Georg A; Peters, Jan-Michael; Mirny, Leonid A.

Proc Natl Acad Sci U S A ; 120(11): e2210480120, 2023 03 14.

Artigo em Inglês | MEDLINE | ID: mdl-36897969

RESUMO

Cohesin folds mammalian interphase chromosomes by extruding the chromatin fiber into numerous loops. "Loop extrusion" can be impeded by chromatin-bound factors, such as CTCF, which generates characteristic and functional chromatin organization patterns. It has been proposed that transcription relocalizes or interferes with cohesin and that active promoters are cohesin loading sites. However, the effects of transcription on cohesin have not been reconciled with observations of active extrusion by cohesin. To determine how transcription modulates extrusion, we studied mouse cells in which we could alter cohesin abundance, dynamics, and localization by genetic "knockouts" of the cohesin regulators CTCF and Wapl. Through Hi-C experiments, we discovered intricate, cohesin-dependent contact patterns near active genes. Chromatin organization around active genes exhibited hallmarks of interactions between transcribing RNA polymerases (RNAPs) and extruding cohesins. These observations could be reproduced by polymer simulations in which RNAPs were moving barriers to extrusion that obstructed, slowed, and pushed cohesins. The simulations predicted that preferential loading of cohesin at promoters is inconsistent with our experimental data. Additional ChIP-seq experiments showed that the putative cohesin loader Nipbl is not predominantly enriched at promoters. Therefore, we propose that cohesin is not preferentially loaded at promoters and that the barrier function of RNAP accounts for cohesin accumulation at active promoters. Altogether, we find that RNAP is an extrusion barrier that is not stationary, but rather, translocates and relocalizes cohesin. Loop extrusion and transcription might interact to dynamically generate and maintain gene interactions with regulatory elements and shape functional genomic organization.

Assuntos

Proteínas de Ciclo Celular , Cromatina , Animais , Camundongos , Fator de Ligação a CCCTC/genética , Proteínas de Ciclo Celular/metabolismo , Cromossomos de Mamíferos/metabolismo , RNA Polimerases Dirigidas por DNA/genética , Mamíferos/genética

5.

ESCO1 and CTCF enable formation of long chromatin loops by protecting cohesin^STAG1 from WAPL.

Wutz, Gordana; Ladurner, Rene; St Hilaire, Brian Glenn; Stocsits, Roman R; Nagasaka, Kota; Pignard, Benoit; Sanborn, Adrian; Tang, Wen; Várnai, Csilla; Ivanov, Miroslav P; Schoenfelder, Stefan; van der Lelij, Petra; Huang, Xingfan; Dürnberger, Gerhard; Roitinger, Elisabeth; Mechtler, Karl; Davidson, Iain Finley; Fraser, Peter; Lieberman-Aiden, Erez; Peters, Jan-Michael.

Elife ; 92020 02 17.

Artigo em Inglês | MEDLINE | ID: mdl-32065581

RESUMO

Eukaryotic genomes are folded into loops. It is thought that these are formed by cohesin complexes via extrusion, either until loop expansion is arrested by CTCF or until cohesin is removed from DNA by WAPL. Although WAPL limits cohesin's chromatin residence time to minutes, it has been reported that some loops exist for hours. How these loops can persist is unknown. We show that during G1-phase, mammalian cells contain acetylated cohesinSTAG1 which binds chromatin for hours, whereas cohesinSTAG2 binds chromatin for minutes. Our results indicate that CTCF and the acetyltransferase ESCO1 protect a subset of cohesinSTAG1 complexes from WAPL, thereby enable formation of long and presumably long-lived loops, and that ESCO1, like CTCF, contributes to boundary formation in chromatin looping. Our data are consistent with a model of nested loop extrusion, in which acetylated cohesinSTAG1 forms stable loops between CTCF sites, demarcating the boundaries of more transient cohesinSTAG2 extrusion activity.

Assuntos

Acetiltransferases/fisiologia , Fator de Ligação a CCCTC/fisiologia , Proteínas de Transporte/metabolismo , Proteínas de Ciclo Celular/metabolismo , Cromatina/metabolismo , Proteínas Cromossômicas não Histona/metabolismo , Proteínas Nucleares/metabolismo , Proteínas Proto-Oncogênicas/metabolismo , Acetilação , Proteínas de Transporte/genética , Simulação por Computador , Fase G1 , Genoma Humano , Humanos , Proteínas Nucleares/genética , Ligação Proteica , Proteínas Proto-Oncogênicas/genética , Coesinas

6.

Absolute quantification of cohesin, CTCF and their regulators in human cells.

Holzmann, Johann; Politi, Antonio Z; Nagasaka, Kota; Hantsche-Grininger, Merle; Walther, Nike; Koch, Birgit; Fuchs, Johannes; Dürnberger, Gerhard; Tang, Wen; Ladurner, Rene; Stocsits, Roman R; Busslinger, Georg A; Novák, Béla; Mechtler, Karl; Davidson, Iain Finley; Ellenberg, Jan; Peters, Jan-Michael.

Elife ; 82019 06 17.

Artigo em Inglês | MEDLINE | ID: mdl-31204999

RESUMO

The organisation of mammalian genomes into loops and topologically associating domains (TADs) contributes to chromatin structure, gene expression and recombination. TADs and many loops are formed by cohesin and positioned by CTCF. In proliferating cells, cohesin also mediates sister chromatid cohesion, which is essential for chromosome segregation. Current models of chromatin folding and cohesion are based on assumptions of how many cohesin and CTCF molecules organise the genome. Here we have measured absolute copy numbers and dynamics of cohesin, CTCF, NIPBL, WAPL and sororin by mass spectrometry, fluorescence-correlation spectroscopy and fluorescence recovery after photobleaching in HeLa cells. In G1-phase, there are ~250,000 nuclear cohesin complexes, of which ~ 160,000 are chromatin-bound. Comparison with chromatin immunoprecipitation-sequencing data implies that some genomic cohesin and CTCF enrichment sites are unoccupied in single cells at any one time. We discuss the implications of these findings for how cohesin can contribute to genome organisation and cohesion.

Assuntos

Fator de Ligação a CCCTC/genética , Proteínas de Transporte/genética , Proteínas de Ciclo Celular/genética , Proteínas Cromossômicas não Histona/genética , Dosagem de Genes , Expressão Gênica , Proteínas Nucleares/genética , Proteínas Proto-Oncogênicas/genética , Fator de Ligação a CCCTC/metabolismo , Proteínas de Transporte/metabolismo , Proteínas de Ciclo Celular/metabolismo , Linhagem Celular , Cromátides/genética , Cromatina/genética , Cromatina/metabolismo , Proteínas Cromossômicas não Histona/metabolismo , Segregação de Cromossomos/genética , Recuperação de Fluorescência Após Fotodegradação/métodos , Fase G1/genética , Genoma Humano/genética , Células HeLa , Humanos , Espectrometria de Massas/métodos , Proteínas Nucleares/metabolismo , Proteínas Proto-Oncogênicas/metabolismo , Coesinas

7.

Topologically associating domains and chromatin loops depend on cohesin and are regulated by CTCF, WAPL, and PDS5 proteins.

Wutz, Gordana; Várnai, Csilla; Nagasaka, Kota; Cisneros, David A; Stocsits, Roman R; Tang, Wen; Schoenfelder, Stefan; Jessberger, Gregor; Muhar, Matthias; Hossain, M Julius; Walther, Nike; Koch, Birgit; Kueblbeck, Moritz; Ellenberg, Jan; Zuber, Johannes; Fraser, Peter; Peters, Jan-Michael.

EMBO J ; 36(24): 3573-3599, 2017 12 15.

Artigo em Inglês | MEDLINE | ID: mdl-29217591

RESUMO

Mammalian genomes are spatially organized into compartments, topologically associating domains (TADs), and loops to facilitate gene regulation and other chromosomal functions. How compartments, TADs, and loops are generated is unknown. It has been proposed that cohesin forms TADs and loops by extruding chromatin loops until it encounters CTCF, but direct evidence for this hypothesis is missing. Here, we show that cohesin suppresses compartments but is required for TADs and loops, that CTCF defines their boundaries, and that the cohesin unloading factor WAPL and its PDS5 binding partners control the length of loops. In the absence of WAPL and PDS5 proteins, cohesin forms extended loops, presumably by passing CTCF sites, accumulates in axial chromosomal positions (vermicelli), and condenses chromosomes. Unexpectedly, PDS5 proteins are also required for boundary function. These results show that cohesin has an essential genome-wide function in mediating long-range chromatin interactions and support the hypothesis that cohesin creates these by loop extrusion, until it is delayed by CTCF in a manner dependent on PDS5 proteins, or until it is released from DNA by WAPL.

Assuntos

Fator de Ligação a CCCTC/metabolismo , Proteínas de Transporte/metabolismo , Proteínas de Ciclo Celular/metabolismo , Cromatina/genética , Proteínas Cromossômicas não Histona/metabolismo , Proteínas de Ligação a DNA/metabolismo , Proteínas Nucleares/metabolismo , Proteínas Proto-Oncogênicas/metabolismo , Fatores de Transcrição/metabolismo , Fator de Ligação a CCCTC/genética , Proteínas de Transporte/genética , Proteínas de Ciclo Celular/genética , Proteínas Cromossômicas não Histona/genética , Cromossomos/genética , Proteínas de Ligação a DNA/genética , Genoma Humano/genética , Células HeLa , Humanos , Proteínas Nucleares/genética , Proteínas Proto-Oncogênicas/genética , Fatores de Transcrição/genética , Coesinas

8.

Cohesin is positioned in mammalian genomes by transcription, CTCF and Wapl.

Busslinger, Georg A; Stocsits, Roman R; van der Lelij, Petra; Axelsson, Elin; Tedeschi, Antonio; Galjart, Niels; Peters, Jan-Michael.

Nature ; 544(7651): 503-507, 2017 04 27.

Artigo em Inglês | MEDLINE | ID: mdl-28424523

RESUMO

Mammalian genomes are spatially organized by CCCTC-binding factor (CTCF) and cohesin into chromatin loops and topologically associated domains, which have important roles in gene regulation and recombination. By binding to specific sequences, CTCF defines contact points for cohesin-mediated long-range chromosomal cis-interactions. Cohesin is also present at these sites, but has been proposed to be loaded onto DNA elsewhere and to extrude chromatin loops until it encounters CTCF bound to DNA. How cohesin is recruited to CTCF sites, according to this or other models, is unknown. Here we show that the distribution of cohesin in the mouse genome depends on transcription, CTCF and the cohesin release factor Wings apart-like (Wapl). In CTCF-depleted fibroblasts, cohesin cannot be properly recruited to CTCF sites but instead accumulates at transcription start sites of active genes, where the cohesin-loading complex is located. In the absence of both CTCF and Wapl, cohesin accumulates in up to 70 kilobase-long regions at 3'-ends of active genes, in particular if these converge on each other. Changing gene expression modulates the position of these 'cohesin islands'. These findings indicate that transcription can relocate mammalian cohesin over long distances on DNA, as previously reported for yeast cohesin, that this translocation contributes to positioning cohesin at CTCF sites, and that active genes can be freed from cohesin either by transcription-mediated translocation or by Wapl-mediated release.

Assuntos

Proteínas de Ciclo Celular/metabolismo , Proteínas Cromossômicas não Histona/metabolismo , Cromossomos de Mamíferos/metabolismo , Genoma/genética , Proteínas/metabolismo , Proteínas Repressoras/metabolismo , Transcrição Gênica/genética , Animais , Sítios de Ligação , Fator de Ligação a CCCTC , Proteínas de Ciclo Celular/deficiência , Proteínas de Ciclo Celular/genética , Células Cultivadas , Proteoglicanas de Sulfatos de Condroitina/deficiência , Proteoglicanas de Sulfatos de Condroitina/genética , Cromatina/genética , Cromatina/metabolismo , Proteínas Cromossômicas não Histona/deficiência , Proteínas Cromossômicas não Histona/genética , Cromossomos de Mamíferos/genética , DNA/genética , DNA/metabolismo , Feminino , Fibroblastos/citologia , Fibroblastos/metabolismo , Masculino , Camundongos , Transporte Proteico , Proteínas/genética , Proteínas Repressoras/deficiência , Proteínas Repressoras/genética , Sítio de Iniciação de Transcrição , Coesinas

9.

SNW1 enables sister chromatid cohesion by mediating the splicing of sororin and APC2 pre-mRNAs.

van der Lelij, Petra; Stocsits, Roman R; Ladurner, Rene; Petzold, Georg; Kreidl, Emanuel; Koch, Birgit; Schmitz, Julia; Neumann, Beate; Ellenberg, Jan; Peters, Jan-Michael.

EMBO J ; 33(22): 2643-58, 2014 Nov 18.

Artigo em Inglês | MEDLINE | ID: mdl-25257309

RESUMO

Although splicing is essential for the expression of most eukaryotic genes, inactivation of splicing factors causes specific defects in mitosis. The molecular cause of this defect is unknown. Here, we show that the spliceosome subunits SNW1 and PRPF8 are essential for sister chromatid cohesion in human cells. A transcriptome-wide analysis revealed that SNW1 or PRPF8 depletion affects the splicing of specific introns in a subset of pre-mRNAs, including pre-mRNAs encoding the cohesion protein sororin and the APC/C subunit APC2. SNW1 depletion causes cohesion defects predominantly by reducing sororin levels, which causes destabilisation of cohesin on DNA. SNW1 depletion also reduces APC/C activity and contributes to cohesion defects indirectly by delaying mitosis and causing "cohesion fatigue". Simultaneous expression of sororin and APC2 from intron-less cDNAs restores cohesion in SNW1-depleted cells. These results indicate that the spliceosome is required for mitosis because it enables expression of genes essential for cohesion. Our transcriptome-wide identification of retained introns in SNW1- and PRPF8-depleted cells may help to understand the aetiology of diseases associated with splicing defects, such as retinosa pigmentosum and cancer.

Assuntos

Proteínas Adaptadoras de Transdução de Sinal/metabolismo , Proteínas de Ciclo Celular/metabolismo , Cromátides/metabolismo , Proteínas do Citoesqueleto/metabolismo , Coativadores de Receptor Nuclear/metabolismo , Precursores de RNA/metabolismo , Splicing de RNA/fisiologia , Proteínas Adaptadoras de Transdução de Sinal/genética , Proteínas de Ciclo Celular/genética , Cromátides/genética , Proteínas do Citoesqueleto/genética , Deleção de Genes , Células HeLa , Humanos , Coativadores de Receptor Nuclear/genética , Precursores de RNA/genética , Transcriptoma/fisiologia

10.

H3S28 phosphorylation is a hallmark of the transcriptional response to cellular stress.

Sawicka, Anna; Hartl, Dominik; Goiser, Malgorzata; Pusch, Oliver; Stocsits, Roman R; Tamir, Ido M; Mechtler, Karl; Seiser, Christian.

Genome Res ; 24(11): 1808-20, 2014 Nov.

Artigo em Inglês | MEDLINE | ID: mdl-25135956

RESUMO

The selectivity of transcriptional responses to extracellular cues is reflected by the deposition of stimulus-specific chromatin marks. Although histone H3 phosphorylation is a target of numerous signaling pathways, its role in transcriptional regulation remains poorly understood. Here, for the first time, we report a genome-wide analysis of H3S28 phosphorylation in a mammalian system in the context of stress signaling. We found that this mark targets as many as 50% of all stress-induced genes, underlining its importance in signal-induced transcription. By combining ChIP-seq, RNA-seq, and mass spectrometry we identified the factors involved in the biological interpretation of this histone modification. We found that MSK1/2-mediated phosphorylation of H3S28 at stress-responsive promoters contributes to the dissociation of HDAC corepressor complexes and thereby to enhanced local histone acetylation and subsequent transcriptional activation of stress-induced genes. Our data reveal a novel function of the H3S28ph mark in the activation of mammalian genes in response to MAP kinase pathway activation.

Assuntos

Histonas/metabolismo , Serina/metabolismo , Estresse Fisiológico/genética , Ativação Transcricional , Células 3T3 , Acetilação , Animais , Imunoprecipitação da Cromatina , Fibroblastos/citologia , Fibroblastos/metabolismo , Perfilação da Expressão Gênica , Ontologia Genética , Estudo de Associação Genômica Ampla , Células HeLa , Sequenciamento de Nucleotídeos em Larga Escala , Histona Desacetilases/genética , Histona Desacetilases/metabolismo , Humanos , Sistema de Sinalização das MAP Quinases/genética , Camundongos , Análise de Sequência com Séries de Oligonucleotídeos , Fosforilação , Regiões Promotoras Genéticas/genética , Proteínas Quinases S6 Ribossômicas 90-kDa/genética , Proteínas Quinases S6 Ribossômicas 90-kDa/metabolismo

11.

Airn transcriptional overlap, but not its lncRNA products, induces imprinted Igf2r silencing.

Latos, Paulina A; Pauler, Florian M; Koerner, Martha V; Senergin, H Basak; Hudson, Quanah J; Stocsits, Roman R; Allhoff, Wolfgang; Stricker, Stefan H; Klement, Ruth M; Warczok, Katarzyna E; Aumayr, Karin; Pasierbek, Pawel; Barlow, Denise P.

Science ; 338(6113): 1469-72, 2012 Dec 14.

Artigo em Inglês | MEDLINE | ID: mdl-23239737

RESUMO

Mammalian imprinted genes often cluster with long noncoding (lnc) RNAs. Three lncRNAs that induce parental-specific silencing show hallmarks indicating that their transcription is more important than their product. To test whether Airn transcription or product silences the Igf2r gene, we shortened the endogenous lncRNA to different lengths. The results excluded a role for spliced and unspliced Airn lncRNA products and for Airn nuclear size and location in silencing Igf2r. Instead, silencing only required Airn transcriptional overlap of the Igf2r promoter, which interferes with RNA polymerase II recruitment in the absence of repressive chromatin. Such a repressor function for lncRNA transcriptional overlap reveals a gene silencing mechanism that may be widespread in the mammalian genome, given the abundance of lncRNA transcripts.

Assuntos

Inativação Gênica , Impressão Genômica , RNA Longo não Codificante/metabolismo , Receptor IGF Tipo 2/genética , Transcrição Gênica , Processamento Alternativo , Animais , Células Cultivadas , Camundongos , Família Multigênica , Regiões Promotoras Genéticas , RNA Polimerase II/metabolismo , RNA Longo não Codificante/genética

12.

The impact of rRNA secondary structure consideration in alignment and tree reconstruction: simulated data and a case study on the phylogeny of hexapods.

Letsch, Harald O; Kück, Patrick; Stocsits, Roman R; Misof, Bernhard.

Mol Biol Evol ; 27(11): 2507-21, 2010 Nov.

Artigo em Inglês | MEDLINE | ID: mdl-20530152

RESUMO

The use of secondary structures has been advocated to improve both the alignment and the tree reconstruction processes of ribosomal RNA (rRNA) data sets. We used simulated and empirical rRNA data to test the impact of secondary structure consideration in both steps of molecular phylogenetic analyses. A simulation approach was used to generate realistic rRNA data sets based on real 16S, 18S, and 28S sequences and structures in combination with different branch length and topologies. Alignment and tree reconstruction performance of four recent structural alignment methods was compared with exclusively sequence-based approaches. As empirical data, we used a hexapod rRNA data set to study the influence of nucleotide interdependencies in sequence alignment and tree reconstruction. Structural alignment methods delivered significantly better sequence alignments compared with pure sequence-based methods. Also, structural alignment methods delivered better trees judged by topological congruence to simulation base trees. However, the advantage of structural alignments was less pronounced and even vanished in several instances. For simulated data, application of mixed RNA/DNA models to stems and loops, respectively, led to significantly shorter branches. The application of mixed RNA/DNA models in the hexapod analyses delivered partly implausible relationships. This can be interpreted as a stronger sensitivity of mixed model setups to nonphylogenetic signal. Secondary structure consideration clearly influenced sequence alignment and tree reconstruction of ribosomal genes. Although sequence alignment quality can considerably be improved by the use of secondary structure information, the application of mixed models in tree reconstructions needs further studies to understand the observed effects.

Assuntos

Artrópodes/genética , Simulação por Computador , Conformação de Ácido Nucleico , Filogenia , RNA Ribossômico/química , Alinhamento de Sequência/métodos , Animais , Teorema de Bayes , RNA Ribossômico/genética

13.

Accurate and efficient reconstruction of deep phylogenies from structured RNAs.

Stocsits, Roman R; Letsch, Harald; Hertel, Jana; Misof, Bernhard; Stadler, Peter F.

Nucleic Acids Res ; 37(18): 6184-93, 2009 Oct.

Artigo em Inglês | MEDLINE | ID: mdl-19723687

RESUMO

Ribosomal RNA (rRNA) genes are probably the most frequently used data source in phylogenetic reconstruction. Individual columns of rRNA alignments are not independent as a consequence of their highly conserved secondary structures. Unless explicitly taken into account, these correlation can distort the phylogenetic signal and/or lead to gross overestimates of tree stability. Maximum likelihood and Bayesian approaches are of course amenable to using RNA-specific substitution models that treat conserved base pairs appropriately, but require accurate secondary structure models as input. So far, however, no accurate and easy-to-use tool has been available for computing structure-aware alignments and consensus structures that can deal with the large rRNAs. The RNAsalsa approach is designed to fill this gap. Capitalizing on the improved accuracy of pairwise consensus structures and informed by a priori knowledge of group-specific structural constraints, the tool provides both alignments and consensus structures that are of sufficient accuracy for routine phylogenetic analysis based on RNA-specific substitution models. The power of the approach is demonstrated using two rRNA data sets: a mitochondrial rRNA set of 26 Mammalia, and a collection of 28S nuclear rRNAs representative of the five major echinoderm groups.

Assuntos

Filogenia , RNA Ribossômico/classificação , Animais , Sequência de Bases , Equinodermos/genética , Conformação de Ácido Nucleico , Primatas/genética , RNA Ribossômico/química , Alinhamento de Sequência , Software

14.

Simultaneous alignment and folding of 28S rRNA sequences uncovers phylogenetic signal in structure variation.

Letsch, Harald O; Greve, Carola; Kück, Patrick; Fleck, Günther; Stocsits, Roman R; Misof, Bernhard.

Mol Phylogenet Evol ; 53(3): 758-71, 2009 Dec.

Artigo em Inglês | MEDLINE | ID: mdl-19654047

RESUMO

Secondary structure models of mitochondrial and nuclear (r)RNA sequences are frequently applied to aid the alignment of these molecules in phylogenetic analyses. Additionally, it is often speculated that structure variation of (r)RNA sequences might profitably be used as phylogenetic markers. The benefit of these approaches depends on the reliability of structure models. We used a recently developed approach to show that reliable inference of large (r)RNA secondary structures as a prerequisite of simultaneous sequence and structure alignment is feasible. The approach iteratively establishes local structure constraints of each sequence and infers fully folded individual structures by constrained MFE optimization. A comparison of structure edit distances of individual constraints and fully folded structures showed pronounced phylogenetic signal in fully folded structures. As model sequences we characterized secondary structures of 28S rRNA sequences of selected insects and examined their phylogenetic signal according to established phylogenetic hypotheses.

Assuntos

Conformação de Ácido Nucleico , Filogenia , RNA Ribossômico 28S/genética , Animais , Genes de RNAr , Insetos/genética , Alinhamento de Sequência , Análise de Sequência de RNA

15.

Can comprehensive background knowledge be incorporated into substitution models to improve phylogenetic analyses? A case study on major arthropod relationships.

von Reumont, Björn M; Meusemann, Karen; Szucsich, Nikolaus U; Dell'Ampio, Emiliano; Gowri-Shankar, Vivek; Bartel, Daniela; Simon, Sabrina; Letsch, Harald O; Stocsits, Roman R; Luan, Yun-xia; Wägele, Johann Wolfgang; Pass, Günther; Hadrys, Heike; Misof, Bernhard.

BMC Evol Biol ; 9: 119, 2009 May 27.

Artigo em Inglês | MEDLINE | ID: mdl-19473484

RESUMO

BACKGROUND: Whenever different data sets arrive at conflicting phylogenetic hypotheses, only testable causal explanations of sources of errors in at least one of the data sets allow us to critically choose among the conflicting hypotheses of relationships. The large (28S) and small (18S) subunit rRNAs are among the most popular markers for studies of deep phylogenies. However, some nodes supported by this data are suspected of being artifacts caused by peculiarities of the evolution of these molecules. Arthropod phylogeny is an especially controversial subject dotted with conflicting hypotheses which are dependent on data set and method of reconstruction. We assume that phylogenetic analyses based on these genes can be improved further i) by enlarging the taxon sample and ii) employing more realistic models of sequence evolution incorporating non-stationary substitution processes and iii) considering covariation and pairing of sites in rRNA-genes. RESULTS: We analyzed a large set of arthropod sequences, applied new tools for quality control of data prior to tree reconstruction, and increased the biological realism of substitution models. Although the split-decomposition network indicated a high noise content in the data set, our measures were able to both improve the analyses and give causal explanations for some incongruities mentioned from analyses of rRNA sequences. However, misleading effects did not completely disappear. CONCLUSION: Analyses of data sets that result in ambiguous phylogenetic hypotheses demand for methods, which do not only filter stochastic noise, but likewise allow to differentiate phylogenetic signal from systematic biases. Such methods can only rely on our findings regarding the evolution of the analyzed data. Analyses on independent data sets then are crucial to test the plausibility of the results. Our approach can easily be extended to genomic data, as well, whereby layers of quality assessment are set up applicable to phylogenetic reconstructions in general.

Assuntos

Artrópodes/genética , Evolução Molecular , Modelos Genéticos , Filogenia , Animais , Conformação de Ácido Nucleico , RNA Ribossômico 18S/genética , RNA Ribossômico 28S/genética , Alinhamento de Sequência , Análise de Sequência de RNA/métodos

16.

Multiple sequence alignments of partially coding nucleic acid sequences.

Stocsits, Roman R; Hofacker, Ivo L; Fried, Claudia; Stadler, Peter F.

BMC Bioinformatics ; 6: 160, 2005 Jun 28.

Artigo em Inglês | MEDLINE | ID: mdl-15985156

RESUMO

BACKGROUND: High quality sequence alignments of RNA and DNA sequences are an important prerequisite for the comparative analysis of genomic sequence data. Nucleic acid sequences, however, exhibit a much larger sequence heterogeneity compared to their encoded protein sequences due to the redundancy of the genetic code. It is desirable, therefore, to make use of the amino acid sequence when aligning coding nucleic acid sequences. In many cases, however, only a part of the sequence of interest is translated. On the other hand, overlapping reading frames may encode multiple alternative proteins, possibly with intermittent non-coding parts. Examples are, in particular, RNA virus genomes. RESULTS: The standard scoring scheme for nucleic acid alignments can be extended to incorporate simultaneously information on translation products in one or more reading frames. Here we present a multiple alignment tool, codaln, that implements a combined nucleic acid plus amino acid scoring model for pairwise and progressive multiple alignments that allows arbitrary weighting for almost all scoring parameters. Resource requirements of codaln are comparable with those of standard tools such as ClustalW. CONCLUSION: We demonstrate the applicability of codaln to various biologically relevant types of sequences (bacteriophage Levivirus and Vertebrate Hox clusters) and show that the combination of nucleic acid and amino acid sequence information leads to improved alignments. These, in turn, increase the performance of analysis tools that depend strictly on good input alignments such as methods for detecting conserved RNA secondary structure elements.

Assuntos

Algoritmos , Modelos Moleculares , Alinhamento de Sequência/métodos , Análise de Sequência de DNA/métodos , Sequência de Aminoácidos , Códon , Sequência Conservada , Proteínas de Homeodomínio/química , Levivirus/genética , Fases de Leitura Aberta , Estrutura Secundária de Proteína , RNA/química , Reprodutibilidade dos Testes , Homologia de Sequência do Ácido Nucleico

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA