Results 1 - 20 of 38
1.
Proc Natl Acad Sci U S A ; 120(10): e2220828120, 2023 03 07.
Article in English | MEDLINE | ID: mdl-36848551

ABSTRACT

Trypanosomatid pathogens are transmitted by blood-feeding insects, causing devastating human infections. These parasites show important phenotypic shifts that often impact parasite pathogenicity, tissue tropism, or drug susceptibility. The evolutionary mechanisms that allow for the selection of such adaptive phenotypes remain only poorly investigated. Here, we use Leishmania donovani as a trypanosomatid model pathogen to assess parasite evolutionary adaptation during experimental sand fly infection. Comparing the genome of the parasites before and after sand fly infection revealed a strong population bottleneck effect as judged by allele frequency analysis. Apart from random genetic drift caused by the bottleneck effect, our analyses revealed haplotype and allelic changes during sand fly infection that seem under natural selection given their convergence between independent biological replicates. Our analyses further uncovered signature mutations of oxidative DNA damage in the parasite genomes after sand fly infection, suggesting that Leishmania suffers from oxidative stress inside the insect digestive tract. Our results propose a model of Leishmania genomic adaptation during sand fly infection, with oxidative DNA damage and DNA repair processes likely driving haplotype and allelic selection. The experimental and computational framework presented here provides a useful blueprint to assess evolutionary adaptation of other eukaryotic pathogens inside their insect vectors, such as Plasmodium spp., Trypanosoma brucei, and Trypanosoma cruzi.


Subjects
Leishmania donovani , Psychodidae , Humans , Animals , Oxidative Stress/genetics , DNA Repair/genetics , Mutation
2.
Cell ; 143(1): 46-58, 2010 Oct 01.
Article in English | MEDLINE | ID: mdl-20887892

ABSTRACT

While the long noncoding RNAs (ncRNAs) constitute a large portion of the mammalian transcriptome, their biological functions have remained elusive. A few long ncRNAs that have been studied in any detail silence gene expression in processes such as X-inactivation and imprinting. We used a GENCODE annotation of the human genome to characterize over a thousand long ncRNAs that are expressed in multiple cell lines. Unexpectedly, we found an enhancer-like function for a set of these long ncRNAs in human cell lines. Depletion of a number of ncRNAs led to decreased expression of their neighboring protein-coding genes, including the master regulator of hematopoiesis, SCL (also called TAL1), Snai1 and Snai2. Using heterologous transcription assays we demonstrated a requirement for the ncRNAs in activation of gene expression. These results reveal an unanticipated role for a class of long ncRNAs in activation of critical regulators of development and differentiation.


Subjects
Genetic Enhancer Elements , Human Genome , Untranslated RNA/metabolism , Cell Line , Tumor Cell Line , Cultured Cells , Humans , Messenger RNA/genetics , Snail Family Transcription Factors , Transcription Factors/genetics , Transcriptional Activation
3.
PLoS Pathog ; 18(3): e1010375, 2022 03.
Article in English | MEDLINE | ID: mdl-35294501

ABSTRACT

The protozoan parasite Leishmania donovani causes fatal human visceral leishmaniasis in the absence of treatment. Genome instability has been recognized as a driver in Leishmania fitness gain in response to environmental change or chemotherapy. How genome instability generates beneficial phenotypes despite potential deleterious gene dosage effects is unknown. Here we address this important open question applying experimental evolution and integrative systems approaches on parasites adapting to in vitro culture. Phenotypic analyses of parasites from early and late stages of culture adaptation revealed an important fitness tradeoff, with selection for accelerated growth in promastigote culture (fitness gain) impairing infectivity (fitness costs). Comparative genomics, transcriptomics and proteomics analyses revealed a complex regulatory network associated with parasite fitness gain, with genome instability causing highly reproducible, gene dosage-independent and -dependent changes. Reduction of flagellar transcripts and increase in coding and non-coding RNAs implicated in ribosomal biogenesis and protein translation were not correlated to dosage changes of the corresponding genes, revealing a gene dosage-independent, post-transcriptional mechanism of regulation. In contrast, abundance of gene products implicated in post-transcriptional regulation itself correlated to corresponding gene dosage changes. Thus, RNA abundance during parasite adaptation is controlled by direct and indirect gene dosage changes. We correlated differential expression of small nucleolar RNAs (snoRNAs) with changes in rRNA modification, providing the first evidence that Leishmania fitness gain in culture may be controlled by post-transcriptional and epitranscriptomic regulation.
Our findings propose a novel model for Leishmania fitness gain in culture, where differential regulation of mRNA stability and the generation of modified ribosomes may potentially filter deleterious from beneficial gene dosage effects and provide proteomic robustness to genetically heterogenous, adapting parasite populations. This model challenges the current, genome-centric approach to Leishmania epidemiology and identifies the Leishmania transcriptome and non-coding small RNome as potential novel sources for the discovery of biomarkers that may be associated with parasite phenotypic adaptation in clinical settings.


Subjects
Leishmania donovani , Visceral Leishmaniasis , Gene Expression Regulation , Genomic Instability , Humans , Leishmania donovani/genetics , Visceral Leishmaniasis/parasitology , Proteomics
4.
Nucleic Acids Res ; 50(6): e36, 2022 04 08.
Article in English | MEDLINE | ID: mdl-34928370

ABSTRACT

Genome instability has been recognized as a key driver for microbial and cancer adaptation and thus plays a central role in many diseases. Genome instability encompasses different types of genomic alterations, yet most available genome analysis software is limited to just one type of mutation. To overcome this limitation and better understand the role of genetic changes in enhancing pathogenicity we established GIP, a novel, powerful bioinformatic pipeline for comparative genome analysis. Here, we show its application to whole genome sequencing datasets of Leishmania, Plasmodium, Candida and cancer. Applying GIP on available data sets validated our pipeline and demonstrated the power of our tool to drive biological discovery. Applied to Plasmodium vivax genomes, our pipeline uncovered the convergent amplification of erythrocyte binding proteins and identified a nullisomic strain. Re-analyzing genomes of drug adapted Candida albicans strains revealed correlated copy number variations of functionally related genes, strongly supporting a mechanism of epistatic adaptation through interacting gene-dosage changes. Our results illustrate how GIP can be used for the identification of aneuploidy, gene copy number variations, changes in nucleic acid sequences, and chromosomal rearrangements. Altogether, GIP can shed light on the genetic bases of cell adaptation and drive disease biomarker discovery.


Subjects
Computational Biology/methods , DNA Copy Number Variations , Genomic Instability , DNA Copy Number Variations/genetics , Gene Dosage , Humans , Neoplasms/genetics
5.
Proc Natl Acad Sci U S A ; 118(51), 2021 12 21.
Article in English | MEDLINE | ID: mdl-34903666

ABSTRACT

How genome instability is harnessed for fitness gain despite its potential deleterious effects is largely elusive. An ideal system to address this important open question is provided by the protozoan pathogen Leishmania, which exploits frequent variations in chromosome and gene copy number to regulate expression levels. Using ecological genomics and experimental evolution approaches, we provide evidence that Leishmania adaptation relies on epistatic interactions between functionally associated gene copy number variations in pathways driving fitness gain in a given environment. We further uncover posttranscriptional regulation as a key mechanism that compensates for deleterious gene dosage effects and provides phenotypic robustness to genetically heterogenous parasite populations. Finally, we correlate dynamic variations in small nucleolar RNA (snoRNA) gene dosage with changes in ribosomal RNA 2'-O-methylation and pseudouridylation, suggesting translational control as an additional layer of parasite adaptation. Leishmania genome instability is thus harnessed for fitness gain by genome-dependent variations in gene expression and genome-independent compensatory mechanisms. This allows for polyclonal adaptation and maintenance of genetic heterogeneity despite strong selective pressure. The epistatic adaptation described here needs to be considered in Leishmania epidemiology and biomarker discovery and may be relevant to other fast-evolving eukaryotic cells that exploit genome instability for adaptation, such as fungal pathogens or cancer.


Subjects
Physiological Adaptation/genetics , Genetic Epistasis , Protozoan Genome , Genomic Instability , Leishmania/genetics , Gene Dosage , Genetic Fitness , Humans , Leishmaniasis/parasitology
6.
PLoS Pathog ; 17(12): e1009087, 2021 12.
Article in English | MEDLINE | ID: mdl-34855911

ABSTRACT

Pathogenic Leptospira are the causative agents of leptospirosis, the most widespread zoonotic infectious disease. Leptospirosis is a potentially severe and life-threatening emerging disease with highest burden in sub-tropical areas and impoverished populations. Mechanisms allowing pathogenic Leptospira to survive inside a host and induce acute leptospirosis are not fully understood. The ability to resist deadly oxidants produced by the host during infection is pivotal for Leptospira virulence. We have previously shown that genes encoding defenses against oxidants in L. interrogans are repressed by PerRA (encoded by LIMLP_10155), a peroxide stress regulator of the Fur family. In this study, we describe the identification and characterization of another putative PerR-like regulator (LIMLP_05620) in L. interrogans. Protein sequence and phylogenetic analyses indicated that LIMLP_05620 displayed all the canonical PerR amino acid residues and is restricted to pathogenic Leptospira clades. We therefore named this PerR-like regulator PerRB. In L. interrogans, the PerRB regulon is distinct from that of PerRA. While a perRA mutant had a greater tolerance to peroxide, inactivating perRB led to a higher tolerance to superoxide, suggesting that these two regulators have a distinct function in the adaptation of L. interrogans to oxidative stress. The concomitant inactivation of perRA and perRB resulted in a higher tolerance to both peroxide and superoxide and, unlike the single mutants, a double perRAperRB mutant was avirulent. Interestingly, this correlated with major changes in gene and non-coding RNA expression. Notably, several virulence-associated genes (clpB, ligA/B, and lvrAB) were repressed. 
By obtaining a double mutant in a pathogenic Leptospira strain, our study has uncovered an interplay of two PerRs in the adaptation of Leptospira to oxidative stress with a putative role in virulence and pathogenicity, most likely through the transcriptional control of a complex regulatory network.


Subjects
Bacterial Proteins/metabolism , Gene Regulatory Networks/genetics , Leptospira/genetics , Leptospirosis/microbiology , Physiological Adaptation , Amino Acid Sequence , Bacterial Proteins/genetics , Bacterial Gene Expression Regulation , Leptospira/pathogenicity , Leptospira/physiology , Molecular Models , Mutation , Oxidative Stress , Phylogeny , Regulon/genetics , Repressor Proteins/genetics , Repressor Proteins/metabolism , Sequence Alignment , Virulence
7.
PLoS Pathog ; 17(8): e1009875, 2021 08.
Article in English | MEDLINE | ID: mdl-34432858

ABSTRACT

Development of cervical cancer is directly associated with integration of human papillomavirus (HPV) genomes into host chromosomes and subsequent modulation of HPV oncogene expression, which correlates with multi-layered epigenetic changes at the integrated HPV genomes. However, the process of integration itself and dysregulation of host gene expression at sites of integration in our model of HPV16 integrant clone natural selection has remained enigmatic. We now show, using a state-of-the-art 'HPV integrated site capture' (HISC) technique, that integration likely occurs through microhomology-mediated repair (MHMR) mechanisms via either a direct process, resulting in host sequence deletion (in our case, partially homozygously) or via a 'looping' mechanism by which flanking host regions become amplified. Furthermore, using our 'HPV16-specific Region Capture Hi-C' technique, we have determined that chromatin interactions between the integrated virus genome and host chromosomes, both at short- (<500 kbp) and long-range (>500 kbp), appear to drive local host gene dysregulation through the disruption of host:host interactions within (but not exceeding) host structures known as topologically associating domains (TADs). This mechanism of HPV-induced host gene expression modulation indicates that integration of virus genomes near to or within a 'cancer-causing gene' is not essential to influence their expression and that these modifications to genome interactions could have a major role in selection of HPV integrants at the early stage of cervical neoplastic progression.


Subjects
Carcinogenesis/pathology , Chromatin/metabolism , Viral Genome , Human Papillomavirus 16/isolation & purification , Papillomavirus Infections/complications , Uterine Cervical Neoplasms/pathology , Virus Integration , Carcinogenesis/metabolism , Chromatin/genetics , Genetic Epigenesis , Female , Humans , Cultured Tumor Cells , Uterine Cervical Neoplasms/genetics , Uterine Cervical Neoplasms/metabolism , Uterine Cervical Neoplasms/virology
8.
PLoS Pathog ; 16(10): e1008904, 2020 10.
Article in English | MEDLINE | ID: mdl-33021995

ABSTRACT

Pathogenic Leptospira spp. are the causative agents of the waterborne zoonotic disease leptospirosis. Leptospira are challenged by numerous adverse conditions, including deadly reactive oxygen species (ROS), when infecting their hosts. Withstanding ROS produced by the host innate immunity is an important strategy evolved by pathogenic Leptospira for persisting in and colonizing hosts. In L. interrogans, genes encoding defenses against ROS are repressed by the peroxide stress regulator, PerR. In this study, RNA sequencing was performed to characterize both the L. interrogans response to low and high concentrations of hydrogen peroxide and the PerR regulon. We showed that Leptospira solicit three main peroxidase machineries (catalase, cytochrome C peroxidase and peroxiredoxin) and heme to detoxify oxidants produced during peroxide stress. In addition, canonical molecular chaperones of the heat shock response and DNA repair proteins from the SOS response were required for Leptospira recovering from oxidative damage. Identification of the PerR regulon upon exposure to H2O2 allowed us to define the contribution of this regulator to the oxidative stress response. This study has revealed a PerR-independent regulatory network involving other transcriptional regulators, two-component systems and sigma factors as well as non-coding RNAs that putatively orchestrate, in concert with PerR, the oxidative stress response. We have shown that PerR-regulated genes encoding a TonB-dependent transporter and a two-component system (VicKR) are involved in Leptospira tolerance to superoxide. This could represent the first defense mechanism against superoxide in L. interrogans, a bacterium lacking canonical superoxide dismutase. Our findings provide an insight into the mechanisms required by pathogenic Leptospira to overcome oxidative damage during infection-related conditions.
This will help frame future hypothesis-driven studies to identify and decipher novel virulence mechanisms in this life-threatening pathogen.


Subjects
Hydrogen Peroxide/pharmacology , Leptospira/pathogenicity , Oxidative Stress/drug effects , Peroxides/metabolism , Bacterial Proteins/drug effects , Bacterial Proteins/metabolism , Bacterial Gene Expression Regulation/drug effects , Bacterial Gene Expression Regulation/genetics , Iron/metabolism , Leptospira/drug effects , Leptospira interrogans/drug effects , Leptospira interrogans/genetics , Leptospirosis/genetics , Molecular Chaperones/metabolism , Oxidative Stress/physiology , Virulence/drug effects , Virulence/physiology
9.
Nature ; 515(7527): 355-64, 2014 Nov 20.
Article in English | MEDLINE | ID: mdl-25409824

ABSTRACT

The laboratory mouse shares the majority of its protein-coding genes with humans, making it the premier model organism in biomedical research, yet the two mammals differ in significant ways. To gain greater insights into both shared and species-specific transcriptional and cellular regulatory programs in the mouse, the Mouse ENCODE Consortium has mapped transcription, DNase I hypersensitivity, transcription factor binding, chromatin modifications and replication domains throughout the mouse genome in diverse cell and tissue types. By comparing with the human genome, we not only confirm substantial conservation in the newly annotated potential functional sequences, but also find a large degree of divergence of sequences involved in transcriptional regulation, chromatin state and higher order chromatin organization. Our results illuminate the wide range of evolutionary forces acting on genes and their regulatory regions, and provide a general resource for research into mammalian biology and mechanisms of human diseases.


Subjects
Genome/genetics , Genomics , Mice/genetics , Molecular Sequence Annotation , Animals , Cell Lineage/genetics , Chromatin/genetics , Chromatin/metabolism , Conserved Sequence/genetics , DNA Replication/genetics , Deoxyribonuclease I/metabolism , Gene Expression Regulation/genetics , Gene Regulatory Networks/genetics , Genome-Wide Association Study , Humans , RNA/genetics , Nucleic Acid Regulatory Sequences/genetics , Species Specificity , Transcription Factors/metabolism , Transcriptome/genetics
10.
Genome Res ; 26(5): 705-16, 2016 05.
Article in English | MEDLINE | ID: mdl-27197243

ABSTRACT

Targeted RNA sequencing (CaptureSeq) uses oligonucleotide probes to capture RNAs for sequencing, providing enriched read coverage, accurate measurement of gene expression, and quantitative expression data. We applied CaptureSeq to refine transcript annotations in the current murine GRCm38 assembly. More than 23,000 regions corresponding to putative or annotated long noncoding RNAs (lncRNAs) and 154,281 known splicing junction sites were selected for targeted sequencing across five mouse tissues and three brain subregions. The results illustrate that the mouse transcriptome is considerably more complex than previously thought. We assemble more complete transcript isoforms than GENCODE, expand transcript boundaries, and connect interspersed islands of mapped reads. We describe a novel filtering pipeline that identifies previously unannotated but high-quality transcript isoforms. In this set, 911 GENCODE neighboring genes are condensed into 400 expanded gene models. Additionally, 594 GENCODE lncRNAs acquire an open reading frame (ORF) when their structure is extended with CaptureSeq. Finally, we validate our observations using current FANTOM and Mouse ENCODE resources.


Subjects
Gene Expression Profiling/methods , High-Throughput Nucleotide Sequencing/methods , Long Noncoding RNA/biosynthesis , Long Noncoding RNA/genetics , Transcriptome , Animals , Mice
11.
EMBO Rep ; 18(7): 1231-1247, 2017 07.
Article in English | MEDLINE | ID: mdl-28500258

ABSTRACT

Spermatogenesis is associated with major and unique changes to chromosomes and chromatin. Here, we sought to understand the impact of these changes on spermatogenic transcriptomes. We show that long terminal repeats (LTRs) of specific mouse endogenous retroviruses (ERVs) drive the expression of many long non-coding transcripts (lncRNA). This process occurs post-mitotically predominantly in spermatocytes and round spermatids. We demonstrate that this transposon-driven lncRNA expression is a conserved feature of vertebrate spermatogenesis. We propose that transposon promoters are a mechanism by which the genome can explore novel transcriptional substrates, increasing evolutionary plasticity and allowing for the genesis of novel coding and non-coding genes. Accordingly, we show that a small fraction of these novel ERV-driven transcripts encode short open reading frames that produce detectable peptides. Finally, we find that distinct ERV elements from the same subfamilies act as differentially activated promoters in a tissue-specific context. In summary, we demonstrate that LTRs can act as tissue-specific promoters and contribute to post-mitotic spermatogenic transcriptome diversity.


Subjects
DNA Transposable Elements , Molecular Evolution , Spermatogenesis , Genetic Transcription , Animals , Endogenous Retroviruses/genetics , Genomics , Male , Mice , Open Reading Frames , Genetic Promoter Regions , Long Noncoding RNA/genetics , Spermatocytes/physiology , Terminal Repeat Sequences , Transcriptome
12.
Nat Methods ; 12(4): 339-42, 2015 Apr.
Article in English | MEDLINE | ID: mdl-25751143

ABSTRACT

We compared quantitative RT-PCR (qRT-PCR), RNA-seq and capture sequencing (CaptureSeq) in terms of their ability to assemble and quantify long noncoding RNAs and novel coding exons across 20 human tissues. CaptureSeq was superior for the detection and quantification of genes with low expression, showed little technical variation and accurately measured differential expression. This approach expands and refines previous annotations and simultaneously generates an expression atlas.


Subjects
Gene Expression Profiling , Long Noncoding RNA/genetics , RNA/genetics , Sequence Analysis/methods , Humans , K562 Cells , Polymerase Chain Reaction , RNA/blood , RNA/chemistry
13.
Brief Bioinform ; 17(6): 1009-1023, 2016 11.
Article in English | MEDLINE | ID: mdl-26615024

ABSTRACT

This review provides an overview of the development of multiple sequence alignment (MSA) methods and their main applications. It is focused on progress made over the past decade. The first three sections review recent algorithmic developments for protein, RNA/DNA and genomic alignments. The fourth section deals with benchmarks and explores the relationship between empirical and simulated data, along with the impact on method developments. The last part of the review gives an overview of available MSA local reliability estimators and their dependence on various algorithmic properties of available methods.


Subjects
Sequence Alignment , Algorithms , DNA , Genomics , Proteins , Reproducibility of Results
14.
Genes Dev ; 24(2): 129-34, 2010 Jan 15.
Article in English | MEDLINE | ID: mdl-20080951

ABSTRACT

Cytoplasmic polyadenylation is a widespread mechanism to regulate mRNA translation that requires two sequences in the 3' untranslated region (UTR) of vertebrate substrates: the polyadenylation hexanucleotide, and the cytoplasmic polyadenylation element (CPE). Using a cell-free Drosophila system, we show that these signals are not relevant for Toll polyadenylation but, instead, a "polyadenylation region" (PR) is necessary. Competition experiments indicate that PR-mediated polyadenylation is required for viability and is mechanistically distinct from the CPE/hexanucleotide-mediated process. These data indicate that Toll mRNA is polyadenylated by a noncanonical mechanism, and suggest that a novel machinery functions for cytoplasmic polyadenylation during Drosophila embryogenesis.


Subjects
Cytoplasm/metabolism , Drosophila melanogaster/embryology , Polyadenylation/physiology , 3' Untranslated Regions , Animals , Drosophila Proteins/metabolism , Signal Transduction , Toll-Like Receptors/metabolism
15.
Nucleic Acids Res ; 42(Web Server issue): W356-60, 2014 Jul.
Article in English | MEDLINE | ID: mdl-24972831

ABSTRACT

This article introduces the SARA-Coffee web server, a service allowing the online computation of 3D structure-based multiple RNA sequence alignments. The server makes it possible to combine sequences with and without known 3D structures. Given a set of sequences, SARA-Coffee outputs a multiple sequence alignment along with a reliability index for every sequence, column and aligned residue. SARA-Coffee combines SARA, a pairwise structural RNA aligner, with the R-Coffee multiple RNA aligner in a way that has been shown to improve alignment accuracy over most sequence aligners when enough structural data is available. The server can be accessed from http://tcoffee.crg.cat/apps/tcoffee/do:saracoffee.


Subjects
RNA/chemistry , Sequence Alignment/methods , RNA Sequence Analysis/methods , Software , Algorithms , Internet , Nucleic Acid Conformation
16.
J Mol Cell Cardiol ; 89(Pt A): 98-112, 2015 Dec.
Article in English | MEDLINE | ID: mdl-26423156

ABSTRACT

Long noncoding RNAs (lncRNAs) are emerging as important regulators of developmental pathways. However, their roles in human cardiac precursor cells (CPCs) remain unexplored. To characterize the long noncoding transcriptome during human CPC cardiac differentiation, we profiled the lncRNA transcriptome in CPCs isolated from the human fetal heart and identified 570 lncRNAs that were modulated during cardiac differentiation. Many of these were associated with active cardiac enhancers and super enhancers (SE), with their expression being correlated with proximal cardiac genes. One of the most upregulated lncRNAs was a SE-associated lncRNA that was named CARMEN, (CAR)diac (M)esoderm (E)nhancer-associated (N)oncoding RNA. CARMEN exhibits RNA-dependent enhancing activity and is upstream of the cardiac mesoderm-specifying gene regulatory network. Interestingly, CARMEN interacts with SUZ12 and EZH2, two components of the polycomb repressive complex 2 (PRC2). We demonstrate that CARMEN knockdown inhibits cardiac specification and differentiation in cardiac precursor cells independently of MIR-143 and -145 expression, two microRNAs located proximal to the enhancer sequences. Importantly, CARMEN expression was activated during pathological remodeling in the mouse and human hearts, and was necessary for maintaining cardiac identity in differentiated cardiomyocytes. This study therefore demonstrates that CARMEN is a crucial regulator of cardiac cell differentiation and homeostasis.


Subjects
Body Patterning/genetics , Cell Differentiation/genetics , Heart/embryology , Homeostasis/genetics , Long Noncoding RNA/metabolism , Animals , Cell Lineage/genetics , Genetic Enhancer Elements/genetics , Enhancer of Zeste Homolog 2 Protein , Gene Expression Profiling , Developmental Gene Expression Regulation , Gene Knockdown Techniques , Humans , Mice , Myocardium/pathology , Polycomb Repressive Complex 2/metabolism , Long Noncoding RNA/genetics , Stem Cells/cytology , Transcriptome/genetics
17.
Genome Res ; 22(9): 1775-89, 2012 Sep.
Article in English | MEDLINE | ID: mdl-22955988

ABSTRACT

The human genome contains many thousands of long noncoding RNAs (lncRNAs). While several studies have demonstrated compelling biological and disease roles for individual examples, analytical and experimental approaches to investigate these genes have been hampered by the lack of comprehensive lncRNA annotation. Here, we present and analyze the most complete human lncRNA annotation to date, produced by the GENCODE consortium within the framework of the ENCODE project and comprising 9277 manually annotated genes producing 14,880 transcripts. Our analyses indicate that lncRNAs are generated through pathways similar to those of protein-coding genes, with similar histone-modification profiles, splicing signals, and exon/intron lengths. In contrast to protein-coding genes, however, lncRNAs display a striking bias toward two-exon transcripts, they are predominantly localized in the chromatin and nucleus, and a fraction appear to be preferentially processed into small RNAs. They are under stronger selective pressure than neutrally evolving sequences, particularly in their promoter regions, which display levels of selection comparable to protein-coding genes. Importantly, about one-third seem to have arisen within the primate lineage. Comprehensive analysis of their expression in multiple human organs and brain regions shows that lncRNAs are generally expressed at lower levels than protein-coding genes, and display more tissue-specific expression patterns, with a large fraction of tissue-specific lncRNAs expressed in the brain. Expression correlation analysis indicates that lncRNAs show particularly striking positive correlation with the expression of antisense coding genes. This GENCODE annotation represents a valuable resource for future studies of lncRNAs.


Subjects
Genetic Databases , Long Noncoding RNA/genetics , Alternative Splicing , Animals , Cell Nucleus/genetics , Cell Nucleus/metabolism , Cluster Analysis , Molecular Evolution , Exons , Gene Expression Profiling , Gene Expression Regulation , Histones/metabolism , Humans , Molecular Sequence Annotation , Open Reading Frames , Organ Specificity/genetics , Primates/genetics , Post-Transcriptional RNA Processing , RNA Splice Sites , Messenger RNA/genetics , Genetic Selection , Genetic Transcription
18.
Bioinformatics ; 29(9): 1112-9, 2013 May 01.
Article in English | MEDLINE | ID: mdl-23449094

ABSTRACT

MOTIVATION: Aligning RNAs is useful to search for homologous genes, study evolutionary relationships, detect conserved regions and identify any patterns that may be of biological relevance. Poor levels of conservation among homologs, however, make it difficult to compare RNA sequences, even when considering evolutionarily closely related sequences. RESULTS: We describe SARA-Coffee, a tertiary structure-based multiple RNA aligner, which has been validated using BRAliDARTS, a new benchmark framework designed for evaluating tertiary structure-based multiple RNA aligners. We provide two methods to measure the capacity of alignments to match corresponding secondary and tertiary structure features. On this benchmark, SARA-Coffee outperforms both regular aligners and those using secondary structure information. Furthermore, we show that on sequences in which <60% of the nucleotides form base pairs, primary sequence methods usually perform better than secondary-structure aware aligners. AVAILABILITY AND IMPLEMENTATION: The package and the datasets are available from http://www.tcoffee.org/Projects/saracoffee and http://structure.biofold.org/sara/.


Subjects
RNA/chemistry , Sequence Alignment/methods , RNA Sequence Analysis/methods , Software , Algorithms , Nucleic Acid Conformation
19.
Nucleic Acids Res ; 40(7): e52, 2012 Apr.
Article in English | MEDLINE | ID: mdl-22230796

ABSTRACT

We address the challenge of regulatory sequence alignment with a new method, Pro-Coffee, a multiple aligner specifically designed for homologous promoter regions. Pro-Coffee uses a dinucleotide substitution matrix estimated on alignments of functional binding sites from TRANSFAC. We designed a validation framework using several thousand families of orthologous promoters. This dataset was used to evaluate the accuracy for predicting true human orthologs among their paralogs. We found that whereas other methods achieve on average 73.5% accuracy, and 77.6% when trained on that same dataset, the figure goes up to 80.4% for Pro-Coffee. We then applied a novel validation procedure based on multi-species ChIP-seq data. Trained and untrained methods were tested for their capacity to correctly align experimentally detected binding sites. Whereas the average number of correctly aligned sites for two transcription factors is 284 for default methods and 316 for trained methods, Pro-Coffee achieves 331, 16.5% above the default average. We find a high correlation between a method's performance when classifying orthologs and its ability to correctly align proven binding sites. Not only does this have interesting biological consequences, it also allows us to conclude that any method that is trained on the ortholog data set will result in functionally more informative alignments.


Subjects
Chromatin Immunoprecipitation , Promoter Regions, Genetic , Sequence Alignment/methods , Sequence Analysis, DNA , Animals , Binding Sites , Cattle , Dogs , Evolution, Molecular , Humans , Mice , Software , Transcription Factors/metabolism
20.
Microb Genom ; 10(4)2024 Apr.
Article in English | MEDLINE | ID: mdl-38578294

ABSTRACT

In Morocco, cutaneous leishmaniasis (CL) caused by Leishmania (L.) tropica is an important health problem. Despite the high incidence of CL in the country, the genomic heterogeneity of these parasites is still incompletely understood. In this study, we sequenced the genomes of 14 Moroccan isolates of L. tropica collected from confirmed cases of CL to investigate their genomic heterogeneity. Comparative genomics analyses were conducted by applying the recently established Genome Instability Pipeline (GIP), which allowed us to conduct phylogenomic and principal components analyses (PCA), and to assess genomic variations at the levels of the karyotype, gene copy number, single nucleotide polymorphisms (SNPs) and small insertion/deletion (INDEL) variants. Read-depth analyses revealed a mostly disomic karyotype, with the exception of the stable tetrasomy of chromosome 31. In contrast, we identified important gene copy number variations across all isolates, which affect known virulence genes and thus were probably selected in the field. SNP-based cluster analysis of the 14 isolates revealed a core group of 12 strains that formed a tight cluster and shared 45.1 % (87 751) of SNPs, as well as two strains (M3015, Ltr_16) that clustered separately from each other and the core group, suggesting the circulation of genetically highly diverse strains in Morocco. Phylogenetic analysis, which compared our 14 L. tropica isolates against 40 published genomes of L. tropica from a diverse array of locations, confirmed the genetic difference of our Moroccan isolates from all other isolates examined. In conclusion, our results indicate potential regional variations in SNP profiles that may differentiate Moroccan L. tropica from other L. tropica strains circulating in endemic countries in the Middle East. Our report paves the way for future research with a larger number of strains that will allow correlation of diverse phenotypes (resistance to treatments, virulence) and origins (geography, host species, year of isolation) to defined genomic signals such as gene copy number variations or SNP profiles that may represent interesting biomarker candidates.
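Read-depth karyotyping of the kind mentioned above can be illustrated with a short sketch. It assumes a common normalization (each chromosome's median depth divided by the median across chromosomes, scaled to a disomic baseline); this is an assumption for illustration and may differ from what GIP actually implements. The function name and the depth values are made up.

```python
from statistics import median


def estimate_somy(chrom_depths, baseline_ploidy=2):
    """Estimate chromosome copy number from per-position read depths.

    chrom_depths maps a chromosome name to a list of depth values.
    Assumes most chromosomes are at the baseline ploidy, so the median
    of the per-chromosome medians represents that baseline.
    """
    per_chrom = {name: median(depths) for name, depths in chrom_depths.items()}
    genome_level = median(per_chrom.values())
    return {
        name: baseline_ploidy * depth / genome_level
        for name, depth in per_chrom.items()
    }


# Toy depths: two disomic chromosomes and one at roughly double coverage.
toy_depths = {
    "chr1": [30, 31, 29],
    "chr2": [30, 30, 32],
    "chr31": [60, 61, 59],
}
somy = estimate_somy(toy_depths)
# somy == {"chr1": 2.0, "chr2": 2.0, "chr31": 4.0}
```

Using medians rather than means keeps the estimate robust to local amplifications or collapsed repeats, which matters when gene copy number varies as much as the abstract reports.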


Subjects
Leishmania tropica , Leishmaniasis, Cutaneous , Humans , Leishmania tropica/genetics , Phylogeny , DNA Copy Number Variations , Morocco/epidemiology , Leishmaniasis, Cutaneous/epidemiology , Leishmaniasis, Cutaneous/parasitology , Genomics