RESUMO
Transcription factors (TFs) and their specific interactions with targets are crucial for specifying gene-expression programs. To gain insights into the transcriptional regulatory networks in embryonic stem (ES) cells, we use chromatin immunoprecipitation coupled with ultra-high-throughput DNA sequencing (ChIP-seq) to map the locations of 13 sequence-specific TFs (Nanog, Oct4, STAT3, Smad1, Sox2, Zfx, c-Myc, n-Myc, Klf4, Esrrb, Tcfcp2l1, E2f1, and CTCF) and 2 transcription regulators (p300 and Suz12). These factors are known to play different roles in ES-cell biology as components of the LIF and BMP signaling pathways, self-renewal regulators, and key reprogramming factors. Our study provides insights into the integration of the signaling pathways into the ES-cell-specific transcription circuitries. Intriguingly, we find specific genomic regions extensively targeted by different TFs. Collectively, the comprehensive mapping of TF-binding sites identifies important features of the transcriptional regulatory networks that define ES-cell identity.
Assuntos
Células-Tronco Embrionárias/metabolismo , Redes Reguladoras de Genes , Transdução de Sinais , Animais , Sequência de Bases , Sítios de Ligação , Imunoprecipitação da Cromatina , Genoma , Fator 4 Semelhante a Kruppel , Camundongos , Complexos Multiproteicos , Fatores de Transcrição/metabolismoRESUMO
Despite advancements in genetic studies, it is difficult to understand and characterize the functional relevance of disease-associated genetic variants, especially in the context of a complex multifactorial disease such as multiple sclerosis (MS). As a large proportion of expression quantitative trait loci (eQTLs) are context-specific, we performed RNA-Seq in peripheral blood mononuclear cells from MS patients (n = 145) to identify eQTLs in regions centered on 109 MS risk single nucleotide polymorphisms and 7 associated human leukocyte antigen variants. We identified 77 statistically significant eQTL associations, including pseudogenes and non-coding RNAs. Thirty-eight out of 40 testable eQTL effects were colocalized with the disease association signal. As many eQTLs are tissue specific, we aimed to detail their significance in different cell types. Approximately 70% of the eQTLs were replicated and characterized in at least one major peripheral blood mononuclear cell-derived cell type. Furthermore, 40% of eQTLs were found to be more pronounced in MS patients compared with non-inflammatory neurological diseases patients. In addition, we found two single nucleotide polymorphisms to be significantly associated with the proportions of three different cell types. Mapping to enhancer histone marks and predicted transcription factor binding sites added additional functional evidence for eight eQTL regions. As an example, we found that rs71624119, shared with three other autoimmune diseases and located in a primed enhancer (H3K4me1) with potential binding for STAT transcription factors, significantly associates with ANKRD55 expression. This study provides many novel and validated targets for future functional characterization of MS and other diseases.
Assuntos
Esclerose Múltipla/genética , Locos de Características Quantitativas , Estudos de Coortes , Regulação da Expressão Gênica , Predisposição Genética para Doença , Antígenos HLA/genética , Humanos , Interferon gama/farmacologia , Leucócitos Mononucleares/fisiologia , Desequilíbrio de Ligação , Lipopolissacarídeos/farmacologia , Monócitos/efeitos dos fármacos , Monócitos/metabolismo , Polimorfismo de Nucleotídeo Único , Reprodutibilidade dos TestesRESUMO
Dravet syndrome (DS) is an early onset refractory epilepsy typically caused by de novo heterozygous variants in SCN1A encoding the α-subunit of the neuronal sodium channel Nav1.1. The syndrome is characterized by age-related progression of seizures, cognitive decline and movement disorders. We hypothesized that the distinct neurodevelopmental features in DS are caused by the disruption of molecular pathways in Nav1.1 haploinsufficient cells resulting in perturbed neural differentiation and maturation. Here, we established DS-patient and control induced pluripotent stem cell derived neural progenitor cells (iPSC NPC) and GABAergic inter-neuronal (iPSC GABA) cells. The DS-patient iPSC GABA cells showed a shift in sodium current activation and a perturbed response to induced oxidative stress. Transcriptome analysis revealed specific dysregulations of genes for chromatin structure, mitotic progression, neural plasticity and excitability in DS-patient iPSC NPCs and DS-patient iPSC GABA cells versus controls. The transcription factors FOXM1 and E2F1, positive regulators of the disrupted pathways for histone modification and cell cycle regulation, were markedly up-regulated in DS-iPSC GABA lines. Our study highlights transcriptional changes and disrupted pathways of chromatin remodeling in Nav1.1 haploinsufficient GABAergic cells, providing a molecular framework that overlaps with that of neurodevelopmental disorders and other epilepsies.
Assuntos
Montagem e Desmontagem da Cromatina/genética , Epilepsias Mioclônicas/genética , Neurônios GABAérgicos/metabolismo , Neurogênese/genética , Células Cultivadas , Epilepsias Mioclônicas/metabolismo , Haploinsuficiência , Humanos , Células-Tronco Pluripotentes Induzidas/metabolismo , Canal de Sódio Disparado por Voltagem NAV1.1/genética , Células-Tronco Neurais/metabolismo , Plasticidade Neuronal/fisiologia , Estresse Oxidativo/fisiologia , TranscriptomaRESUMO
Spatially distinct gene expression profiles in neural stem cells (NSCs) are a prerequisite to the formation of neuronal diversity, but how these arise from the regulatory interactions between chromatin accessibility and transcription factor activity has remained unclear. Here, we demonstrate that, despite their distinct gene expression profiles, NSCs of the mouse cortex and spinal cord share the majority of their DNase I hypersensitive sites (DHSs). Regardless of this similarity, domain-specific gene expression is highly correlated with the relative accessibility of associated DHSs, as determined by sequence read density. Notably, the binding pattern of the general NSC transcription factor SOX2 is also largely cell type specific and coincides with an enrichment of LHX2 motifs in the cortex and HOXA9 motifs in the spinal cord. Interestingly, in a zebrafish reporter gene system, these motifs were critical determinants of patterned gene expression along the rostral-caudal axis. Our findings establish a predictive model for patterned NSC gene expression, whereby domain-specific expression of LHX2 and HOX proteins act on their target motifs within commonly accessible cis-regulatory regions to specify SOX2 binding. In turn, this binding correlates strongly with these DHSs relative accessibility-a robust predictor of neighboring gene expression.
Assuntos
Cromatina/metabolismo , Regulação da Expressão Gênica no Desenvolvimento , Células-Tronco Neurais/fisiologia , Animais , Células Cultivadas , Córtex Cerebral/citologia , Expressão Gênica , Proteínas de Homeodomínio/metabolismo , Proteínas com Homeodomínio LIM/metabolismo , Camundongos , Ligação Proteica , Fatores de Transcrição SOXB1/metabolismo , Medula Espinal/citologia , Fatores de Transcrição/metabolismo , Peixe-Zebra , Proteínas de Peixe-Zebra/metabolismoRESUMO
Regularly performed endurance training has many beneficial effects on health and skeletal muscle function, and can be used to prevent and treat common diseases e.g. cardiovascular disease, type II diabetes and obesity. The molecular adaptation mechanisms regulating these effects are incompletely understood. To date, global transcriptome changes in skeletal muscles have been studied at the gene level only. Therefore, global isoform expression changes following exercise training in humans are unknown. Also, the effects of repeated interventions on transcriptional memory or training response have not been studied before. In this study, 23 individuals trained one leg for three months. Nine months later, 12 of the same subjects trained both legs in a second training period. Skeletal muscle biopsies were obtained from both legs before and after both training periods. RNA sequencing analysis of all 119 skeletal muscle biopsies showed that training altered the expression of 3,404 gene isoforms, mainly associated with oxidative ATP production. Fifty-four genes had isoforms that changed in opposite directions. Training altered expression of 34 novel transcripts, all with protein-coding potential. After nine months of detraining, no training-induced transcriptome differences were detected between the previously trained and untrained legs. Although there were several differences in the physiological and transcriptional responses to repeated training, no coherent evidence of an endurance training induced transcriptional skeletal muscle memory was found. This human lifestyle intervention induced differential expression of thousands of isoforms and several transcripts from unannotated regions of the genome. It is likely that the observed isoform expression changes reflect adaptational mechanisms and processes that provide the functional and health benefits of regular physical activity.
RESUMO
We present a liquid chromatography-mass spectrometry (LC-MS)-based method permitting unbiased (gene prediction-independent) genome-wide discovery of protein-coding loci in higher eukaryotes. Using high-resolution isoelectric focusing (HiRIEF) at the peptide level in the 3.7-5.0 pH range and accurate peptide isoelectric point (pI) prediction, we probed the six-reading-frame translation of the human and mouse genomes and identified 98 and 52 previously undiscovered protein-coding loci, respectively. The method also enabled deep proteome coverage, identifying 13,078 human and 10,637 mouse proteins.
Assuntos
Cromatografia Líquida/métodos , Genômica/métodos , Espectrometria de Massas/métodos , Proteoma/análise , Proteômica/métodos , Animais , Arabidopsis/genética , Biologia Computacional/métodos , Éxons , Humanos , Concentração de Íons de Hidrogênio , Focalização Isoelétrica/métodos , Camundongos , Modelos Estatísticos , Fases de Leitura Aberta , Peptídeos/química , Biossíntese de Proteínas , Proteínas/químicaRESUMO
Sequencing-based gene expression methods like RNA-sequencing (RNA-seq) have become increasingly common, but it is often claimed that results obtained in different studies are not comparable owing to the influence of laboratory batch effects, differences in RNA extraction and sequencing library preparation methods and bioinformatics processing pipelines. It would be unfortunate if different experiments were in fact incomparable, as there is great promise in data fusion and meta-analysis applied to sequencing data sets. We therefore compared reported gene expression measurements for ostensibly similar samples (specifically, human brain, heart and kidney samples) in several different RNA-seq studies to assess their overall consistency and to examine the factors contributing most to systematic differences. The same comparisons were also performed after preprocessing all data in a consistent way, eliminating potential bias from bioinformatics pipelines. We conclude that published human tissue RNA-seq expression measurements appear relatively consistent in the sense that samples cluster by tissue rather than laboratory of origin given simple preprocessing transformations. The article is supplemented by a detailed walkthrough with embedded R code and figures.
Assuntos
Bases de Dados Genéticas , RNA/genética , Análise de Sequência de RNA , Encéfalo/metabolismo , Perfilação da Expressão Gênica , Humanos , Rim/metabolismo , Miocárdio/metabolismoRESUMO
We recently found that Yersinia pseudotuberculosis can be used as a model of persistent bacterial infections. We performed in vivo RNA-seq of bacteria in small cecal tissue biopsies at early and persistent stages of infection to determine strategies associated with persistence. Comprehensive analysis of mixed RNA populations from infected tissues revealed that Y. pseudotuberculosis undergoes transcriptional reprogramming with drastic down-regulation of T3SS virulence genes during persistence when the pathogen resides within the cecum. At the persistent stage, the expression pattern in many respects resembles the pattern seen in vitro at 26oC, with for example, up-regulation of flagellar genes and invA. These findings are expected to have impact on future rationales to identify suitable bacterial targets for new antibiotics. Other genes that are up-regulated during persistence are genes involved in anaerobiosis, chemotaxis, and protection against oxidative and acidic stress, which indicates the influence of different environmental cues. We found that the Crp/CsrA/RovA regulatory cascades influence the pattern of bacterial gene expression during persistence. Furthermore, arcA, fnr, frdA, and wrbA play critical roles in persistence. Our findings suggest a model for the life cycle of this enteropathogen with reprogramming from a virulent to an adapted phenotype capable of persisting and spreading by fecal shedding.
Assuntos
Análise de Sequência de RNA/métodos , Virulência/genética , Infecções por Yersinia pseudotuberculosis/genética , Yersinia pseudotuberculosis/genética , Yersinia pseudotuberculosis/patogenicidade , Animais , Ceco/imunologia , Ceco/microbiologia , Ceco/patologia , Feminino , Regulação Bacteriana da Expressão Gênica , Genes Bacterianos , Camundongos , Análise em Microsséries , Microbiota/imunologia , RNA Bacteriano/genética , Yersinia pseudotuberculosis/imunologia , Infecções por Yersinia pseudotuberculosis/imunologia , Infecções por Yersinia pseudotuberculosis/patologiaRESUMO
The histone H3 Lys 9 (H3K9) methyltransferase Eset is an epigenetic regulator critical for the development of the inner cell mass (ICM). Although ICM-derived embryonic stem (ES) cells are normally unable to contribute to the trophectoderm (TE) in blastocysts, we find that depletion of Eset by shRNAs leads to differentiation with the formation of trophoblast-like cells and induction of trophoblast-associated gene expression. Using chromatin immmunoprecipitation (ChIP) and sequencing (ChIP-seq) analyses, we identified Eset target genes with Eset-dependent H3K9 trimethylation. We confirmed that genes that are preferentially expressed in the TE (Tcfap2a and Cdx2) are bound and repressed by Eset. Single-cell PCR analysis shows that the expression of Cdx2 and Tcfap2a is also induced in Eset-depleted morula cells. Importantly, Eset-depleted cells can incorporate into the TE of a blastocyst and, subsequently, placental tissues. Coimmunoprecipitation and ChIP assays further demonstrate that Eset interacts with Oct4, which in turn recruits Eset to silence these trophoblast-associated genes. Our results suggest that Eset restricts the extraembryonic trophoblast lineage potential of pluripotent cells and links an epigenetic regulator to key cell fate decision through a pluripotency factor.
Assuntos
Diferenciação Celular , Linhagem da Célula , Células-Tronco Embrionárias/citologia , Metiltransferases/metabolismo , Fator 3 de Transcrição de Octâmero/metabolismo , Proteínas Metiltransferases/metabolismo , Trofoblastos/citologia , Trofoblastos/metabolismo , Animais , Imunoprecipitação da Cromatina , Regulação da Expressão Gênica no Desenvolvimento , Genoma/fisiologia , Histona-Lisina N-Metiltransferase , Proteínas de Homeodomínio/metabolismo , Camundongos , Mórula/citologia , Fator de Transcrição AP-2/metabolismoRESUMO
BACKGROUND: Although most insect species are specialized on one or few groups of plants, there are phytophagous insects that seem to use virtually any kind of plant as food. Understanding the nature of this ability to feed on a wide repertoire of plants is crucial for the control of pest species and for the elucidation of the macroevolutionary mechanisms of speciation and diversification of insect herbivores. Here we studied Vanessa cardui, the species with the widest diet breadth among butterflies and a potential insect pest, by comparing tissue-specific transcriptomes from caterpillars that were reared on different host plants. We tested whether the similarities of gene-expression response reflect the evolutionary history of adaptation to these plants in the Vanessa and related genera, against the null hypothesis of transcriptional profiles reflecting plant phylogenetic relatedness. RESULT: Using both unsupervised and supervised methods of data analysis, we found that the tissue-specific patterns of caterpillar gene expression are better explained by the evolutionary history of adaptation of the insects to the plants than by plant phylogeny. CONCLUSION: Our findings suggest that V. cardui may use two sets of expressed genes to achieve polyphagy, one associated with the ancestral capability to consume Rosids and Asterids, and another allowing the caterpillar to incorporate a wide range of novel host-plants.
Assuntos
Evolução Biológica , Borboletas/genética , Animais , Borboletas/crescimento & desenvolvimento , Borboletas/fisiologia , Herbivoria , Larva/fisiologia , Magnoliopsida/genética , Magnoliopsida/fisiologia , Oviposição , Filogenia , TranscriptomaRESUMO
BACKGROUND: Infectious disease involving multiple genetically distinct populations of pathogens is frequently concurrent, but difficult to detect or describe with current routine methodology. Cryptosporidium sp. is a widespread gastrointestinal protozoan of global significance in both animals and humans. It cannot be easily maintained in culture and infections of multiple strains have been reported. To explore the potential use of single cell genomics methodology for revealing genome-level variation in clinical samples from Cryptosporidium-infected hosts, we sorted individual oocysts for subsequent genome amplification and full-genome sequencing. RESULTS: Cells were identified with fluorescent antibodies with an 80 % success rate for the entire single cell genomics workflow, demonstrating that the methodology can be applied directly to purified fecal samples. Ten amplified genomes from sorted single cells were selected for genome sequencing and compared both to the original population and a reference genome in order to evaluate the accuracy and performance of the method. Single cell genome coverage was on average 81 % even with a moderate sequencing effort and by combining the 10 single cell genomes, the full genome was accounted for. By a comparison to the original sample, biological variation could be distinguished and separated from noise introduced in the amplification. CONCLUSIONS: As a proof of principle, we have demonstrated the power of applying single cell genomics to dissect infectious disease caused by closely related parasite species or subtypes. The workflow can easily be expanded and adapted to target other protozoans, and potential applications include mapping genome-encoded traits, virulence, pathogenicity, host specificity and resistance at the level of cells as truly meaningful biological units.
Assuntos
Cryptosporidium/genética , Eucariotos/genética , Genoma , Genômica , Alelos , Variação Genética , Genômica/métodos , Genótipo , Sequenciamento de Nucleotídeos em Larga Escala , Mutação , Oocistos , Polimorfismo de Nucleotídeo ÚnicoRESUMO
The derivation of human ES cells (hESCs) from human blastocysts represents one of the milestones in stem cell biology. The full potential of hESCs in research and clinical applications requires a detailed understanding of the genetic network that governs the unique properties of hESCs. Here, we report a genome-wide RNA interference screen to identify genes which regulate self-renewal and pluripotency properties in hESCs. Interestingly, functionally distinct complexes involved in transcriptional regulation and chromatin remodelling are among the factors identified in the screen. To understand the roles of these potential regulators of hESCs, we studied transcription factor PRDM14 to gain new insights into its functional roles in the regulation of pluripotency. We showed that PRDM14 regulates directly the expression of key pluripotency gene POU5F1 through its proximal enhancer. Genome-wide location profiling experiments revealed that PRDM14 colocalized extensively with other key transcription factors such as OCT4, NANOG and SOX2, indicating that PRDM14 is integrated into the core transcriptional regulatory network. More importantly, in a gain-of-function assay, we showed that PRDM14 is able to enhance the efficiency of reprogramming of human fibroblasts in conjunction with OCT4, SOX2 and KLF4. Altogether, our study uncovers a wealth of novel hESC regulators wherein PRDM14 exemplifies a key transcription factor required for the maintenance of hESC identity and the reacquisition of pluripotency in human somatic cells.
Assuntos
Células-Tronco Embrionárias/citologia , Células-Tronco Embrionárias/metabolismo , Genoma Humano/genética , Interferência de RNA , Proteínas Repressoras/metabolismo , Animais , Sequência de Bases , Linhagem Celular , Reprogramação Celular/genética , Proteínas de Ligação a DNA/genética , Proteínas de Ligação a DNA/metabolismo , Elementos Facilitadores Genéticos/genética , Fibroblastos/citologia , Fibroblastos/metabolismo , Regulação da Expressão Gênica/genética , Humanos , Células-Tronco Pluripotentes Induzidas/citologia , Células-Tronco Pluripotentes Induzidas/metabolismo , Fator 4 Semelhante a Kruppel , Camundongos , Fator 3 de Transcrição de Octâmero/genética , Fator 3 de Transcrição de Octâmero/metabolismo , Proteínas de Ligação a RNA , Proteínas Repressoras/genética , Fatores de Transcrição SOXB1/metabolismo , Fatores de TranscriçãoRESUMO
Global classification of the human proteins with regards to spatial expression patterns across organs and tissues is important for studies of human biology and disease. Here, we used a quantitative transcriptomics analysis (RNA-Seq) to classify the tissue-specific expression of genes across a representative set of all major human organs and tissues and combined this analysis with antibody-based profiling of the same tissues. To present the data, we launch a new version of the Human Protein Atlas that integrates RNA and protein expression data corresponding to â¼80% of the human protein-coding genes with access to the primary data for both the RNA and the protein analysis on an individual gene level. We present a classification of all human protein-coding genes with regards to tissue-specificity and spatial expression pattern. The integrative human expression map can be used as a starting point to explore the molecular constituents of the human body.
Assuntos
Anticorpos/farmacologia , Expressão Gênica , Genômica/métodos , Especificidade de Órgãos/genética , Proteômica/métodos , Transcriptoma , Feminino , Perfilação da Expressão Gênica , Redes Reguladoras de Genes , Humanos , Masculino , Proteínas/genética , Proteínas/metabolismo , Proteoma/genética , Proteoma/metabolismo , Integração de Sistemas , Análise Serial de TecidosRESUMO
The transformation of normal cells to malignant, metastatic tumor cells is a multistep process caused by the sequential acquirement of genetic changes. To identify these changes, we compared the transcriptomes and levels and distribution of proteins in a four-stage cell model of isogenically matched normal, immortalized, transformed, and metastatic human cells, using deep transcriptome sequencing and immunofluorescence microscopy. The data show that â¼6% (n = 1,357) of the human protein-coding genes are differentially expressed across the stages in the model. Interestingly, the majority of these genes are down-regulated, linking malignant transformation to dedifferentiation. The up-regulated genes are mainly components that control cellular proliferation, whereas the down-regulated genes consist of proteins exposed on or secreted from the cell surface. As many of the identified gene products control basic cellular functions that are defective in cancers, the data provide candidates for follow-up studies to investigate their functional roles in tumor formation. When we further compared the expression levels of four of the identified proteins in clinical cancer cohorts, similar differences were observed between benign and cancer cells, as in the cell model. This shows that this comprehensive demonstration of the molecular changes underlying malignant transformation is a relevant model to study the process of tumor formation.
Assuntos
Transformação Celular Neoplásica/metabolismo , Regulação Neoplásica da Expressão Gênica/fisiologia , Modelos Biológicos , Proteínas/metabolismo , Antígenos Transformantes de Poliomavirus/metabolismo , Linhagem Celular Tumoral , Fibroblastos/metabolismo , Perfilação da Expressão Gênica , Genes ras/fisiologia , Estudos de Associação Genética , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Imuno-Histoquímica , Microscopia de Fluorescência , Análise de Componente Principal , Telomerase/metabolismoRESUMO
BACKGROUND: To understand cardiac and skeletal muscle function, it is important to define and explore their molecular constituents and also to identify similarities and differences in the gene expression in these two different striated muscle tissues. Here, we have investigated the genes and proteins with elevated expression in cardiac and skeletal muscle in relation to all other major human tissues and organs using a global transcriptomics analysis complemented with antibody-based profiling to localize the corresponding proteins on a single cell level. RESULTS: Our study identified a comprehensive list of genes expressed in cardiac and skeletal muscle. The genes with elevated expression were further stratified according to their global expression pattern across the human body as well as their precise localization in the muscle tissues. The functions of the proteins encoded by the elevated genes are well in line with the physiological functions of cardiac and skeletal muscle, such as contraction, ion transport, regulation of membrane potential and actomyosin structure organization. A large fraction of the transcripts in both cardiac and skeletal muscle correspond to mitochondrial proteins involved in energy metabolism, which demonstrates the extreme specialization of these muscle tissues to provide energy for contraction. CONCLUSIONS: Our results provide a comprehensive list of genes and proteins elevated in striated muscles. A number of proteins not previously characterized in cardiac and skeletal muscle were identified and localized to specific cellular subcompartments. These proteins represent an interesting starting point for further functional analysis of their role in muscle biology and disease.
Assuntos
Músculo Esquelético/metabolismo , Miocárdio/metabolismo , Proteoma/genética , Transcriptoma/genética , Anticorpos/genética , Perfilação da Expressão Gênica , Humanos , Proteoma/metabolismoRESUMO
Previous studies of antibiotic resistance dissemination by travel have, by targeting only a select number of cultivable bacterial species, omitted most of the human microbiome. Here, we used explorative shotgun metagenomic sequencing to address the abundance of >300 antibiotic resistance genes in fecal specimens from 35 Swedish students taken before and after exchange programs on the Indian peninsula or in Central Africa. All specimens were additionally cultured for extended-spectrum beta-lactamase (ESBL)-producing enterobacteria, and the isolates obtained were genome sequenced. The overall taxonomic diversity and composition of the gut microbiome remained stable before and after travel, but there was an increasing abundance of Proteobacteria in 25/35 students. The relative abundance of antibiotic resistance genes increased, most prominently for genes encoding resistance to sulfonamide (2.6-fold increase), trimethoprim (7.7-fold), and beta-lactams (2.6-fold). Importantly, the increase observed occurred without any antibiotic intake. Of 18 students visiting the Indian peninsula, 12 acquired ESBL-producing Escherichia coli, while none returning from Africa were positive. Despite deep sequencing efforts, the sensitivity of metagenomics was not sufficient to detect acquisition of the low-abundant genes responsible for the observed ESBL phenotype. In conclusion, metagenomic sequencing of the intestinal microbiome of Swedish students returning from exchange programs in Central Africa or the Indian peninsula showed increased abundance of genes encoding resistance to widely used antibiotics.
Assuntos
Antibacterianos/farmacologia , Resistência Microbiana a Medicamentos/genética , Microbioma Gastrointestinal/genética , Adulto , Escherichia coli/efeitos dos fármacos , Fezes/microbiologia , Feminino , Humanos , Masculino , Metagenômica , Testes de Sensibilidade Microbiana , Proteobactérias/efeitos dos fármacos , Proteobactérias/genética , Sulfonamidas/farmacologia , Trimetoprima/farmacologia , Adulto Jovem , beta-Lactamas/farmacologiaRESUMO
Genetic studies of complex traits have become increasingly successful as progress is made in next-generation sequencing. We aimed at discovering single nucleotide variation present in known and new candidate genes for developmental dyslexia: CYP19A1, DCDC2, DIP2A, DYX1C1, GCFC2 (also known as C2orf3), KIAA0319, MRPL19, PCNT, PRMT2, ROBO1 and S100B. We used next-generation sequencing to identify single-nucleotide polymorphisms in the exons of these 11 genes in pools of 100 DNA samples of Finnish individuals with developmental dyslexia. Subsequent individual genotyping of those 100 individuals, and additional cases and controls from the Finnish and German populations, validated 92 out of 111 different single-nucleotide variants. A nonsynonymous polymorphism in DCDC2 (corrected P = 0.002) and a noncoding variant in S100B (corrected P = 0.016) showed a significant association with spelling performance in families of German origin. No significant association was found for the variants neither in the Finnish case-control sample set nor in the Finnish family sample set. Our findings further strengthen the role of DCDC2 and implicate S100B, in the biology of reading and spelling.
Assuntos
Dislexia/genética , Proteínas Associadas aos Microtúbulos/genética , Subunidade beta da Proteína Ligante de Cálcio S100/genética , Estudos de Casos e Controles , Estudos de Associação Genética , Predisposição Genética para Doença , Humanos , Polimorfismo de Nucleotídeo Único , Análise de Sequência de DNARESUMO
Human skeletal muscle health is important for quality of life and several chronic diseases, including type II diabetes, heart disease, and cancer. Skeletal muscle is a tissue widely used to study mechanisms behind different diseases and adaptive effects of controlled interventions. For such mechanistic studies, knowledge about the gene expression profiles in different states is essential. Since the baseline transcriptome has not been analyzed systematically, the purpose of this study was to provide a deep reference profile of female and male skeletal muscle. RNA sequencing data were analyzed from a large set of 45 resting human muscle biopsies. We provide extensive information on the skeletal muscle transcriptome, including 5 previously unannotated protein-coding transcripts. Global transcriptional tissue homogeneity was strikingly high, within both a specific muscle and the contralateral leg. We identified >23,000 known isoforms and found >5000 isoforms that differ between the sexes. The female and male transcriptome was enriched for genes associated with oxidative metabolism and protein catabolic processes, respectively. The data demonstrate remarkably high tissue homogeneity and provide a deep and extensive baseline reference for the human skeletal muscle transcriptome, with regard to alternative splicing, novel transcripts, and sex differences in functional ontology.transcriptome: sex differences, alternative splicing, and tissue homogeneity assessed with RNA sequencing.
Assuntos
Processamento Alternativo , Músculo Esquelético/metabolismo , Transcriptoma , Adulto , Feminino , Humanos , Masculino , Especificidade de Órgãos , Análise de Sequência de RNA , Fatores SexuaisRESUMO
The human body contains thousands of unique cell types, each with specialized functions. Cell identity is governed in large part by gene transcription programs, which are determined by regulatory elements encoded in DNA. To identify regulatory elements active in seven cell lines representative of diverse human cell types, we used DNase-seq and FAIRE-seq (Formaldehyde Assisted Isolation of Regulatory Elements) to map "open chromatin." Over 870,000 DNaseI or FAIRE sites, which correspond tightly to nucleosome-depleted regions, were identified across the seven cell lines, covering nearly 9% of the genome. The combination of DNaseI and FAIRE is more effective than either assay alone in identifying likely regulatory elements, as judged by coincidence with transcription factor binding locations determined in the same cells. Open chromatin common to all seven cell types tended to be at or near transcription start sites and to be coincident with CTCF binding sites, while open chromatin sites found in only one cell type were typically located away from transcription start sites and contained DNA motifs recognized by regulators of cell-type identity. We show that open chromatin regions bound by CTCF are potent insulators. We identified clusters of open regulatory elements (COREs) that were physically near each other and whose appearance was coordinated among one or more cell types. Gene expression and RNA Pol II binding data support the hypothesis that COREs control gene activity required for the maintenance of cell-type identity. This publicly available atlas of regulatory elements may prove valuable in identifying noncoding DNA sequence variants that are causally linked to human disease.
Assuntos
Cromatina/metabolismo , Mapeamento Cromossômico , Elementos Reguladores de Transcrição , Análise de Sequência de DNA/métodos , Sequência de Bases , Sítios de Ligação , Fator de Ligação a CCCTC , Diferenciação Celular/genética , Linhagem Celular , Regulação da Expressão Gênica , Humanos , Ligação Proteica , Proteínas Repressoras/metabolismo , Transcrição Gênica , Ativação TranscricionalRESUMO
Personalized cancer treatment requires molecular characterization of individual tumor biopsies. These samples are frequently only available in limited quantities hampering genomic analysis. Several whole genome amplification (WGA) protocols have been developed with reported varying representation of genomic regions post amplification. In this study we investigate region dropout using a φ29 polymerase based WGA approach. DNA from 123 lung cancers specimens and corresponding normal tissue were used and evaluated by Sanger sequencing of the p53 exons 5-8. To enable comparative analysis of this scarce material, WGA samples were compared with unamplified material using a pooling strategy of the 123 samples. In addition, a more detailed analysis of exon 7 amplicons were performed followed by extensive cloning and Sanger sequencing. Interestingly, by comparing data from the pooled samples to the individually sequenced exon 7, we demonstrate that mutations are more easily recovered from WGA pools and this was also supported by simulations of different sequencing coverage. Overall this data indicate a limited random loss of genomic regions supporting the use of whole genome amplification for genomic analysis.