Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 80
Filtrar
1.
Nature ; 622(7981): 41-47, 2023 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-37794265

RESUMO

Scientists have been trying to identify every gene in the human genome since the initial draft was published in 2001. In the years since, much progress has been made in identifying protein-coding genes, currently estimated to number fewer than 20,000, with an ever-expanding number of distinct protein-coding isoforms. Here we review the status of the human gene catalogue and the efforts to complete it in recent years. Beside the ongoing annotation of protein-coding genes, their isoforms and pseudogenes, the invention of high-throughput RNA sequencing and other technological breakthroughs have led to a rapid growth in the number of reported non-coding RNA genes. For most of these non-coding RNAs, the functional relevance is currently unclear; we look at recent advances that offer paths forward to identifying their functions and towards eventually completing the human gene catalogue. Finally, we examine the need for a universal annotation standard that includes all medically significant genes and maintains their relationships with different reference genomes for the use of the human gene catalogue in clinical settings.


Assuntos
Genes , Genoma Humano , Anotação de Sequência Molecular , Isoformas de Proteínas , Humanos , Genoma Humano/genética , Anotação de Sequência Molecular/normas , Anotação de Sequência Molecular/tendências , Isoformas de Proteínas/genética , Projeto Genoma Humano , Pseudogenes , RNA/genética
2.
Nucleic Acids Res ; 52(W1): W341-W347, 2024 Jul 05.
Artigo em Inglês | MEDLINE | ID: mdl-38709877

RESUMO

Genes commonly express multiple RNA products (RNA isoforms), which differ in exonic content and can have different functions. Making sense of the plethora of known and novel RNA isoforms being identified by transcriptomic approaches requires a user-friendly way to visualize gene isoforms and how they differ in exonic content, expression levels and potential functions. Here we introduce IsoVis, a freely available webserver that accepts user-supplied transcriptomic data and visualizes the expressed isoforms in a clear, intuitive manner. IsoVis contains numerous features, including the ability to visualize all RNA isoforms of a gene and their expression levels; the annotation of known isoforms from external databases; mapping of protein domains and features to exons, allowing changes to protein sequence and function between isoforms to be established; and extensive species compatibility. Datasets visualised on IsoVis remain private to the user, allowing analysis of sensitive data. IsoVis visualisations can be downloaded to create publication-ready figures. The IsoVis webserver enables researchers to perform isoform analyses without requiring programming skills, is free to use, and available at https://isomix.org/isovis/.


Assuntos
Internet , Anotação de Sequência Molecular , Isoformas de RNA , Software , Isoformas de RNA/genética , Isoformas de RNA/metabolismo , Isoformas de RNA/química , Humanos , Animais , Éxons/genética , Transcriptoma/genética , Processamento Alternativo
3.
J Immunol ; 209(12): 2352­2361, 2022 12 15.
Artigo em Inglês | MEDLINE | ID: mdl-36427009

RESUMO

Dendritic cells (DCs) are functionally diverse and are present in most adult tissues, but deep understanding of human DC biology is hampered by relatively small numbers of these in circulation and their short lifespan in human tissues. We built a transcriptional atlas of human DCs by combining samples from 14 expression profiling studies derived from 10 laboratories. We identified significant gene expression variation of DC subset-defining markers across tissue type and upon viral or bacterial stimulation. We further highlight critical gaps between in vitro-derived DC subsets and their in vivo counterparts and provide evidence that monocytes or cord blood progenitor in vitro-differentiated DCs fail to capture the repertoire of primary DC subsets or behaviors. In constructing a reference DC atlas, we provide an important resource for the community wishing to identify and annotate tissue-specific DC subsets from single-cell datasets, or benchmark new in vitro models of DC biology.


Assuntos
Células Dendríticas , Monócitos , Humanos , Células Dendríticas/metabolismo , Diferenciação Celular , Biologia
4.
Nature ; 543(7644): 199-204, 2017 03 09.
Artigo em Inglês | MEDLINE | ID: mdl-28241135

RESUMO

Long non-coding RNAs (lncRNAs) are largely heterogeneous and functionally uncharacterized. Here, using FANTOM5 cap analysis of gene expression (CAGE) data, we integrate multiple transcript collections to generate a comprehensive atlas of 27,919 human lncRNA genes with high-confidence 5' ends and expression profiles across 1,829 samples from the major human primary cell types and tissues. Genomic and epigenomic classification of these lncRNAs reveals that most intergenic lncRNAs originate from enhancers rather than from promoters. Incorporating genetic and expression data, we show that lncRNAs overlapping trait-associated single nucleotide polymorphisms are specifically expressed in cell types relevant to the traits, implicating these lncRNAs in multiple diseases. We further demonstrate that lncRNAs overlapping expression quantitative trait loci (eQTL)-associated single nucleotide polymorphisms of messenger RNAs are co-expressed with the corresponding messenger RNAs, suggesting their potential roles in transcriptional regulation. Combining these findings with conservation data, we identify 19,175 potentially functional lncRNAs in the human genome.


Assuntos
Bases de Dados Genéticas , RNA Longo não Codificante/química , RNA Longo não Codificante/genética , Transcriptoma/genética , Células Cultivadas , Sequência Conservada/genética , Conjuntos de Dados como Assunto , Elementos Facilitadores Genéticos/genética , Epigênese Genética , Perfilação da Expressão Gênica , Regulação da Expressão Gênica , Genoma Humano/genética , Estudo de Associação Genômica Ampla , Genômica , Humanos , Internet , Anotação de Sequência Molecular , Especificidade de Órgãos/genética , Polimorfismo de Nucleotídeo Único , Regiões Promotoras Genéticas/genética , Locos de Características Quantitativas/genética , Estabilidade de RNA , RNA Mensageiro/genética
5.
PLoS Comput Biol ; 16(9): e1008219, 2020 09.
Artigo em Inglês | MEDLINE | ID: mdl-32986694

RESUMO

Gene expression atlases have transformed our understanding of the development, composition and function of human tissues. New technologies promise improved cellular or molecular resolution, and have led to the identification of new cell types, or better defined cell states. But as new technologies emerge, information derived on old platforms becomes obsolete. We demonstrate that it is possible to combine a large number of different profiling experiments summarised from dozens of laboratories and representing hundreds of donors, to create an integrated molecular map of human tissue. As an example, we combine 850 samples from 38 platforms to build an integrated atlas of human blood cells. We achieve robust and unbiased cell type clustering using a variance partitioning method, selecting genes with low platform bias relative to biological variation. Other than an initial rescaling, no other transformation to the primary data is applied through batch correction or renormalisation. Additional data, including single-cell datasets, can be projected for comparison, classification and annotation. The resulting atlas provides a multi-scaled approach to visualise and analyse the relationships between sets of genes and blood cell lineages, including the maturation and activation of leukocytes in vivo and in vitro. In allowing for data integration across hundreds of studies, we address a key reproduciblity challenge which is faced by any new technology. This allows us to draw on the deep phenotypes and functional annotations that accompany traditional profiling methods, and provide important context to the high cellular resolution of single cell profiling. Here, we have implemented the blood atlas in the open access Stemformatics.org platform, drawing on its extensive collection of curated transcriptome data. The method is simple, scalable and amenable for rapid deployment in other biological systems or computational workflows.


Assuntos
Transcriptoma , Análise por Conglomerados , Curadoria de Dados , Perfilação da Expressão Gênica , Humanos
6.
Nucleic Acids Res ; 47(D1): D841-D846, 2019 01 08.
Artigo em Inglês | MEDLINE | ID: mdl-30407577

RESUMO

Stemformatics is an established gene expression data portal containing over 420 public gene expression datasets derived from microarray, RNA sequencing and single cell profiling technologies. Developed for the stem cell community, it has a major focus on pluripotency, tissue stem cells, and staged differentiation. Stemformatics includes curated 'collections' of data relevant to cell reprogramming, as well as hematopoiesis and leukaemia. Rather than simply rehosting datasets as they appear in public repositories, Stemformatics uses a stringent set of quality control metrics and its own pipelines to process handpicked datasets from raw files. This means that about 30% of datasets processed by Stemformatics fail the quality control metrics and never make it to the portal, ensuring that Stemformatics data are of high quality and have been processed in a consistent manner. Stemformatics provides easy-to-use and intuitive tools for biologists to visually explore the data, including interactive gene expression profiles, principal component analysis plots and hierarchical clusters, among others. The addition of tools that facilitate cross-dataset comparisons provides users with snapshots of gene expression in multiple cell and tissues, assisting the identification of cell-type restricted genes, or potential housekeeping genes. Stemformatics is freely available at stemformatics.org.


Assuntos
Bases de Dados Genéticas , Células-Tronco , Transcriptoma/genética , Animais , Diferenciação Celular/genética , Curadoria de Dados , Genes Essenciais/genética , Humanos , Análise de Sequência de RNA , Software
7.
Proc Natl Acad Sci U S A ; 115(45): E10662-E10671, 2018 11 06.
Artigo em Inglês | MEDLINE | ID: mdl-30352847

RESUMO

Group A Streptococcus (GAS) is a Gram-positive bacterial pathogen that causes a range of diseases, including fatal invasive infections. However, the mechanisms by which the innate immune system recognizes GAS are not well understood. We herein report that the C-type lectin receptor macrophage inducible C-type lectin (Mincle) recognizes GAS and initiates antibacterial immunity. Gene expression analysis of myeloid cells upon GAS stimulation revealed the contribution of the caspase recruitment domain-containing protein 9 (CARD9) pathway to the antibacterial responses. Among receptors signaling through CARD9, Mincle induced the production of inflammatory cytokines, inducible nitric oxide synthase, and reactive oxygen species upon recognition of the anchor of lipoteichoic acid, monoglucosyldiacylglycerol (MGDG), produced by GAS. Upon GAS infection, Mincle-deficient mice exhibited impaired production of proinflammatory cytokines, severe bacteremia, and rapid lethality. GAS also possesses another Mincle ligand, diglucosyldiacylglycerol; however, this glycolipid interfered with MGDG-induced activation. These results indicate that Mincle plays a central role in protective immunity against acute GAS infection.


Assuntos
Lectinas Tipo C/metabolismo , Lipopolissacarídeos/metabolismo , Proteínas de Membrana/metabolismo , Infecções Estreptocócicas/imunologia , Streptococcus pyogenes/patogenicidade , Ácidos Teicoicos/metabolismo , Animais , Proteínas Adaptadoras de Sinalização CARD/genética , Células HEK293 , Humanos , Camundongos , Camundongos Endogâmicos C57BL , Monócitos/metabolismo , Infecções Estreptocócicas/microbiologia
8.
Nature ; 516(7530): 192-7, 2014 Dec 11.
Artigo em Inglês | MEDLINE | ID: mdl-25503232

RESUMO

Pluripotency is defined by the ability of a cell to differentiate to the derivatives of all the three embryonic germ layers: ectoderm, mesoderm and endoderm. Pluripotent cells can be captured via the archetypal derivation of embryonic stem cells or via somatic cell reprogramming. Somatic cells are induced to acquire a pluripotent stem cell (iPSC) state through the forced expression of key transcription factors, and in the mouse these cells can fulfil the strictest of all developmental assays for pluripotent cells by generating completely iPSC-derived embryos and mice. However, it is not known whether there are additional classes of pluripotent cells, or what the spectrum of reprogrammed phenotypes encompasses. Here we explore alternative outcomes of somatic reprogramming by fully characterizing reprogrammed cells independent of preconceived definitions of iPSC states. We demonstrate that by maintaining elevated reprogramming factor expression levels, mouse embryonic fibroblasts go through unique epigenetic modifications to arrive at a stable, Nanog-positive, alternative pluripotent state. In doing so, we prove that the pluripotent spectrum can encompass multiple, unique cell states.


Assuntos
Reprogramação Celular/genética , Reprogramação Celular/fisiologia , Epigênese Genética , Células-Tronco Pluripotentes Induzidas/citologia , Células-Tronco Pluripotentes Induzidas/metabolismo , Animais , Células-Tronco Embrionárias/citologia , Células-Tronco Embrionárias/metabolismo , Feminino , Fibroblastos/classificação , Fibroblastos/citologia , Fibroblastos/metabolismo , Histona Desacetilases/metabolismo , Células-Tronco Pluripotentes Induzidas/classificação , Camundongos , Camundongos Nus , Fatores de Transcrição/genética , Fatores de Transcrição/metabolismo , Transgenes/genética
9.
Nature ; 516(7530): 198-206, 2014 Dec 11.
Artigo em Inglês | MEDLINE | ID: mdl-25503233

RESUMO

Somatic cell reprogramming to a pluripotent state continues to challenge many of our assumptions about cellular specification, and despite major efforts, we lack a complete molecular characterization of the reprograming process. To address this gap in knowledge, we generated extensive transcriptomic, epigenomic and proteomic data sets describing the reprogramming routes leading from mouse embryonic fibroblasts to induced pluripotency. Through integrative analysis, we reveal that cells transition through distinct gene expression and epigenetic signatures and bifurcate towards reprogramming transgene-dependent and -independent stable pluripotent states. Early transcriptional events, driven by high levels of reprogramming transcription factor expression, are associated with widespread loss of histone H3 lysine 27 (H3K27me3) trimethylation, representing a general opening of the chromatin state. Maintenance of high transgene levels leads to re-acquisition of H3K27me3 and a stable pluripotent state that is alternative to the embryonic stem cell (ESC)-like fate. Lowering transgene levels at an intermediate phase, however, guides the process to the acquisition of ESC-like chromatin and DNA methylation signature. Our data provide a comprehensive molecular description of the reprogramming routes and is accessible through the Project Grandiose portal at http://www.stemformatics.org.


Assuntos
Reprogramação Celular/genética , Genoma/genética , Células-Tronco Pluripotentes Induzidas/citologia , Células-Tronco Pluripotentes Induzidas/metabolismo , Animais , Cromatina/química , Cromatina/genética , Cromatina/metabolismo , Montagem e Desmontagem da Cromatina , Metilação de DNA , Células-Tronco Embrionárias/citologia , Células-Tronco Embrionárias/metabolismo , Epistasia Genética/genética , Fibroblastos/citologia , Fibroblastos/metabolismo , Histonas/química , Histonas/metabolismo , Internet , Camundongos , Proteoma/genética , Proteômica , RNA Longo não Codificante/genética , Fatores de Transcrição/genética , Fatores de Transcrição/metabolismo , Transcrição Gênica/genética , Transcriptoma/genética , Transgenes/genética
10.
PLoS Genet ; 13(3): e1006641, 2017 03.
Artigo em Inglês | MEDLINE | ID: mdl-28263993

RESUMO

The FANTOM5 consortium utilised cap analysis of gene expression (CAGE) to provide an unprecedented insight into transcriptional regulation in human cells and tissues. In the current study, we have used CAGE-based transcriptional profiling on an extended dense time course of the response of human monocyte-derived macrophages grown in macrophage colony-stimulating factor (CSF1) to bacterial lipopolysaccharide (LPS). We propose that this system provides a model for the differentiation and adaptation of monocytes entering the intestinal lamina propria. The response to LPS is shown to be a cascade of successive waves of transient gene expression extending over at least 48 hours, with hundreds of positive and negative regulatory loops. Promoter analysis using motif activity response analysis (MARA) identified some of the transcription factors likely to be responsible for the temporal profile of transcriptional activation. Each LPS-inducible locus was associated with multiple inducible enhancers, and in each case, transient eRNA transcription at multiple sites detected by CAGE preceded the appearance of promoter-associated transcripts. LPS-inducible long non-coding RNAs were commonly associated with clusters of inducible enhancers. We used these data to re-examine the hundreds of loci associated with susceptibility to inflammatory bowel disease (IBD) in genome-wide association studies. Loci associated with IBD were strongly and specifically (relative to rheumatoid arthritis and unrelated traits) enriched for promoters that were regulated in monocyte differentiation or activation. Amongst previously-identified IBD susceptibility loci, the vast majority contained at least one promoter that was regulated in CSF1-dependent monocyte-macrophage transitions and/or in response to LPS. On this basis, we concluded that IBD loci are strongly-enriched for monocyte-specific genes, and identified at least 134 additional candidate genes associated with IBD susceptibility from reanalysis of published GWA studies. We propose that dysregulation of monocyte adaptation to the environment of the gastrointestinal mucosa is the key process leading to inflammatory bowel disease.


Assuntos
Doenças Inflamatórias Intestinais/genética , Macrófagos/citologia , Monócitos/citologia , Transcriptoma , Motivos de Aminoácidos , Diferenciação Celular , Citocinas/metabolismo , Regulação da Expressão Gênica , Predisposição Genética para Doença , Estudo de Associação Genômica Ampla , Genômica , Humanos , Inflamação , Doenças Inflamatórias Intestinais/etiologia , Mucosa Intestinal/metabolismo , Ligantes , Lipopolissacarídeos/farmacologia , Fator Estimulador de Colônias de Macrófagos/farmacologia , Família Multigênica , Regiões Promotoras Genéticas , Fatores de Tempo , Fatores de Transcrição/metabolismo , Transcrição Gênica , Ativação Transcricional
11.
PLoS Comput Biol ; 14(3): e1005934, 2018 03.
Artigo em Inglês | MEDLINE | ID: mdl-29494619

RESUMO

Genetic variants underlying complex traits, including disease susceptibility, are enriched within the transcriptional regulatory elements, promoters and enhancers. There is emerging evidence that regulatory elements associated with particular traits or diseases share similar patterns of transcriptional activity. Accordingly, shared transcriptional activity (coexpression) may help prioritise loci associated with a given trait, and help to identify underlying biological processes. Using cap analysis of gene expression (CAGE) profiles of promoter- and enhancer-derived RNAs across 1824 human samples, we have analysed coexpression of RNAs originating from trait-associated regulatory regions using a novel quantitative method (network density analysis; NDA). For most traits studied, phenotype-associated variants in regulatory regions were linked to tightly-coexpressed networks that are likely to share important functional characteristics. Coexpression provides a new signal, independent of phenotype association, to enable fine mapping of causative variants. The NDA coexpression approach identifies new genetic variants associated with specific traits, including an association between the regulation of the OCT1 cation transporter and genetic variants underlying circulating cholesterol levels. NDA strongly implicates particular cell types and tissues in disease pathogenesis. For example, distinct groupings of disease-associated regulatory regions implicate two distinct biological processes in the pathogenesis of ulcerative colitis; a further two separate processes are implicated in Crohn's disease. Thus, our functional analysis of genetic predisposition to disease defines new distinct disease endotypes. We predict that patients with a preponderance of susceptibility variants in each group are likely to respond differently to pharmacological therapy. Together, these findings enable a deeper biological understanding of the causal basis of complex traits.


Assuntos
Predisposição Genética para Doença/genética , Genômica/métodos , Regiões Promotoras Genéticas/genética , Doença de Crohn/genética , Bases de Dados Genéticas , Perfilação da Expressão Gênica , Humanos , Transcriptoma/genética
12.
Differentiation ; 104: 42-49, 2018.
Artigo em Inglês | MEDLINE | ID: mdl-30453197

RESUMO

Macrophages are phagocytic immune cells resident in every tissue that are not only important for host defence, but are also involved in tissue homeostasis, injury, and disease. Despite increasingly sophisticated methods for in vitro macrophage isolation, expansion and activation over the past three decades, these have largely been restricted to modelling bone-marrow or blood-derived cells. The in vitro derivation of macrophages from human pluripotent stem cells provides new opportunities to study macrophage biology, including the factors that impact human myeloid development and those that induce macrophage activation. While sharing many of the functional characteristics of monocyte-derived macrophages, stem cell-derived macrophages may offer new opportunities to understand the role of development or tissue context in innate immune cell function. Immune responsiveness to pathogenic challenge is known to be impacted by a macrophage's history of prior exposure, as well as ontogeny and tissue context. Therefore, we explore the factors of in vitro derivation likely to influence macrophage phenotype and function.


Assuntos
Diferenciação Celular/genética , Células-Tronco Pluripotentes Induzidas/citologia , Macrófagos/citologia , Células-Tronco Pluripotentes/citologia , Humanos , Imunidade Inata/genética , Células-Tronco Pluripotentes Induzidas/imunologia , Macrófagos/imunologia , Monócitos/citologia , Monócitos/imunologia , Células-Tronco Pluripotentes/imunologia
13.
J Immunol ; 196(7): 3148-58, 2016 Apr 01.
Artigo em Inglês | MEDLINE | ID: mdl-26921309

RESUMO

Uveitis, which occurs in association with systemic immunological diseases, presents a considerable medical challenge because of incomplete understanding of its pathogenesis. The signals that initiate T cells to target the eye, which may be of infectious or noninfectious origin, are poorly understood. Experimental autoimmune uveoretinitis (EAU) develops in mice immunized with the endogenous retinal protein interphotoreceptor retinoid binding protein in the presence of the adjuvant CFA. EAU manifests as posterior ocular inflammation consisting of vasculitis, granulomas, retinal damage, and invasion of self-reactive T cells, which are key clinical features of human uveitis. Our studies uncover Card9 as a critical genetic determinant for EAU. Card9 was responsible for Th17 polarization and Th17-associated Ag-specific responses, but not Th1-associated responses. Nonetheless, Card9 expression was essential for accumulation of both lineages within the eye. Consistent with its recently identified role as an intracellular signaling mediator for C-type lectin receptors (CLRs), a Card9-dependent transcriptional response in the neuroretina was observed involving genes encoding the CLRs Dectin-1, Dectin-2, and Mincle. Genetic deletion of these individual CLRs revealed an essential role for Mincle. Mincle activation was sufficient to generate the EAU phenotype, and this required activation of both Syk and Card9. In contrast, Dectin-1 contributed minimally and a possible repressive role was shown for Dectin-2. These findings extend our understanding of CLRs in autoimmune uveitis. The newly identified role of Mincle and Syk/Card9-coupled signaling axis in autoimmune uveitis could provide novel targets for treatment of patients with ocular inflammatory disease.


Assuntos
Doenças Autoimunes/imunologia , Doenças Autoimunes/metabolismo , Proteínas Adaptadoras de Sinalização CARD/metabolismo , Peptídeos e Proteínas de Sinalização Intracelular/metabolismo , Lectinas Tipo C/metabolismo , Proteínas de Membrana/metabolismo , Proteínas Tirosina Quinases/metabolismo , Transdução de Sinais , Uveíte/imunologia , Uveíte/metabolismo , Animais , Doenças Autoimunes/diagnóstico , Doenças Autoimunes/genética , Proteínas Adaptadoras de Sinalização CARD/genética , Proteína Semelhante a Receptor de Calcitonina/genética , Proteína Semelhante a Receptor de Calcitonina/metabolismo , Modelos Animais de Doenças , Proteínas do Olho/metabolismo , Expressão Gênica , Perfilação da Expressão Gênica , Lectinas Tipo C/genética , Proteínas de Membrana/genética , Camundongos , Camundongos Knockout , Retina/imunologia , Retina/metabolismo , Retina/patologia , Proteínas de Ligação ao Retinol/metabolismo , Quinase Syk , Subpopulações de Linfócitos T/imunologia , Subpopulações de Linfócitos T/metabolismo , Células Th17/imunologia , Células Th17/metabolismo , Transcriptoma , Uveíte/diagnóstico , Uveíte/genética
17.
Genomics ; 103(4): 239-51, 2014 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-24667244

RESUMO

Gene expression databases contain invaluable information about a range of cell states, but the question "Where is my gene of interest expressed?" remains one of the most difficult to systematically assess when relevant data is derived on different platforms. Barriers to integrating this data include disparities in data formats and scale, a lack of common identifiers, and the disproportionate contribution of a platform to the 'batch effect'. There are few purpose-built cross-platform normalization strategies, and most of these fit data to an idealized data structure, which in turn may compromise gene expression comparisons between different platforms. YuGene addresses this gap by providing a simple transform that assigns a modified cumulative proportion value to each measurement, without losing essential underlying information on data distributions or experimental correlates. The Yugene transform is applied to individual samples and is suitable to apply to data with different distributions. Yugene is robust to combining datasets of different sizes, does not require global renormalization as new data is added, and does not require a common identifier. YuGene was benchmarked against commonly used normalization approaches, performing favorably in comparison to quantile (RMA), Z-score or rank methods. Implementation in the www.stemformatics.org resource provides users with expression queries across stem cell related datasets. Probe performance statistics including poorly performing (never expressed) probes, and examples of probes/genes expressed in a sample-restricted manner are provided. The YuGene software is implemented as an R package available from CRAN.


Assuntos
Bases de Dados Genéticas , Perfilação da Expressão Gênica/métodos , Software , Biologia Computacional/métodos , Humanos , Internet , Análise de Sequência com Séries de Oligonucleotídeos , Células-Tronco
18.
Nat Genet ; 38(6): 626-35, 2006 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-16645617

RESUMO

Mammalian promoters can be separated into two classes, conserved TATA box-enriched promoters, which initiate at a well-defined site, and more plastic, broad and evolvable CpG-rich promoters. We have sequenced tags corresponding to several hundred thousand transcription start sites (TSSs) in the mouse and human genomes, allowing precise analysis of the sequence architecture and evolution of distinct promoter classes. Different tissues and families of genes differentially use distinct types of promoters. Our tagging methods allow quantitative analysis of promoter usage in different tissues and show that differentially regulated alternative TSSs are a common feature in protein-coding genes and commonly generate alternative N termini. Among the TSSs, we identified new start sites associated with the majority of exons and with 3' UTRs. These data permit genome-scale identification of tissue-specific promoters and analysis of the cis-acting elements associated with them.


Assuntos
Evolução Molecular , Regiões Promotoras Genéticas , Regiões 3' não Traduzidas , Animais , Sequência de Bases , DNA , Genoma , Proteoma , TATA Box
19.
Stem Cells ; 31(8): 1498-510, 2013 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-23728894

RESUMO

Human induced pluripotent stem cells (hiPSC) have the potential to generate healthy cells and tissues for the study and medical treatment of a large number of diseases. The utility of putative hiPSC-based therapies is constrained by a lack of robust quality-control assays that address the stability of the cells or their capacity to form teratomas after differentiation. Here we report that virally derived hiPSC, but not human embryonic stem cells (hESC) or hiPSC derived using episomal nonintegrating vectors, exhibit a propensity to revert to a pluripotent phenotype following differentiation. This instability was revealed using our published method to identify pluripotent cells undergoing very early-stage differentiation in standard hESC cultures, by fluorescence activated cell sorting (FACS) based on expression of the cell surface markers TG30 (CD9) and GCTM-2. Differentiated cells cultured post-FACS fractionation from virally derived hiPSC lines reacquired immunoreactivity to TG30 (CD9) and GCTM-2, formed stem cell-like colonies, and re-expressed canonical pluripotency markers. Furthermore, differentiated cells from pluripotency-reverting hiPSC lines generated teratomas in immunocompromised mice, raising concerns about their safety in downstream applications. In contrast, differentiated cell populations from hESC and episomally derived hiPSC did not show any of these abnormalities. Our assays may be used to identify "unsafe" hiPSC cell lines and this information should be considered when selecting hiPSC lines for clinical use and indicate that experiments using these "unsafe" hiPSC lines should be interpreted carefully.


Assuntos
Células-Tronco Pluripotentes Induzidas/citologia , Animais , Diferenciação Celular/fisiologia , Linhagem Celular , Citometria de Fluxo , Humanos , Células-Tronco Pluripotentes Induzidas/metabolismo , Células-Tronco Pluripotentes Induzidas/fisiologia , Camundongos , Camundongos Knockout , Fator 3 de Transcrição de Octâmero/metabolismo , Teratoma/patologia , Transcriptoma
20.
PLoS Genet ; 7(8): e1002207, 2011 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-21852951

RESUMO

Gene expression analysis has become a ubiquitous tool for studying a wide range of human diseases. In a typical analysis we compare distinct phenotypic groups and attempt to identify genes that are, on average, significantly different between them. Here we describe an innovative approach to the analysis of gene expression data, one that identifies differences in expression variance between groups as an informative metric of the group phenotype. We find that genes with different expression variance profiles are not randomly distributed across cell signaling networks. Genes with low-expression variance, or higher constraint, are significantly more connected to other network members and tend to function as core members of signal transduction pathways. Genes with higher expression variance have fewer network connections and also tend to sit on the periphery of the cell. Using neural stem cells derived from patients suffering from Schizophrenia (SZ), Parkinson's disease (PD), and a healthy control group, we find marked differences in expression variance in cell signaling pathways that shed new light on potential mechanisms associated with these diverse neurological disorders. In particular, we find that expression variance of core networks in the SZ patient group was considerably constrained, while in contrast the PD patient group demonstrated much greater variance than expected. One hypothesis is that diminished variance in SZ patients corresponds to an increased degree of constraint in these pathways and a corresponding reduction in robustness of the stem cell networks. These results underscore the role that variation plays in biological systems and suggest that analysis of expression variance is far more important in disease than previously recognized. Furthermore, modeling patterns of variability in gene expression could fundamentally alter the way in which we think about how cellular networks are affected by disease processes.


Assuntos
Perfilação da Expressão Gênica , Variação Genética , Doença de Parkinson/genética , Esquizofrenia/genética , Transdução de Sinais/genética , Análise de Variância , Estudos de Casos e Controles , Fibroblastos/metabolismo , Estudo de Associação Genômica Ampla , Humanos , Células-Tronco Neurais/metabolismo , Doença de Parkinson/patologia , Mapas de Interação de Proteínas/genética , Esquizofrenia/patologia
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA