Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 13 de 13
Filtrar
Más filtros

Banco de datos
Tipo del documento
Intervalo de año de publicación
1.
Bioinformatics ; 35(14): i568-i576, 2019 07 15.
Artículo en Inglés | MEDLINE | ID: mdl-31510680

RESUMEN

MOTIVATION: Late onset Alzheimer's disease is currently a disease with no known effective treatment options. To better understand disease, new multi-omic data-sets have recently been generated with the goal of identifying molecular causes of disease. However, most analytic studies using these datasets focus on uni-modal analysis of the data. Here, we propose a data driven approach to integrate multiple data types and analytic outcomes to aggregate evidences to support the hypothesis that a gene is a genetic driver of the disease. The main algorithmic contributions of our article are: (i) a general machine learning framework to learn the key characteristics of a few known driver genes from multiple feature sets and identifying other potential driver genes which have similar feature representations, and (ii) A flexible ranking scheme with the ability to integrate external validation in the form of Genome Wide Association Study summary statistics. While we currently focus on demonstrating the effectiveness of the approach using different analytic outcomes from RNA-Seq studies, this method is easily generalizable to other data modalities and analysis types. RESULTS: We demonstrate the utility of our machine learning algorithm on two benchmark multiview datasets by significantly outperforming the baseline approaches in predicting missing labels. We then use the algorithm to predict and rank potential drivers of Alzheimer's. We show that our ranked genes show a significant enrichment for single nucleotide polymorphisms associated with Alzheimer's and are enriched in pathways that have been previously associated with the disease. AVAILABILITY AND IMPLEMENTATION: Source code and link to all feature sets is available at https://github.com/Sage-Bionetworks/EvidenceAggregatedDriverRanking.


Asunto(s)
Algoritmos , Enfermedad de Alzheimer , Estudio de Asociación del Genoma Completo , Enfermedad de Alzheimer/genética , Humanos , Aprendizaje Automático , Programas Informáticos
2.
Genome Res ; 22(4): 681-92, 2012 Apr.
Artículo en Inglés | MEDLINE | ID: mdl-22287102

RESUMEN

Although retroviruses are relatively promiscuous in choice of integration sites, retrotransposons can display marked integration specificity. In yeast and slime mold, some retrotransposons are associated with tRNA genes (tDNAs). In the Saccharomyces cerevisiae genome, the long terminal repeat retrotransposon Ty3 is found at RNA polymerase III (Pol III) transcription start sites of tDNAs. Ty1, 2, and 4 elements also cluster in the upstream regions of these genes. To determine the extent to which other Pol III-transcribed genes serve as genomic targets for Ty3, a set of 10,000 Ty3 genomic retrotranspositions were mapped using high-throughput DNA sequencing. Integrations occurred at all known tDNAs, two tDNA relics (iYGR033c and ZOD1), and six non-tDNA, Pol III-transcribed types of genes (RDN5, SNR6, SNR52, RPR1, RNA170, and SCR1). Previous work in vitro demonstrated that the Pol III transcription factor (TF) IIIB is important for Ty3 targeting. However, seven loci that bind the TFIIIB loader, TFIIIC, were not targeted, underscoring the unexplained absence of TFIIIB at those sites. Ty3 integrations also occurred in two open reading frames not previously associated with Pol III transcription, suggesting the existence of a small number of additional sites in the yeast genome that interact with Pol III transcription complexes.


Asunto(s)
ADN Polimerasa III/genética , Mutagénesis Insercional , Retroelementos/genética , Proteínas de Saccharomyces cerevisiae/genética , Saccharomyces cerevisiae/genética , Secuencia de Bases , Sitios de Unión/genética , ADN Polimerasa III/metabolismo , Perfilación de la Expresión Génica , Regulación Fúngica de la Expresión Génica , Genoma Fúngico/genética , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Modelos Genéticos , Análisis de Secuencia por Matrices de Oligonucleótidos , Recombinación Genética , Saccharomyces cerevisiae/metabolismo , Proteínas de Saccharomyces cerevisiae/metabolismo , Homología de Secuencia de Ácido Nucleico , Factor de Transcripción TFIIIB/genética , Factor de Transcripción TFIIIB/metabolismo , Sitio de Iniciación de la Transcripción , Transcripción Genética
3.
PLoS Genet ; 8(7): e1002829, 2012.
Artículo en Inglés | MEDLINE | ID: mdl-22829784

RESUMEN

The antagonistic actions of Polycomb and Trithorax are responsible for proper cell fate determination in mammalian tissues. In the epidermis, a self-renewing epithelium, previous work has shown that release from Polycomb repression only partially explains differentiation gene activation. We now show that Trithorax is also a key regulator of epidermal differentiation, not only through activation of genes repressed by Polycomb in progenitor cells, but also through activation of genes independent of regulation by Polycomb. The differentiation associated transcription factor GRHL3/GET1 recruits the ubiquitously expressed Trithorax complex to a subset of differentiation genes.


Asunto(s)
Diferenciación Celular/genética , Proteínas de Unión al ADN , Células Epiteliales , N-Metiltransferasa de Histona-Lisina , Proteínas de Neoplasias , Factores de Transcripción , Calcio/farmacología , Diferenciación Celular/efectos de los fármacos , Metilación de ADN , Proteínas de Unión al ADN/genética , Proteínas de Unión al ADN/metabolismo , Células Epidérmicas , Epidermis/metabolismo , Células Epiteliales/citología , Células Epiteliales/metabolismo , Regulación del Desarrollo de la Expresión Génica , N-Metiltransferasa de Histona-Lisina/genética , N-Metiltransferasa de Histona-Lisina/metabolismo , Humanos , Péptidos y Proteínas de Señalización Intracelular , Queratinocitos/citología , Queratinocitos/metabolismo , Proteína de la Leucemia Mieloide-Linfoide/genética , Proteína de la Leucemia Mieloide-Linfoide/metabolismo , Proteínas de Neoplasias/genética , Proteínas de Neoplasias/metabolismo , Proteínas del Grupo Polycomb/genética , Proteínas del Grupo Polycomb/metabolismo , Regiones Promotoras Genéticas , ARN Interferente Pequeño , Células Madre/citología , Células Madre/metabolismo , Factores de Transcripción/genética , Factores de Transcripción/metabolismo , Transglutaminasas/genética , Transglutaminasas/metabolismo
5.
Cancers (Basel) ; 16(4)2024 Feb 15.
Artículo en Inglés | MEDLINE | ID: mdl-38398178

RESUMEN

Merkel cell carcinoma (MCC) and small cell lung cancer (SCLC) can be histologically similar. Immunohistochemistry (IHC) for cytokeratin 20 (CK20) and thyroid transcription factor 1 (TTF-1) are commonly used to differentiate MCC from SCLC; however, these markers have limited sensitivity and specificity. To identify new diagnostic markers, we performed differential gene expression analysis on transcriptome data from MCC and SCLC tumors. Candidate markers included atonal BHLH transcription factor 1 (ATOH1) and transcription factor AP-2ß (TFAP2B) for MCC, as well as carcinoembryonic antigen cell adhesion molecule 6 (CEACAM6) for SCLC. Immunostaining for CK20, TTF-1, and new candidate markers was performed on 43 MCC and 59 SCLC samples. All three MCC markers were sensitive and specific, with CK20 and ATOH1 staining 43/43 (100%) MCC and 0/59 (0%) SCLC cases and TFAP2B staining 40/43 (93%) MCC and 0/59 (0%) SCLC cases. TTF-1 stained 47/59 (80%) SCLC and 1/43 (2%) MCC cases. CEACAM6 stained 49/59 (83%) SCLC and 0/43 (0%) MCC cases. Combining CEACAM6 and TTF-1 increased SCLC detection sensitivity to 93% and specificity to 98%. These data suggest that ATOH1, TFAP2B, and CEACAM6 should be explored as markers to differentiate MCC and SCLC.

6.
Commun Biol ; 5(1): 1066, 2022 10 07.
Artículo en Inglés | MEDLINE | ID: mdl-36207580

RESUMEN

The phenotype of a cell and its underlying molecular state is strongly influenced by extracellular signals, including growth factors, hormones, and extracellular matrix proteins. While these signals are normally tightly controlled, their dysregulation leads to phenotypic and molecular states associated with diverse diseases. To develop a detailed understanding of the linkage between molecular and phenotypic changes, we generated a comprehensive dataset that catalogs the transcriptional, proteomic, epigenomic and phenotypic responses of MCF10A mammary epithelial cells after exposure to the ligands EGF, HGF, OSM, IFNG, TGFB and BMP2. Systematic assessment of the molecular and cellular phenotypes induced by these ligands comprise the LINCS Microenvironment (ME) perturbation dataset, which has been curated and made publicly available for community-wide analysis and development of novel computational methods ( synapse.org/LINCS_MCF10A ). In illustrative analyses, we demonstrate how this dataset can be used to discover functionally related molecular features linked to specific cellular phenotypes. Beyond these analyses, this dataset will serve as a resource for the broader scientific community to mine for biological insights, to compare signals carried across distinct molecular modalities, and to develop new computational methods for integrative data analysis.


Asunto(s)
Factor de Crecimiento Epidérmico , Proteómica , Factor de Crecimiento Epidérmico/farmacología , Proteínas de la Matriz Extracelular , Ligandos , Fenotipo
7.
BMC Bioinformatics ; 12: 495, 2011 Dec 30.
Artículo en Inglés | MEDLINE | ID: mdl-22208852

RESUMEN

BACKGROUND: A central challenge of biology is to map and understand gene regulation on a genome-wide scale. For any given genome, only a small fraction of the regulatory elements embedded in the DNA sequence have been characterized, and there is great interest in developing computational methods to systematically map all these elements and understand their relationships. Such computational efforts, however, are significantly hindered by the overwhelming size of non-coding regions and the statistical variability and complex spatial organizations of regulatory elements and interactions. Genome-wide catalogs of regulatory elements for all model species simply do not yet exist. RESULTS: The MotifMap system uses databases of transcription factor binding motifs, refined genome alignments, and a comparative genomic statistical approach to provide comprehensive maps of candidate regulatory elements encoded in the genomes of model species. The system is used to derive new genome-wide maps for yeast, fly, worm, mouse, and human. The human map contains 519,108 sites for 570 matrices with a False Discovery Rate of 0.1 or less. The new maps are assessed in several ways, for instance using high-throughput experimental ChIP-seq data and AUC statistics, providing strong evidence for their accuracy and coverage. The maps can be usefully integrated with many other kinds of omic data and are available at http://motifmap.igb.uci.edu/. CONCLUSIONS: MotifMap and its integration with other data provide a foundation for analyzing gene regulation on a genome-wide scale, and for automatically generating regulatory pathways and hypotheses. The power of this approach is demonstrated and discussed using the P53 apoptotic pathway and the Gli hedgehog pathways as examples.


Asunto(s)
Genómica/métodos , Modelos Animales , Motivos de Nucleótidos , Secuencias Reguladoras de Ácidos Nucleicos , Algoritmos , Animales , Regulación de la Expresión Génica , Genoma Humano , Estudio de Asociación del Genoma Completo , Proteínas Hedgehog/metabolismo , Humanos , Ratones , Transducción de Señal , Proteína p53 Supresora de Tumor/metabolismo
8.
Genome Biol ; 22(1): 92, 2021 03 29.
Artículo en Inglés | MEDLINE | ID: mdl-33781308

RESUMEN

BACKGROUND: Post-zygotic mutations incurred during DNA replication, DNA repair, and other cellular processes lead to somatic mosaicism. Somatic mosaicism is an established cause of various diseases, including cancers. However, detecting mosaic variants in DNA from non-cancerous somatic tissues poses significant challenges, particularly if the variants only are present in a small fraction of cells. RESULTS: Here, the Brain Somatic Mosaicism Network conducts a coordinated, multi-institutional study to examine the ability of existing methods to detect simulated somatic single-nucleotide variants (SNVs) in DNA mixing experiments, generate multiple replicates of whole-genome sequencing data from the dorsolateral prefrontal cortex, other brain regions, dura mater, and dural fibroblasts of a single neurotypical individual, devise strategies to discover somatic SNVs, and apply various approaches to validate somatic SNVs. These efforts lead to the identification of 43 bona fide somatic SNVs that range in variant allele fractions from ~ 0.005 to ~ 0.28. Guided by these results, we devise best practices for calling mosaic SNVs from 250× whole-genome sequencing data in the accessible portion of the human genome that achieve 90% specificity and sensitivity. Finally, we demonstrate that analysis of multiple bulk DNA samples from a single individual allows the reconstruction of early developmental cell lineage trees. CONCLUSIONS: This study provides a unified set of best practices to detect somatic SNVs in non-cancerous tissues. The data and methods are freely available to the scientific community and should serve as a guide to assess the contributions of somatic SNVs to neuropsychiatric diseases.


Asunto(s)
Encéfalo/metabolismo , Estudios de Asociación Genética , Variación Genética , Alelos , Mapeo Cromosómico , Biología Computacional/métodos , Estudios de Asociación Genética/métodos , Genómica/métodos , Células Germinativas/metabolismo , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Especificidad de Órganos/genética , Polimorfismo de Nucleótido Simple
9.
Sci Data ; 4: 170030, 2017 03 28.
Artículo en Inglés | MEDLINE | ID: mdl-28350385

RESUMEN

The use of induced pluripotent stem cells (iPSC) derived from independent patients and sources holds considerable promise to improve the understanding of development and disease. However, optimized use of iPSC depends on our ability to develop methods to efficiently qualify cell lines and protocols, monitor genetic stability, and evaluate self-renewal and differentiation potential. To accomplish these goals, 57 stem cell lines from 10 laboratories were differentiated to 7 different states, resulting in 248 analyzed samples. Cell lines were differentiated and characterized at a central laboratory using standardized cell culture methodologies, protocols, and metadata descriptors. Stem cell and derived differentiated lines were characterized using RNA-seq, miRNA-seq, copy number arrays, DNA methylation arrays, flow cytometry, and molecular histology. All materials, including raw data, metadata, analysis and processing code, and methodological and provenance documentation are publicly available for re-use and interactive exploration at https://www.synapse.org/pcbc. The goal is to provide data that can improve our ability to robustly and reproducibly use human pluripotent stem cells to understand development and disease.


Asunto(s)
Células Madre Pluripotentes Inducidas , Células Madre Pluripotentes , Animales , Técnicas de Cultivo de Célula , Humanos
10.
Stem Cell Reports ; 7(1): 110-25, 2016 07 12.
Artículo en Inglés | MEDLINE | ID: mdl-27293150

RESUMEN

The rigorous characterization of distinct induced pluripotent stem cells (iPSC) derived from multiple reprogramming technologies, somatic sources, and donors is required to understand potential sources of variability and downstream potential. To achieve this goal, the Progenitor Cell Biology Consortium performed comprehensive experimental and genomic analyses of 58 iPSC from ten laboratories generated using a variety of reprogramming genes, vectors, and cells. Associated global molecular characterization studies identified functionally informative correlations in gene expression, DNA methylation, and/or copy-number variation among key developmental and oncogenic regulators as a result of donor, sex, line stability, reprogramming technology, and cell of origin. Furthermore, X-chromosome inactivation in PSC produced highly correlated differences in teratoma-lineage staining and regulator expression upon differentiation. All experimental results, and raw, processed, and metadata from these analyses, including powerful tools, are interactively accessible from a new online portal at https://www.synapse.org to serve as a reusable resource for the stem cell community.


Asunto(s)
Diferenciación Celular/genética , Metilación de ADN/genética , Genoma Humano , Células Madre Pluripotentes Inducidas , Reprogramación Celular , Expresión Génica/genética , Genómica , Humanos , Células Madre/metabolismo
11.
Oncotarget ; 6(28): 26472-82, 2015 Sep 22.
Artículo en Inglés | MEDLINE | ID: mdl-26299616

RESUMEN

BACKGROUND: Merkel cell carcinoma (MCC) is a rare, aggressive neuroendocrine skin cancer. Although used to monitor MCC patients, the clinical utility of neuron-specific enolase (NSE) and chromogranin A (ChrA) blood levels is untested. EpCAM-positive circulating tumor cells (CTC) reflect disease status in several epithelial tumors. Here we investigate the use of NSE and ChrA blood levels and CTC counts as biomarkers for MCC disease behavior. METHODS: NSE and ChrA blood levels from 60 patients with MCC were retrospectively analyzed; 30 patients were additionally screened for CTC. Biomarker values were correlated to clinical parameters. RESULTS: Despite routine use by some physicians, NSE and ChrA blood levels did not correlate with progression free survival, disease specific survival, or MCC recurrence. We found CTC in 97% of tested MCC patients. CTC counts were elevated in patients with active disease, suggesting their potential use in monitoring MCC. CONCLUSIONS: NSE and ChrA levels were not effective in predicting outcomes or detecting recurrences of MCC. In contrast, CTC counts have potential utility as a biomarker for MCC disease behavior.


Asunto(s)
Biomarcadores de Tumor/sangre , Carcinoma de Células de Merkel/sangre , Carcinoma de Células de Merkel/patología , Cromogranina A/sangre , Células Neoplásicas Circulantes/patología , Fosfopiruvato Hidratasa/sangre , Neoplasias Cutáneas/sangre , Neoplasias Cutáneas/patología , Adulto , Anciano , Anciano de 80 o más Años , Carcinoma de Células de Merkel/mortalidad , Carcinoma de Células de Merkel/terapia , Recuento de Células , Supervivencia sin Enfermedad , Femenino , Humanos , Estimación de Kaplan-Meier , Masculino , Persona de Mediana Edad , Recurrencia Local de Neoplasia , Valor Predictivo de las Pruebas , Reproducibilidad de los Resultados , Estudios Retrospectivos , Factores de Riesgo , Neoplasias Cutáneas/mortalidad , Neoplasias Cutáneas/terapia , Regulación hacia Arriba
12.
J Invest Dermatol ; 135(4): 1138-1146, 2015 Apr.
Artículo en Inglés | MEDLINE | ID: mdl-25521454

RESUMEN

When using cell lines to study cancer, phenotypic similarity to the original tumor is paramount. Yet, little has been done to characterize how closely Merkel cell carcinoma (MCC) cell lines model native tumors. To determine their similarity to MCC tumor samples, we characterized MCC cell lines via gene expression microarrays. Using whole transcriptome gene expression signatures and a computational bioinformatic approach, we identified significant differences between variant cell lines (UISO, MCC13, and MCC26) and fresh frozen MCC tumors. Conversely, the classic WaGa and Mkl-1 cell lines more closely represented the global transcriptome of MCC tumors. When compared with publicly available cancer lines, WaGa and Mkl-1 cells were similar to other neuroendocrine tumors, but the variant cell lines were not. WaGa and Mkl-1 cells grown as xenografts in mice had histological and immunophenotypical features consistent with MCC, whereas UISO xenograft tumors were atypical for MCC. Spectral karyotyping and short tandem repeat analysis of the UISO cells matched the original cell line's description, ruling out contamination. Our results validate the use of transcriptome analysis to assess the cancer cell line representativeness and indicate that UISO, MCC13, and MCC26 cell lines are not representative of MCC tumors, whereas WaGa and Mkl-1 more closely model MCC.


Asunto(s)
Carcinoma de Células de Merkel/metabolismo , Regulación Neoplásica de la Expresión Génica , Animales , Carcinoma de Células de Merkel/genética , Línea Celular Tumoral , Análisis por Conglomerados , Femenino , Perfilación de la Expresión Génica , Humanos , Inmunohistoquímica , Cariotipificación , Neoplasias Pulmonares/genética , Neoplasias Pulmonares/metabolismo , Ratones , Ratones Endogámicos NOD , Repeticiones de Microsatélite , Trasplante de Neoplasias , Análisis de Secuencia por Matrices de Oligonucleótidos , Fenotipo , Infecciones por Polyomavirus/genética , Infecciones por Polyomavirus/metabolismo , Neoplasias Cutáneas/genética , Neoplasias Cutáneas/metabolismo , Transcriptoma
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA