Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 41
Filtrar
Mais filtros

Base de dados
País/Região como assunto
Tipo de documento
País de afiliação
Intervalo de ano de publicação
1.
Cell ; 150(5): 1068-81, 2012 Aug 31.
Artigo em Inglês | MEDLINE | ID: mdl-22939629

RESUMO

Cellular processes often depend on stable physical associations between proteins. Despite recent progress, knowledge of the composition of human protein complexes remains limited. To close this gap, we applied an integrative global proteomic profiling approach, based on chromatographic separation of cultured human cell extracts into more than one thousand biochemical fractions that were subsequently analyzed by quantitative tandem mass spectrometry, to systematically identify a network of 13,993 high-confidence physical interactions among 3,006 stably associated soluble human proteins. Most of the 622 putative protein complexes we report are linked to core biological processes and encompass both candidate disease genes and unannotated proteins to inform on mechanism. Strikingly, whereas larger multiprotein assemblies tend to be more extensively annotated and evolutionarily conserved, human protein complexes with five or fewer subunits are far more likely to be functionally unannotated or restricted to vertebrates, suggesting more recent functional innovations.


Assuntos
Complexos Multiproteicos/análise , Mapas de Interação de Proteínas , Proteínas/química , Proteômica/métodos , Humanos , Espectrometria de Massas em Tandem
2.
Am J Hum Genet ; 108(8): 1359-1366, 2021 08 05.
Artigo em Inglês | MEDLINE | ID: mdl-34297908

RESUMO

DNA methylation (DNAm) signatures are unique patterns of DNAm alterations defined for rare disorders caused by pathogenic variants in epigenetic regulatory genes. The potential of DNAm signatures (also known as "episignatures") is just beginning to emerge as there are >300 known epigenetic regulatory genes, ∼100 of which are linked to neurodevelopmental disorders. To date, approximately 50 signatures have been identified, which have proven unexpectedly successful as predictive tools for classifying variants of uncertain significance as pathogenic or benign. The molecular basis of these signatures is poorly understood. Furthermore, their relationships to primary disease pathophysiology have yet to be adequately investigated, despite clear demonstrations of potential connections. There are currently no published guidelines for signature development. As signatures are highly dependent on the samples and methods used to derive them, we propose a framework for consideration in signature development including sample size, statistical parameters, cell type of origin, and the value of detailed clinical and molecular information. We illustrate the relationship between signature output/efficacy and sample size by generating and testing 837 DNAm signatures of Kleefstra syndrome using downsampling analysis. Our findings highlight that no single DNAm signature encompasses all DNAm alterations present in a rare disorder, and that a substandard study design can generate a DNAm signature that misclassifies variants. Finally, we discuss the importance of further investigating DNAm signatures to inform disease pathophysiology and broaden their scope as a functional assay.


Assuntos
Metilação de DNA , Epigênese Genética , Regulação da Expressão Gênica , Mutação , Transtornos do Neurodesenvolvimento/patologia , Animais , Humanos , Transtornos do Neurodesenvolvimento/genética
3.
Am J Hum Genet ; 106(5): 596-610, 2020 05 07.
Artigo em Inglês | MEDLINE | ID: mdl-32243864

RESUMO

Weaver syndrome (WS), an overgrowth/intellectual disability syndrome (OGID), is caused by pathogenic variants in the histone methyltransferase EZH2, which encodes a core component of the Polycomb repressive complex-2 (PRC2). Using genome-wide DNA methylation (DNAm) data for 187 individuals with OGID and 969 control subjects, we show that pathogenic variants in EZH2 generate a highly specific and sensitive DNAm signature reflecting the phenotype of WS. This signature can be used to distinguish loss-of-function from gain-of-function missense variants and to detect somatic mosaicism. We also show that the signature can accurately classify sequence variants in EED and SUZ12, which encode two other core components of PRC2, and predict the presence of pathogenic variants in undiagnosed individuals with OGID. The discovery of a functionally relevant signature with utility for diagnostic classification of sequence variants in EZH2, EED, and SUZ12 supports the emerging paradigm shift for implementation of DNAm signatures into diagnostics and translational research.


Assuntos
Anormalidades Múltiplas/genética , Hipotireoidismo Congênito/genética , Anormalidades Craniofaciais/genética , Metilação de DNA , Proteína Potenciadora do Homólogo 2 de Zeste/genética , Deformidades Congênitas da Mão/genética , Deficiência Intelectual/genética , Mutação , Complexo Repressor Polycomb 2/genética , Adolescente , Adulto , Criança , Pré-Escolar , Estudos de Coortes , Feminino , Humanos , Lactente , Masculino , Mosaicismo , Mutação de Sentido Incorreto/genética , Proteínas de Neoplasias , Reprodutibilidade dos Testes , Fatores de Transcrição , Adulto Jovem
4.
Hum Mutat ; 43(6): 800-811, 2022 06.
Artigo em Inglês | MEDLINE | ID: mdl-35181971

RESUMO

Despite recent progress in the understanding of the genetic etiologies of rare diseases (RDs), a significant number remain intractable to diagnostic and discovery efforts. Broad data collection and sharing of information among RD researchers is therefore critical. In 2018, the Care4Rare Canada Consortium launched the project C4R-SOLVE, a subaim of which was to collect, harmonize, and share both retrospective and prospective Canadian clinical and multiomic data. Here, we introduce Genomics4RD, an integrated web-accessible platform to share Canadian phenotypic and multiomic data between researchers, both within Canada and internationally, for the purpose of discovering the mechanisms that cause RDs. Genomics4RD has been designed to standardize data collection and processing, and to help users systematically collect, prioritize, and visualize participant information. Data storage, authorization, and access procedures have been developed in collaboration with policy experts and stakeholders to ensure the trusted and secure access of data by external researchers. The breadth and standardization of data offered by Genomics4RD allows researchers to compare candidate disease genes and variants between participants (i.e., matchmaking) for discovery purposes, while facilitating the development of computational approaches for multiomic data analyses and enabling clinical translation efforts for new genetic technologies in the future.


Assuntos
Doenças Raras , Canadá , Estudos de Associação Genética , Humanos , Fenótipo , Estudos Prospectivos , Doenças Raras/diagnóstico , Doenças Raras/genética , Estudos Retrospectivos
5.
Bioinformatics ; 37(19): 3144-3151, 2021 Oct 11.
Artigo em Inglês | MEDLINE | ID: mdl-33944895

RESUMO

MOTIVATION: Current fusion detection tools use diverse calling approaches and provide varying results, making selection of the appropriate tool challenging. Ensemble fusion calling techniques appear promising; however, current options have limited accessibility and function. RESULTS: MetaFusion is a flexible metacalling tool that amalgamates outputs from any number of fusion callers. Individual caller results are standardized by conversion into the new file type Common Fusion Format. Calls are annotated, merged using graph clustering, filtered and ranked to provide a final output of high-confidence candidates. MetaFusion consistently achieves higher precision and recall than individual callers on real and simulated datasets, and reaches up to 100% precision, indicating that ensemble calling is imperative for high-confidence results. MetaFusion uses FusionAnnotator to annotate calls with information from cancer fusion databases and is provided with a Benchmarking Toolkit to calibrate new callers. AVAILABILITY AND IMPLEMENTATION: MetaFusion is freely available at https://github.com/ccmbioinfo/MetaFusion. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

6.
Hum Mol Genet ; 28(3): 372-385, 2019 02 01.
Artigo em Inglês | MEDLINE | ID: mdl-30239726

RESUMO

Children conceived using Assisted Reproductive Technologies (ART) have a higher incidence of growth and birth defects, attributable in part to epigenetic perturbations. Both ART and germline defects associated with parental infertility could interfere with epigenetic reprogramming events in germ cells or early embryos. Mouse models indicate that the placenta is more susceptible to the induction of epigenetic abnormalities than the embryo, and thus the placental methylome may provide a sensitive indicator of 'at risk' conceptuses. Our goal was to use genome-wide profiling to examine the extent of epigenetic abnormalities in matched placentas from an ART/infertility group and control singleton pregnancies (n = 44/group) from a human prospective longitudinal birth cohort, the Design, Develop, Discover (3D) Study. Principal component analysis revealed a group of ART outliers. The ART outlier group was enriched for females and a subset of placentas showing loss of methylation of several imprinted genes including GNAS, SGCE, KCNQT1OT1 and BLCAP/NNAT. Within the ART group, placentas from pregnancies conceived with in vitro fertilization (IVF)/intracytoplasmic sperm injection (ICSI) showed distinct epigenetic profiles as compared to those conceived with less invasive procedures (ovulation induction, intrauterine insemination). Male factor infertility and paternal age further differentiated the IVF/ICSI group, suggesting an interaction of infertility and techniques in perturbing the placental epigenome. Together, the results suggest that the human placenta is sensitive to the induction of epigenetic defects by ART and/or infertility, and we stress the importance of considering both sex and paternal factors and that some but not all ART conceptuses will be susceptible.


Assuntos
Placenta/fisiologia , Placentação/genética , Técnicas de Reprodução Assistida/efeitos adversos , Adulto , Estudos de Coortes , DNA/metabolismo , Metilação de DNA/genética , Epigênese Genética/genética , Epigenômica , Feminino , Fertilização in vitro/efeitos adversos , Estudo de Associação Genômica Ampla/métodos , Impressão Genômica/genética , Humanos , Lactente , Recém-Nascido , Infertilidade Masculina/metabolismo , Estudos Longitudinais , Masculino , Pessoa de Meia-Idade , Modelos Animais , Indução da Ovulação/efeitos adversos , Placenta/metabolismo , Gravidez , Análise de Componente Principal , Estudos Prospectivos , Reprodução , Injeções de Esperma Intracitoplásmicas/efeitos adversos
7.
Gastroenterology ; 158(8): 2208-2220, 2020 06.
Artigo em Inglês | MEDLINE | ID: mdl-32084423

RESUMO

BACKGROUND & AIMS: A proportion of infants and young children with inflammatory bowel diseases (IBDs) have subtypes associated with a single gene variant (monogenic IBD). We aimed to determine the prevalence of monogenic disease in a cohort of pediatric patients with IBD. METHODS: We performed whole-exome sequencing analyses of blood samples from an unselected cohort of 1005 children with IBD, aged 0-18 years (median age at diagnosis, 11.96 years) at a single center in Canada and their family members (2305 samples total). Variants believed to cause IBD were validated using Sanger sequencing. Biopsies from patients were analyzed by immunofluorescence and histochemical analyses. RESULTS: We identified 40 rare variants associated with 21 monogenic genes among 31 of the 1005 children with IBD (including 5 variants in XIAP, 3 in DOCK8, and 2 each in FOXP3, GUCY2C, and LRBA). These variants occurred in 7.8% of children younger than 6 years and 2.3% of children aged 6-18 years. Of the 17 patients with monogenic Crohn's disease, 35% had abdominal pain, 24% had nonbloody loose stool, 18% had vomiting, 18% had weight loss, and 5% had intermittent bloody loose stool. The 14 patients with monogenic ulcerative colitis or IBD-unclassified received their diagnosis at a younger age, and their most predominant feature was bloody loose stool (78%). Features associated with monogenic IBD, compared to cases of IBD not associated with a single variant, were age of onset younger than 2 years (odds ratio [OR], 6.30; P = .020), family history of autoimmune disease (OR, 5.12; P = .002), extra-intestinal manifestations (OR, 15.36; P < .0001), and surgery (OR, 3.42; P = .042). Seventeen patients had variants in genes that could be corrected with allogeneic hematopoietic stem cell transplantation. CONCLUSIONS: In whole-exome sequencing analyses of more than 1000 children with IBD at a single center, we found that 3% had rare variants in genes previously associated with pediatric IBD. These were associated with different IBD phenotypes, and 1% of the patients had variants that could be potentially corrected with allogeneic hematopoietic stem cell transplantation. Monogenic IBD is rare, but should be considered in analysis of all patients with pediatric onset of IBD.


Assuntos
Colite Ulcerativa/genética , Doença de Crohn/genética , Sequenciamento do Exoma , Variação Genética , Adolescente , Fatores Etários , Produtos Biológicos/uso terapêutico , Criança , Pré-Escolar , Colite Ulcerativa/diagnóstico , Colite Ulcerativa/epidemiologia , Colite Ulcerativa/terapia , Doença de Crohn/diagnóstico , Doença de Crohn/epidemiologia , Doença de Crohn/terapia , Feminino , Predisposição Genética para Doença , Transplante de Células-Tronco Hematopoéticas , Humanos , Lactente , Recém-Nascido , Masculino , Ontário/epidemiologia , Fenótipo , Prevalência , Medição de Risco , Fatores de Risco , Transplante Homólogo , Resultado do Tratamento
8.
Hum Mutat ; 41(10): 1722-1733, 2020 10.
Artigo em Inglês | MEDLINE | ID: mdl-32623772

RESUMO

Epigenetic processes play a key role in regulating gene expression. Genetic variants that disrupt chromatin-modifying proteins are associated with a broad range of diseases, some of which have specific epigenetic patterns, such as aberrant DNA methylation (DNAm), which may be used as disease biomarkers. While much of the epigenetic research has focused on cancer, there is a paucity of resources devoted to neurodevelopmental disorders (NDDs), which include autism spectrum disorder and many rare, clinically overlapping syndromes. To address this challenge, we created EpigenCentral, a free web resource for biomedical researchers, molecular diagnostic laboratories, and clinical practitioners to perform the interactive classification and analysis of DNAm data related to NDDs. It allows users to search for known disease-associated patterns in their DNAm data, classify genetic variants as pathogenic or benign to assist in molecular diagnostics, or analyze patterns of differential methylation in their data through a simple web form. EpigenCentral is freely available at http://epigen.ccm.sickkids.ca/.


Assuntos
Transtorno do Espectro Autista , Metilação de DNA , Transtorno do Espectro Autista/diagnóstico , Transtorno do Espectro Autista/genética , Metilação de DNA/genética , Análise de Dados , Epigênese Genética , Humanos , Doenças Raras/diagnóstico , Doenças Raras/genética
9.
Am J Hum Genet ; 100(5): 773-788, 2017 May 04.
Artigo em Inglês | MEDLINE | ID: mdl-28475860

RESUMO

Epigenetic dysregulation has emerged as a recurring mechanism in the etiology of neurodevelopmental disorders. Two such disorders, CHARGE and Kabuki syndromes, result from loss of function mutations in chromodomain helicase DNA-binding protein 7 (CHD7LOF) and lysine (K) methyltransferase 2D (KMT2DLOF), respectively. Although these two syndromes are clinically distinct, there is significant phenotypic overlap. We therefore expected that epigenetically driven developmental pathways regulated by CHD7 and KMT2D would overlap and that DNA methylation (DNAm) alterations downstream of the mutations in these genes would identify common target genes, elucidating a mechanistic link between these two conditions, as well as specific target genes for each disorder. Genome-wide DNAm profiles in individuals with CHARGE and Kabuki syndromes with CHD7LOF or KMT2DLOF identified distinct sets of DNAm differences in each of the disorders, which were used to generate two unique, highly specific and sensitive DNAm signatures. These DNAm signatures were able to differentiate pathogenic mutations in these two genes from controls and from each other. Analysis of the DNAm targets in each gene-specific signature identified both common gene targets, including homeobox A5 (HOXA5), which could account for some of the clinical overlap in CHARGE and Kabuki syndromes, as well as distinct gene targets. Our findings demonstrate how characterization of the epigenome can contribute to our understanding of disease pathophysiology for epigenetic disorders, paving the way for explorations of novel therapeutics.


Assuntos
Anormalidades Múltiplas/genética , Síndrome CHARGE/genética , Metilação de DNA , Epigênese Genética , Face/anormalidades , Doenças Hematológicas/genética , Doenças Vestibulares/genética , Anormalidades Múltiplas/diagnóstico , Síndrome CHARGE/diagnóstico , Linhagem Celular , DNA Helicases/genética , DNA Helicases/metabolismo , Proteínas de Ligação a DNA/genética , Proteínas de Ligação a DNA/metabolismo , Regulação da Expressão Gênica , Genoma Humano , Doenças Hematológicas/diagnóstico , Proteínas de Homeodomínio/genética , Proteínas de Homeodomínio/metabolismo , Humanos , Leucócitos Mononucleares/citologia , Leucócitos Mononucleares/metabolismo , Mutação , Proteínas de Neoplasias/genética , Proteínas de Neoplasias/metabolismo , Reprodutibilidade dos Testes , Sensibilidade e Especificidade , Doenças Vestibulares/diagnóstico
10.
Nucleic Acids Res ; 43(15): 7217-28, 2015 Sep 03.
Artigo em Inglês | MEDLINE | ID: mdl-26130710

RESUMO

With the development of High-Throughput Sequencing (HTS) thousands of human genomes have now been sequenced. Whenever different studies analyze the same genome they usually agree on the amount of single-nucleotide polymorphisms, but differ dramatically on the number of insertion and deletion variants (indels). Furthermore, there is evidence that indels are often severely under-reported. In this manuscript we derive the total number of indel variants in a human genome by combining data from different sequencing technologies, while assessing the indel detection accuracy. Our estimate of approximately 1 million indels in a Yoruban genome is much higher than the results reported in several recent HTS studies. We identify two key sources of difficulties in indel detection: the insufficient coverage, read length or alignment quality; and the presence of repeats, including short interspersed elements and homopolymers/dimers. We quantify the effect of these factors on indel detection. The quality of sequencing data plays a major role in improving indel detection by HTS methods. However, many indels exist in long homopolymers and repeats, where their detection is severely impeded. The true number of indel events is likely even higher than our current estimates, and new techniques and technologies will be required to detect them.


Assuntos
Análise Mutacional de DNA/métodos , Genoma Humano , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Mutação INDEL , Elementos Alu , Animais , Composição de Bases , Evolução Molecular , Humanos , Primatas , Sequências Repetitivas de Ácido Nucleico
11.
Mol Cell Proteomics ; 13(2): 489-502, 2014 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-24319055

RESUMO

Endoglin and activin receptor-like kinase 1 are specialized transforming growth factor-beta (TGF-ß) superfamily receptors, primarily expressed in endothelial cells. Mutations in the corresponding ENG or ACVRL1 genes lead to hereditary hemorrhagic telangiectasia (HHT1 and HHT2 respectively). To discover proteins interacting with endoglin, ACVRL1 and TGF-ß receptor type 2 and involved in TGF-ß signaling, we applied LUMIER, a high-throughput mammalian interactome mapping technology. Using stringent criteria, we identified 181 novel unique and shared interactions with ACVRL1, TGF-ß receptor type 2, and endoglin, defining potential novel important vascular networks. In particular, the regulatory subunit B-beta of the protein phosphatase PP2A (PPP2R2B) interacted with all three receptors. Interestingly, the PPP2R2B gene lies in an interval in linkage disequilibrium with HHT3, for which the gene remains unidentified. We show that PPP2R2B protein interacts with the ACVRL1/TGFBR2/endoglin complex and recruits PP2A to nitric oxide synthase 3 (NOS3). Endoglin overexpression in endothelial cells inhibits the association of PPP2R2B with NOS3, whereas endoglin-deficient cells show enhanced PP2A-NOS3 interaction and lower levels of endogenous NOS3 Serine 1177 phosphorylation. Our data suggest that endoglin regulates NOS3 activation status by regulating PPP2R2B access to NOS3, and that PPP2R2B might be the HHT3 gene. Furthermore, endoglin and ACVRL1 contribute to several novel networks, including TGF-ß dependent and independent ones, critical for vascular function and potentially defective in HHT.


Assuntos
Receptores de Activinas Tipo II/metabolismo , Antígenos CD/metabolismo , Vasos Sanguíneos/metabolismo , Mapas de Interação de Proteínas , Receptores de Superfície Celular/metabolismo , Animais , Embrião de Mamíferos , Endoglina , Endotélio Vascular/metabolismo , Endotélio Vascular/patologia , Células HEK293 , Humanos , Camundongos , Camundongos Knockout , Ligação Proteica , Telangiectasia Hemorrágica Hereditária/metabolismo , Telangiectasia Hemorrágica Hereditária/patologia , Fator de Crescimento Transformador beta/metabolismo
12.
Mol Syst Biol ; 10: 741, 2014 Jul 15.
Artigo em Inglês | MEDLINE | ID: mdl-25028490

RESUMO

The hematopoietic system is a distributed tissue that consists of functionally distinct cell types continuously produced through hematopoietic stem cell (HSC) differentiation. Combining genomic and phenotypic data with high-content experiments, we have built a directional cell-cell communication network between 12 cell types isolated from human umbilical cord blood. Network structure analysis revealed that ligand production is cell type dependent, whereas ligand binding is promiscuous. Consequently, additional control strategies such as cell frequency modulation and compartmentalization were needed to achieve specificity in HSC fate regulation. Incorporating the in vitro effects (quiescence, self-renewal, proliferation, or differentiation) of 27 HSC binding ligands into the topology of the cell-cell communication network allowed coding of cell type-dependent feedback regulation of HSC fate. Pathway enrichment analysis identified intracellular regulatory motifs enriched in these cell type- and ligand-coupled responses. This study uncovers cellular mechanisms of hematopoietic cell feedback in HSC fate regulation, provides insight into the design principles of the human hematopoietic system, and serves as a foundation for the analysis of intercellular regulation in multicellular systems.


Assuntos
Comunicação Celular , Células-Tronco Hematopoéticas/citologia , Células-Tronco Hematopoéticas/fisiologia , Algoritmos , Células Cultivadas , Perfilação da Expressão Gênica , Hematopoese , Humanos , Ligantes
13.
Am J Physiol Endocrinol Metab ; 307(7): E596-610, 2014 Oct 01.
Artigo em Inglês | MEDLINE | ID: mdl-25117410

RESUMO

The multidrug resistance efflux transporter ATP-binding cassette subfamily G member 2 (ABCG2) is not only overexpressed in certain drug-resistant cancers but is also highly expressed in the mammary gland during lactation, carrying xenobiotics and nutrients into milk. We sought to investigate the molecular mechanisms involved in the upregulation of ABCG2 during lactation. Expression profiling of different mouse Abcg2 mRNA isoforms (E1a, E1b, and E1c) revealed that E1b is predominantly expressed and induced in the lactating mouse mammary gland. Despite this induction, analyses of CpG methylation status and published ChIP-seq datasets reveal that E1b promoter sequences in the virgin gland are already hypomethylated and marked with the open chromatin histone mark H3K4me2. Using a forced-weaning model to shut down lactation, we found that within 24 h there was a significant reduction in Abcg2 mRNA expression and a loss of signal transducer and activator of transcription-5 (STAT5) occupancy at the mouse Abcg2 gene. Luciferase reporter assays further showed that some of these STAT5-binding regions that contained interferon-γ-activated sequence (GAS) motifs function as an enhancer after prolactin treatment. We conclude that Abcg2 is already poised for expression in the virgin mammary gland and that STAT5 plays an important role in Abcg2 expression during lactation.


Assuntos
Transportadores de Cassetes de Ligação de ATP/genética , Epigênese Genética , Células Epiteliais/metabolismo , Lactação/genética , Glândulas Mamárias Animais/metabolismo , Isoformas de RNA/genética , RNA Mensageiro/genética , Fator de Transcrição STAT5/fisiologia , Membro 2 da Subfamília G de Transportadores de Cassetes de Ligação de ATP , Transportadores de Cassetes de Ligação de ATP/metabolismo , Animais , Ilhas de CpG , Metilação de DNA , Feminino , Camundongos , Camundongos Endogâmicos C57BL , Camundongos Endogâmicos , Regiões Promotoras Genéticas , Transdução de Sinais , Regulação para Cima
14.
Nat Commun ; 15(1): 5898, 2024 Jul 13.
Artigo em Inglês | MEDLINE | ID: mdl-39003323

RESUMO

Studying human fetal lungs can inform how developmental defects and disease states alter the function of the lungs. Here, we sequenced >150,000 single cells from 19 healthy human pseudoglandular fetal lung tissues ranging between gestational weeks 10-19. We capture dynamic developmental trajectories from progenitor cells that express abundant levels of the cystic fibrosis conductance transmembrane regulator (CFTR). These cells give rise to multiple specialized epithelial cell types. Combined with spatial transcriptomics, we show temporal regulation of key signalling pathways that may drive the temporal and spatial emergence of specialized epithelial cells including ciliated and pulmonary neuroendocrine cells. Finally, we show that human pluripotent stem cell-derived fetal lung models contain CFTR-expressing progenitor cells that capture similar lineage developmental trajectories as identified in the native tissue. Overall, this study provides a comprehensive single-cell atlas of the developing human lung, outlining the temporal and spatial complexities of cell lineage development and benchmarks fetal lung cultures from human pluripotent stem cell differentiations to similar developmental window.


Assuntos
Diferenciação Celular , Regulador de Condutância Transmembrana em Fibrose Cística , Células Epiteliais , Feto , Pulmão , Humanos , Pulmão/embriologia , Pulmão/citologia , Células Epiteliais/citologia , Células Epiteliais/metabolismo , Feto/citologia , Feto/embriologia , Regulador de Condutância Transmembrana em Fibrose Cística/metabolismo , Regulador de Condutância Transmembrana em Fibrose Cística/genética , Plasticidade Celular , Linhagem da Célula , Células-Tronco Pluripotentes/citologia , Células-Tronco Pluripotentes/metabolismo , Análise de Célula Única , Transcriptoma , Feminino , Regulação da Expressão Gênica no Desenvolvimento , Transdução de Sinais
15.
Nucleic Acids Res ; 39(Database issue): D889-94, 2011 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-20876685

RESUMO

Chromatin modification (CM) is a set of epigenetic processes that govern many aspects of DNA replication, transcription and repair. CM is carried out by groups of physically interacting proteins, and their disruption has been linked to a number of complex human diseases. CM remains largely unexplored, however, especially in higher eukaryotes such as human. Here we present the DAnCER resource, which integrates information on genes with CM function from five model organisms, including human. Currently integrated are gene functional annotations, Pfam domain architecture, protein interaction networks and associated human diseases. Additional supporting evidence includes orthology relationships across organisms, membership in protein complexes, and information on protein 3D structure. These data are available for 962 experimentally confirmed and manually curated CM genes and for over 5000 genes with predicted CM function on the basis of orthology and domain composition. DAnCER allows visual explorations of the integrated data and flexible query capabilities using a variety of data filters. In particular, disease information and functional annotations are mapped onto the protein interaction networks, enabling the user to formulate new hypotheses on the function and disease associations of a given gene based on those of its interaction partners. DAnCER is freely available at http://wodaklab.org/dancer/.


Assuntos
Cromatina/metabolismo , Bases de Dados Genéticas , Doença/genética , Epigenômica , Animais , Caenorhabditis elegans/genética , Drosophila melanogaster/genética , Humanos , Camundongos , Anotação de Sequência Molecular , Conformação Proteica , Mapeamento de Interação de Proteínas , Saccharomyces cerevisiae/genética
16.
Bioinformatics ; 27(6): 883-4, 2011 Mar 15.
Artigo em Inglês | MEDLINE | ID: mdl-21257609

RESUMO

MOTIVATION: Protein interaction networks contain a wealth of biological information, but their large size often hinders cross-organism comparisons. We present OrthoNets, a Cytoscape plugin that displays protein-protein interaction (PPI) networks from two organisms simultaneously, highlighting orthology relationships and aggregating several types of biomedical annotations. OrthoNets also allows PPI networks derived from experiments to be overlaid on networks extracted from public databases, supporting the identification and verification of new interactors. Any newly identified PPIs can be validated by checking whether their orthologs interact in another organism. AVAILABILITY: OrthoNets is freely available at http://wodaklab.org/orthonets/.


Assuntos
Biologia Computacional/métodos , Mapeamento de Interação de Proteínas/métodos , Software , Bases de Dados de Proteínas , Proteínas/análise , Interface Usuário-Computador
17.
Nucleic Acids Res ; 38(22): 7927-42, 2010 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-20705649

RESUMO

Classifying proteins into subgroups with similar molecular function on the basis of sequence is an important step in deriving reliable functional annotations computationally. So far, however, available classification procedures have been evaluated against protein subgroups that are defined by experts using mainly qualitative descriptions of molecular function. Recently, in vitro DNA-binding preferences to all possible 8-nt DNA sequences have been measured for 178 mouse homeodomains using protein-binding microarrays, offering the unprecedented opportunity of evaluating the classification methods against quantitative measures of molecular function. To this end, we automatically derive homeodomain subtypes from the DNA-binding data and independently group the same domains using sequence information alone. We test five sequence-based methods, which use different sequence-similarity measures and algorithms to group sequences. Results show that methods that optimize the classification robustness reflect well the detailed functional specificity revealed by the experimental data. In some of these classifications, 73-83% of the subfamilies exactly correspond to, or are completely contained in, the function-based subtypes. Our findings demonstrate that certain sequence-based classifications are capable of yielding very specific molecular function annotations. The availability of quantitative descriptions of molecular function, such as DNA-binding data, will be a key factor in exploiting this potential in the future.


Assuntos
Proteínas de Homeodomínio/classificação , Animais , DNA/metabolismo , Proteínas de Homeodomínio/síntese química , Proteínas de Homeodomínio/metabolismo , Camundongos , Análise de Sequência de Proteína
18.
Comput Struct Biotechnol J ; 20: 6375-6387, 2022.
Artigo em Inglês | MEDLINE | ID: mdl-36420149

RESUMO

Tumors are complex biological entities that comprise cell types of different origins, with different mutational profiles and different patterns of transcriptional dysregulation. The exploration of data related to cancer biology requires careful analytical methods to reflect the heterogeneity of cell populations in cancer samples. Single-cell techniques are now able to capture the transcriptional profiles of individual cells. However, the complexity of RNA-seq data, especially in cancer samples, makes it challenging to cluster single-cell profiles into groups that reflect the underlying cell types. We have developed a framework for a systematic examination of single-cell RNA-seq clustering algorithms for cancer data, which uses a range of well-established metrics to generate a unified quality score and algorithm ranking. To demonstrate this framework, we examined clustering performance of 15 different single-cell RNA-seq clustering algorithms on eight different cancer datasets. Our results suggest that the single-cell RNA-seq clustering algorithms fall into distinct groups by performance, with the highest clustering quality on non-malignant cells achieved by three algorithms: Seurat, bigSCale and Cell Ranger. However, for malignant cells, two additional algorithms often reach a better performance, namely Monocle and SC3. Their ability to detect known rare cell types was also among the best, along with Seurat. Our approach and results can be used by a broad audience of practitioners who analyze single-cell transcriptomic data in cancer research.

19.
Curr Protoc ; 2(11): e597, 2022 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-36367395

RESUMO

There are more than 700 genes that encode proteins that function in epigenetic regulation and chromatin modification. Germline variants in these genes (typically heterozygous) are associated with rare neurodevelopmental disorders (NDDs) characterized by growth abnormalities and intellectual and developmental delay. Advancements in next-generation sequencing have dramatically increased the detection of pathogenic sequence variants in genes encoding epigenetic machinery associated with NDDs and, concurrently, the number of clinically uninterpretable variants classified as variants of uncertain significance (VUS). Recently, DNA methylation (DNAm) signatures, disorder-specific patterns of DNAm change, have emerged as a functional tool that provides insights into disorder pathophysiology and can classify pathogenicity of variants in NDDs. To date, our group and others have identified DNAm signatures for more than 60 Mendelian neurodevelopmental disorders caused by variants in genes encoding epigenetic machinery. There is broad interest in both the research and clinical communities to develop and catalog DNAm signatures in rare NDDs, but there are challenges in optimizing study design considerations and availability of platforms that integrate bioinformatics tools with the appropriate statistical framework required to analyze genome-wide DNAm data. We previously published EpigenCentral, a platform for analysis of DNAm data in rare NDDs. In this article, we utilize the published Weaver syndrome dataset to provide step-by-step protocols for using EpigenCentral for exploratory analysis to identify DNAm signatures and for classification of NDD variants. We also provide important considerations for experimental design and interpretation of DNAm results. © 2022 Wiley Periodicals LLC. Basic Protocol 1: Exploratory analysis to identify disorder-specific DNAm signatures Basic Protocol 2: Classification of variants associated with neurodevelopmental disorders.


Assuntos
Anormalidades Múltiplas , Deformidades Congênitas da Mão , Transtornos do Neurodesenvolvimento , Humanos , Metilação de DNA/genética , Epigênese Genética/genética , Transtornos do Neurodesenvolvimento/genética , Anormalidades Múltiplas/genética , Deformidades Congênitas da Mão/genética
20.
Methods Mol Biol ; 2199: 191-207, 2021.
Artigo em Inglês | MEDLINE | ID: mdl-33125652

RESUMO

iRefWeb is a resource that provides web interface to a large collection of protein-protein interactions aggregated from major primary databases. The underlying data-consolidation process, called iRefIndex, implements a rigorous methodology of identifying redundant protein sequences and integrating disparate data records that reference the same peptide sequences, despite many potential differences in data identifiers across various source databases. iRefWeb offers a unified user interface to all interaction records and associated information collected by iRefIndex, in addition to a number of data filters and visual features that present the supporting evidence. Users of iRefWeb can explore the consolidated landscape of protein-protein interactions, establish the provenance and reliability of each data record, and compare annotations performed by different data curator teams. The iRefWeb portal is freely available at http://wodaklab.org/iRefWeb .


Assuntos
Sistemas de Gerenciamento de Base de Dados , Bases de Dados de Proteínas , Internet , Mapeamento de Interação de Proteínas , Interface Usuário-Computador , Humanos
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA