Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 43
Filtrar
Mais filtros

Bases de dados
Tipo de documento
Intervalo de ano de publicação
1.
Genome Res ; 30(12): 1789-1801, 2020 12.
Artigo em Inglês | MEDLINE | ID: mdl-33060171

RESUMO

The advances of large-scale genomics studies have enabled compilation of cell type-specific, genome-wide DNA functional elements at high resolution. With the growing volume of functional annotation data and sequencing variants, existing variant annotation algorithms lack the efficiency and scalability to process big genomic data, particularly when annotating whole-genome sequencing variants against a huge database with billions of genomic features. Here, we develop VarNote to rapidly annotate genome-scale variants in large and complex functional annotation resources. Equipped with a novel index system and a parallel random-sweep searching algorithm, VarNote shows substantial performance improvements (two to three orders of magnitude) over existing algorithms at different scales. It supports both region-based and allele-specific annotations and introduces advanced functions for the flexible extraction of annotations. By integrating massive base-wise and context-dependent annotations in the VarNote framework, we introduce three efficient and accurate pipelines to prioritize the causal regulatory variants for common diseases, Mendelian disorders, and cancers.


Assuntos
Biologia Computacional/métodos , Predisposição Genética para Doença/genética , Algoritmos , Bases de Dados Genéticas , Variação Genética , Genoma Humano , Humanos , Anotação de Sequência Molecular , Sequenciamento Completo do Genoma
2.
Int J Obes (Lond) ; 45(6): 1331-1341, 2021 06.
Artigo em Inglês | MEDLINE | ID: mdl-33753887

RESUMO

BACKGROUND: Long chain omega-3 polyunsaturated fatty acids (ω-3PUFA) supplementation in animal models of diet-induced obesity has consistently shown to improve insulin sensitivity. The same is not always reported in human studies with insulin resistant (IR) subjects with obesity. OBJECTIVE: We studied whether high-dose ω-3PUFA supplementation for 3 months improves insulin sensitivity and adipose tissue (AT) inflammation in IR subjects with obesity. METHODS: Thirteen subjects (BMI = 39.3 ± 1.6 kg/m2) underwent 80 mU/m2·min euglycemic-hyperinsulinemic clamp with subcutaneous (Sc) AT biopsy before and after 3 months of ω-3PUFA (DHA and EPA, 4 g/daily) supplementation. Cytoadipokine plasma profiles were assessed before and after ω-3PUFA. AT-specific inflammatory gene expression was evaluated on Sc fat biopsies. Microarray analysis was performed on the fat biopsies collected during the program. RESULTS: Palmitic and stearic acid plasma levels were significantly reduced (P < 0.05) after ω-3PUFA. Gene expression of pro-inflammatory markers and adipokines were improved after ω-3PUFA (P < 0.05). Systemic inflammation was decreased after ω-3PUFA, as shown by cytokine assessment (P < 0.05). These changes were associated with a 25% increase in insulin-stimulated glucose disposal (4.7 ± 0.6 mg/kg ffm•min vs. 5.9 ± 0.9 mg/kg ffm•min) despite no change in body weight. Microarray analysis identified 53 probe sets significantly altered post- ω-3PUFA, with Apolipoprotein E (APOE) being one of the most upregulated genes. CONCLUSION: High dose of long chain ω-3PUFA supplementation modulates significant changes in plasma fatty acid profile, AT, and systemic inflammation. These findings are associated with significant improvement of insulin-stimulated glucose disposal. Unbiased microarray analysis of Sc fat biopsy identified APOE as among the most differentially regulated gene after ω-3PUFA supplementation. We speculate that ω-3PUFA increases macrophage-derived APOE mRNA levels with anti-inflammatory properties.


Assuntos
Tecido Adiposo , Ácidos Graxos Ômega-3 , Inflamação/metabolismo , Obesidade/metabolismo , Tecido Adiposo/efeitos dos fármacos , Tecido Adiposo/metabolismo , Adulto , Apolipoproteínas E/genética , Apolipoproteínas E/metabolismo , Glicemia/efeitos dos fármacos , Ácidos Graxos Ômega-3/administração & dosagem , Ácidos Graxos Ômega-3/farmacologia , Feminino , Humanos , Resistência à Insulina/fisiologia , Masculino , Gordura Subcutânea/metabolismo , Transcriptoma/efeitos dos fármacos , Transcriptoma/genética
3.
Cardiology ; 146(3): 311-314, 2021.
Artigo em Inglês | MEDLINE | ID: mdl-33735875

RESUMO

National Cardiovascular Data Registry (NCDR)-based logistic regression model is available for clinicians to predict in-hospital all-cause mortality after a percutaneous coronary intervention (PCI). However, this model has never been used to predict long-term all-cause mortality after PCI. Therefore, we sought to test the ability of the NCDR model to predict the short- and long-term risk of all-cause mortality in patients undergoing PCI. All patients undergoing PCI in the Mayo Clinic Health System were enrolled in the Mayo Clinic CathPCI registry. Patient-level demographic, clinical, and angiographic data from January 2006 to December 2017 were extracted from the registry. Patients who underwent coronary artery bypass graft surgery (CABG) were excluded. The area under the receiver operator characteristic curve (AUC) was calculated to assess the ability of the NCDR model to predict outcomes of interest (6-month, 1-year, 2-year, and 5-year all-cause mortality) after PCI. A total of 17,356 unique patients were included for the final analysis after excluding 165 patients who underwent CABG surgery. The mean age was 66.9 ± 12.5 years, and 71% were men. The 6-month, 1-year, 2-year, and 5-year all-cause mortality rates were 4.2% (n = 737), 5.8% (n = 1,005), 8.06% (n = 1,399), and 14.2% (n = 2,472), respectively. The AUCs of the NCDR model to predict 6-month, 1-year, 2-year, and 5-year all-cause mortality were 0.84 (95% CI: 0.82-0.86), 0.82 (95% CI: 0.80-0.84), 0.80 (95% CI: 0.79-0.81), and 0.78 (95% CI: 0.77-0.79), respectively. The NCDR model was able to accurately predict both short- and long-term all-cause mortality after PCI.


Assuntos
Doença da Artéria Coronariana , Intervenção Coronária Percutânea , Idoso , Mortalidade Hospitalar , Humanos , Masculino , Sistema de Registros , Medição de Risco , Fatores de Risco , Fatores de Tempo , Resultado do Tratamento
4.
Nucleic Acids Res ; 46(W1): W114-W120, 2018 07 02.
Artigo em Inglês | MEDLINE | ID: mdl-29771388

RESUMO

Genome-wide association studies have generated over thousands of susceptibility loci for many human complex traits, and yet for most of these associations the true causal variants remain unknown. Tissue/cell type-specific prediction and prioritization of non-coding regulatory variants will facilitate the identification of causal variants and underlying pathogenic mechanisms for particular complex diseases and traits. By leveraging recent large-scale functional genomics/epigenomics data, we develop an intuitive web server, GWAS4D (http://mulinlab.tmu.edu.cn/gwas4d or http://mulinlab.org/gwas4d), that systematically evaluates GWAS signals and identifies context-specific regulatory variants. The updated web server includes six major features: (i) updates the regulatory variant prioritization method with our new algorithm; (ii) incorporates 127 tissue/cell type-specific epigenomes data; (iii) integrates motifs of 1480 transcriptional regulators from 13 public resources; (iv) uniformly processes Hi-C data and generates significant interactions at 5 kb resolution across 60 tissues/cell types; (v) adds comprehensive non-coding variant functional annotations; (vi) equips a highly interactive visualization function for SNP-target interaction. Using a GWAS fine-mapped set for 161 coronary artery disease risk loci, we demonstrate that GWAS4D is able to efficiently prioritize disease-causal regulatory variants.


Assuntos
Doenças Genéticas Inatas , Estudo de Associação Genômica Ampla , Locos de Características Quantitativas/genética , Software , Biologia Computacional/tendências , Genômica/métodos , Humanos , Polimorfismo de Nucleotídeo Único/genética
5.
Echocardiography ; 37(11): 1792-1802, 2020 11.
Artigo em Inglês | MEDLINE | ID: mdl-33012034

RESUMO

INTRODUCTION: The right ventricle (RV) strain measured by speckle tracking (RVS) is an echocardiographic parameter used to assess RV function. We compared RVS to RV fractional area change (FAC%), tricuspid annular plane systolic excursion (TAPSE) and Doppler tissue imaging-derived peak systolic velocity (S') in the assessment of right ventricular (RV) systolic function measured using cardiac magnetic resonance imaging (MRI). METHODS: We enrolled consecutive patients who underwent cardiac MRI between Jan 2012 and Dec 2017 and a transthoracic echocardiogram (TTE) within 1 month of the MRI with no interval event. Baseline clinical characteristics and MRI parameters were extracted from chart review. Echocardiographic parameters were measured prospectively. TTE parameters including RVS, TAPSE, S', and FAC% were tested for accuracy to identify impaired RV EF (EF < 45% & <30%) using receiver operator curves. RESULTS: The study cohort included 500 patients with mean age 55 years ± 18 and peak tricuspid regurgitation velocity 2.7 ± 1.4 m/s. The area under ROC for RVS was 0.69 (95% CI 0.63-0.75) and 0.78 (95% CI 0.70-0.88) to predict RVEF < 45% & RVEF < 30%, respectively. The RV FAC% had second highest accuracy of predicting RVEF among all the TTE parameters tested in study. CONCLUSION: Right ventricular strain is the most accurate echocardiographic method to detect impaired right ventricular systolic function when using MRI as the gold standard.


Assuntos
Disfunção Ventricular Direita , Função Ventricular Direita , Ecocardiografia , Humanos , Imageamento por Ressonância Magnética , Pessoa de Meia-Idade , Sensibilidade e Especificidade , Volume Sistólico , Disfunção Ventricular Direita/diagnóstico por imagem
6.
Nucleic Acids Res ; 45(22): e179, 2017 Dec 15.
Artigo em Inglês | MEDLINE | ID: mdl-28981748

RESUMO

Linnorm is a novel normalization and transformation method for the analysis of single cell RNA sequencing (scRNA-seq) data. Linnorm is developed to remove technical noises and simultaneously preserve biological variations in scRNA-seq data, such that existing statistical methods can be improved. Using real scRNA-seq data, we compared Linnorm with existing normalization methods, including NODES, SAMstrt, SCnorm, scran, DESeq and TMM. Linnorm shows advantages in speed, technical noise removal and preservation of cell heterogeneity, which can improve existing methods in the discovery of novel subtypes, pseudo-temporal ordering of cells, clustering analysis, etc. Linnorm also performs better than existing DEG analysis methods, including BASiCS, NODES, SAMstrt, Seurat and DESeq2, in false positive rate control and accuracy.


Assuntos
Algoritmos , Bioestatística/métodos , Perfilação da Expressão Gênica/métodos , Análise de Sequência de RNA/métodos , Análise de Célula Única/métodos , Análise por Conglomerados , Modelos Lineares , RNA/classificação , RNA/genética , Reprodutibilidade dos Testes
7.
Nucleic Acids Res ; 45(10): 5653-5665, 2017 Jun 02.
Artigo em Inglês | MEDLINE | ID: mdl-28472449

RESUMO

Competing endogenous RNAs (ceRNAs) are RNA molecules that sequester shared microRNAs (miRNAs) thereby affecting the expression of other targets of the miRNAs. Whether genetic variants in ceRNA can affect its biological function and disease development is still an open question. Here we identified a large number of genetic variants that are associated with ceRNA's function using Geuvaids RNA-seq data for 462 individuals from the 1000 Genomes Project. We call these loci competing endogenous RNA expression quantitative trait loci or 'cerQTL', and found that a large number of them were unexplored in conventional eQTL mapping. We identified many cerQTLs that have undergone recent positive selection in different human populations, and showed that single nucleotide polymorphisms in gene 3΄UTRs at the miRNA seed binding regions can simultaneously regulate gene expression changes in both cis and trans by the ceRNA mechanism. We also discovered that cerQTLs are significantly enriched in traits/diseases associated variants reported from genome-wide association studies in the miRNA binding sites, suggesting that disease susceptibilities could be attributed to ceRNA regulation. Further in vitro functional experiments demonstrated that a cerQTL rs11540855 can regulate ceRNA function. These results provide a comprehensive catalog of functional non-coding regulatory variants that may be responsible for ceRNA crosstalk at the post-transcriptional level.


Assuntos
Regulação da Expressão Gênica , Redes Reguladoras de Genes , Genoma Humano , MicroRNAs/genética , Locos de Características Quantitativas , RNA não Traduzido/genética , Regiões 3' não Traduzidas , Pareamento de Bases , Sítios de Ligação , Mapeamento Cromossômico , Estudo de Associação Genômica Ampla , Humanos , MicroRNAs/metabolismo , Polimorfismo de Nucleotídeo Único , RNA não Traduzido/metabolismo
8.
Nucleic Acids Res ; 45(W1): W215-W221, 2017 07 03.
Artigo em Inglês | MEDLINE | ID: mdl-28482068

RESUMO

Cancer therapies have experienced rapid progress in recent years, with a number of novel small-molecule kinase inhibitors and monoclonal antibodies now being widely used to treat various types of human cancers. During cancer treatments, mutations can have important effects on drug sensitivity. However, the relationship between tumor genomic profiles and the effectiveness of cancer drugs remains elusive. We introduce Mutation To Cancer Therapy Scan (mTCTScan) web server (http://jjwanglab.org/mTCTScan) that can systematically analyze mutations affecting cancer drug sensitivity based on individual genomic profiles. The platform was developed by leveraging the latest knowledge on mutation-cancer drug sensitivity associations and the results from large-scale chemical screening using human cancer cell lines. Using an evidence-based scoring scheme based on current integrative evidences, mTCTScan is able to prioritize mutations according to their associations with cancer drugs and preclinical compounds. It can also show related drugs/compounds with sensitivity classification by considering the context of the entire genomic profile. In addition, mTCTScan incorporates comprehensive filtering functions and cancer-related annotations to better interpret mutation effects and their association with cancer drugs. This platform will greatly benefit both researchers and clinicians for interrogating mechanisms of mutation-dependent drug response, which will have a significant impact on cancer precision medicine.


Assuntos
Resistencia a Medicamentos Antineoplásicos/genética , Mutação , Software , Antineoplásicos/farmacologia , Linhagem Celular Tumoral , Genômica , Humanos , Internet , Anotação de Sequência Molecular , Neoplasias/genética
9.
Nucleic Acids Res ; 44(D1): D869-76, 2016 Jan 04.
Artigo em Inglês | MEDLINE | ID: mdl-26615194

RESUMO

Genome-wide association studies (GWASs), now as a routine approach to study single-nucleotide polymorphism (SNP)-trait association, have uncovered over ten thousand significant trait/disease associated SNPs (TASs). Here, we updated GWASdb (GWASdb v2, http://jjwanglab.org/gwasdb) which provides comprehensive data curation and knowledge integration for GWAS TASs. These updates include: (i) Up to August 2015, we collected 2479 unique publications from PubMed and other resources; (ii) We further curated moderate SNP-trait associations (P-value < 1.0 × 10(-3)) from each original publication, and generated a total of 252,530 unique TASs in all GWASdb v2 collected studies; (iii) We manually mapped 1610 GWAS traits to 501 Human Phenotype Ontology (HPO) terms, 435 Disease Ontology (DO) terms and 228 Disease Ontology Lite (DOLite) terms. For each ontology term, we also predicted the putative causal genes; (iv) We curated the detailed sub-populations and related sample size for each study; (v) Importantly, we performed extensive function annotation for each TAS by incorporating gene-based information, ENCODE ChIP-seq assays, eQTL, population haplotype, functional prediction across multiple biological domains, evolutionary signals and disease-related annotation; (vi) Additionally, we compiled a SNP-drug response association dataset for 650 pharmacogenetic studies involving 257 drugs in this update; (vii) Last, we improved the user interface of website.


Assuntos
Bases de Dados Genéticas , Estudo de Associação Genômica Ampla , Polimorfismo de Nucleotídeo Único , Ontologias Biológicas , Doença/genética , Genes , Humanos , Anotação de Sequência Molecular
10.
Bioinformatics ; 32(18): 2729-36, 2016 09 15.
Artigo em Inglês | MEDLINE | ID: mdl-27273672

RESUMO

MOTIVATION: Prediction and prioritization of human non-coding regulatory variants is critical for understanding the regulatory mechanisms of disease pathogenesis and promoting personalized medicine. Existing tools utilize functional genomics data and evolutionary information to evaluate the pathogenicity or regulatory functions of non-coding variants. However, different algorithms lead to inconsistent and even conflicting predictions. Combining multiple methods may increase accuracy in regulatory variant prediction. RESULTS: Here, we compiled an integrative resource for predictions from eight different tools on functional annotation of non-coding variants. We further developed a composite strategy to integrate multiple predictions and computed the composite likelihood of a given variant being regulatory variant. Benchmarked by multiple independent causal variants datasets, we demonstrated that our composite model significantly improves the prediction performance. AVAILABILITY AND IMPLEMENTATION: We implemented our model and scoring procedure as a tool, named PRVCS, which is freely available to academic and non-profit usage at http://jjwanglab.org/PRVCS CONTACT: wang.junwen@mayo.edu, jliu@stat.harvard.edu, or limx54@gmail.com SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Algoritmos , Modelos Teóricos , Anotação de Sequência Molecular , Software , Evolução Biológica , Variação Genética , Humanos , RNA não Traduzido
11.
Nucleic Acids Res ; 43(W1): W264-9, 2015 Jul 01.
Artigo em Inglês | MEDLINE | ID: mdl-25916854

RESUMO

Transcription factors (TFs) play an important role in gene regulation. The interconnections among TFs, chromatin interactions, epigenetic marks and cis-regulatory elements form a complex gene transcription apparatus. Our previous work, ChIP-Array, combined TF binding and transcriptome data to construct gene regulatory networks (GRNs). Here we present an enhanced version, ChIP-Array 2, to integrate additional types of omics data including long-range chromatin interaction, open chromatin region and histone modification data to dissect more comprehensive GRNs involving diverse regulatory components. Moreover, we substantially extended our motif database for human, mouse, rat, fruit fly, worm, yeast and Arabidopsis, and curated large amount of omics data for users to select as input or backend support. With ChIP-Array 2, we compiled a library containing regulatory networks of 18 TFs/chromatin modifiers in mouse embryonic stem cell (mESC). The web server and the mESC library are publicly free and accessible athttp://jjwanglab.org/chip-array.


Assuntos
Redes Reguladoras de Genes , Software , Animais , Cromatina/metabolismo , Imunoprecipitação da Cromatina , Células-Tronco Embrionárias/metabolismo , Perfilação da Expressão Gênica , Genômica , Histonas/metabolismo , Humanos , Internet , Camundongos , Análise de Sequência com Séries de Oligonucleotídeos , Ratos , Fatores de Transcrição/metabolismo
12.
Nucleic Acids Res ; 42(15): e121, 2014 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-25034693

RESUMO

Conventionally, overall gene expressions from microarrays are used to infer gene networks, but it is challenging to account splicing isoforms. High-throughput RNA Sequencing has made splice variant profiling practical. However, its true merit in quantifying splicing isoforms and isoform-specific exon expressions is not well explored in inferring gene networks. This study demonstrates SpliceNet, a method to infer isoform-specific co-expression networks from exon-level RNA-Seq data, using large dimensional trace. It goes beyond differentially expressed genes and infers splicing isoform network changes between normal and diseased samples. It eases the sample size bottleneck; evaluations on simulated data and lung cancer-specific ERBB2 and MAPK signaling pathways, with varying number of samples, evince the merit in handling high exon to sample size ratio datasets. Inferred network rewiring of well established Bcl-x and EGFR centered networks from lung adenocarcinoma expression data is in good agreement with literature. Gene level evaluations demonstrate a substantial performance of SpliceNet over canonical correlation analysis, a method that is currently applied to exon level RNA-Seq data. SpliceNet can also be applied to exon array data. SpliceNet is distributed as an R package available at http://www.jjwanglab.org/SpliceNet.


Assuntos
Processamento Alternativo , Perfilação da Expressão Gênica/métodos , Redes Reguladoras de Genes , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Neoplasias/genética , Isoformas de Proteínas/genética , Análise de Sequência de RNA/métodos , Carcinoma Pulmonar de Células não Pequenas/genética , Humanos , Neoplasias Pulmonares/genética , Isoformas de Proteínas/metabolismo , Transdução de Sinais , Software
13.
Nucleic Acids Res ; 42(Web Server issue): W130-6, 2014 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-24875471

RESUMO

Interactions among transcriptional factors (TFs), cofactors and other proteins or enzymes can affect transcriptional regulatory capabilities of eukaryotic organisms. Post-translational modifications (PTMs) cooperate with TFs and epigenetic alterations to constitute a hierarchical complexity in transcriptional gene regulation. While clearly implicated in biological processes, our understanding of these complex regulatory mechanisms is still limited and incomplete. Various online software have been proposed for uncovering transcriptional and epigenetic regulatory networks, however, there is a lack of effective web-based software capable of constructing underlying interactive organizations between post-translational and transcriptional regulatory components. Here, we present an open web server, post-translational hierarchical gene regulatory network (PTHGRN) to unravel relationships among PTMs, TFs, epigenetic modifications and gene expression. PTHGRN utilizes a graphical Gaussian model with partial least squares regression-based methodology, and is able to integrate protein-protein interactions, ChIP-seq and gene expression data and to capture essential regulation features behind high-throughput data. The server provides an integrative platform for users to analyze ready-to-use public high-throughput Omics resources or upload their own data for systems biology study. Users can choose various parameters in the method, build network topologies of interests and dissect their associations with biological functions. Application of the software to stem cell and breast cancer demonstrates that it is an effective tool for understanding regulatory mechanisms in biological complex systems. PTHGRN web server is publically available at web site http://www.byanbioinfo.org/pthgrn.


Assuntos
Imunoprecipitação da Cromatina , Perfilação da Expressão Gênica , Redes Reguladoras de Genes , Mapeamento de Interação de Proteínas , Software , Animais , Neoplasias da Mama/genética , Neoplasias da Mama/metabolismo , Diferenciação Celular/genética , Células-Tronco Embrionárias/citologia , Células-Tronco Embrionárias/metabolismo , Feminino , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Internet , Células MCF-7 , Camundongos , Processamento de Proteína Pós-Traducional , Ratos , Fatores de Transcrição/metabolismo
14.
Hum Mutat ; 36(5): 496-503, 2015 May.
Artigo em Inglês | MEDLINE | ID: mdl-25676918

RESUMO

With the rapid advances in high-throughput sequencing technologies, exome sequencing and targeted region sequencing have become routine approaches for identifying mutations of inherited disorders in both genetics research and molecular diagnosis. There is an imminent need for comprehensive and easy-to-use downstream analysis tools to isolate causal mutations in exome sequencing studies. We have developed a user-friendly online framework, wKGGSeq, to provide systematic annotation, filtration, prioritization, and visualization functions for characterizing causal mutation(s) in exome sequencing studies of inherited disorders. wKGGSeq provides: (1) a novel strategy-based procedure for downstream analysis of a large amount of exome sequencing data and (2) a disease-targeted analysis procedure to facilitate clinical diagnosis of well-studied genetic diseases. In addition, it is also equipped with abundant online annotation functions for sequence variants. We demonstrate that wKGGSeq either outperforms or is comparable to two popular tools in several real exome sequencing samples. This tool will greatly facilitate the downstream analysis of exome sequencing data and can play a useful role for researchers and clinicians in identifying causal mutations of inherited disorders. The wKGGSeq is freely available at http://statgenpro.psychiatry.hku.hk/wkggseq or http://jjwanglab.org/wkggseq, and will be updated frequently.


Assuntos
Biologia Computacional/métodos , Internet , Software , Bases de Dados Genéticas , Exoma , Estudos de Associação Genética/métodos , Doenças Genéticas Inatas/diagnóstico , Doenças Genéticas Inatas/genética , Genômica , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Anotação de Sequência Molecular , Navegador
15.
Bioinformatics ; 30(17): 2498-500, 2014 Sep 01.
Artigo em Inglês | MEDLINE | ID: mdl-24833803

RESUMO

UNLABELLED: Recent advances in high-throughput sequencing technologies have enabled us to sequence large number of cancer samples to reveal novel insights into oncogenetic mechanisms. However, the presence of intratumoral heterogeneity, normal cell contamination and insufficient sequencing depth, together pose a challenge for detecting somatic mutations. Here we propose a fast and an accurate somatic single-nucleotide variations (SNVs) detection program, FaSD-somatic. The performance of FaSD-somatic is extensively assessed on various types of cancer against several state-of-the-art somatic SNV detection programs. Benchmarked by somatic SNVs from either existing databases or de novo higher-depth sequencing data, FaSD-somatic has the best overall performance. Furthermore, FaSD-somatic is efficient, it finishes somatic SNV calling within 14 h on 50X whole genome sequencing data in paired samples. AVAILABILITY AND IMPLEMENTATION: The program, datasets and supplementary files are available at http://jjwanglab.org/FaSD-somatic/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Algoritmos , Variação Genética , Sequenciamento de Nucleotídeos em Larga Escala , Neoplasias/genética , Bases de Dados de Ácidos Nucleicos , Genômica , Humanos
16.
Bioinformatics ; 30(8): 1190-1192, 2014 04 15.
Artigo em Inglês | MEDLINE | ID: mdl-24389658

RESUMO

ChIP-seq technology provides an accurate characterization of transcription or epigenetic factors binding on genomic sequences. With integration of such ChIP-based and other high-throughput information, it would be dedicated to dissecting cross-interactions among multilevel regulators, genes and biological functions. Here, we devised an integrative web server CMGRN (constructing multilevel gene regulatory networks), to unravel hierarchical interactive networks at different regulatory levels. The newly developed method used the Bayesian network modeling to infer causal interrelationships among transcription factors or epigenetic modifications by using ChIP-seq data. Moreover, it used Bayesian hierarchical model with Gibbs sampling to incorporate binding signals of these regulators and gene expression profile together for reconstructing gene regulatory networks. The example applications indicate that CMGRN provides an effective web-based framework that is able to integrate heterogeneous high-throughput data and to reveal hierarchical 'regulome' and the associated gene expression programs. AVAILABILITY: http://bioinfo.icts.hkbu.edu.hk/cmgrn; http://www.byanbioinfo.org/cmgrn CONTACT: yanbinai6017@gmail.com or junwen@hku.hk Supplementary Information: Supplementary data are available at Bioinformatics online.


Assuntos
Biologia Computacional/métodos , Redes de Comunicação de Computadores , Redes Reguladoras de Genes , Genômica/métodos , Teorema de Bayes , Imunoprecipitação da Cromatina , Epigênese Genética , Expressão Gênica , Internet , Análise de Sequência com Séries de Oligonucleotídeos , Software , Fatores de Transcrição/metabolismo
17.
Mol Cell Proteomics ; 12(11): 3379-87, 2013 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-23924514

RESUMO

MicroRNAs (miRNAs) regulate gene expression through translational repression and RNA degradation. Recently developed high-throughput proteomic methods measure gene expression changes at protein level and therefore can reveal the direct effects of miRNAs' translational repression. Here, we present a web server, ProteoMirExpress, that integrates proteomic and mRNA expression data together to infer miRNA-centered regulatory networks. With both types of high-throughput data from the users, ProteoMirExpress is able to discover not only miRNA targets that have decreased mRNA, but also subgroups of targets with suppressed proteins whose mRNAs are not significantly changed or with decreased mRNA whose proteins are not significantly changed, which are usually ignored by most current methods. Furthermore, both direct and indirect targets of miRNAs can be detected. Therefore, ProteoMirExpress provides more comprehensive miRNA-centered regulatory networks. We used several published data to assess the quality of our inferred networks and prove the value of our server. ProteoMirExpress is available online, with free access to academic users.


Assuntos
MicroRNAs/genética , MicroRNAs/metabolismo , Proteoma/genética , Proteoma/metabolismo , Proteômica/métodos , RNA Mensageiro/genética , RNA Mensageiro/metabolismo , Perfilação da Expressão Gênica/estatística & dados numéricos , Redes Reguladoras de Genes , Células HeLa , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Sequenciamento de Nucleotídeos em Larga Escala/estatística & dados numéricos , Humanos , Proteômica/estatística & dados numéricos , Software
19.
Nucleic Acids Res ; 40(Database issue): D1047-54, 2012 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-22139925

RESUMO

Recent advances in genome-wide association studies (GWAS) have enabled us to identify thousands of genetic variants (GVs) that are associated with human diseases. As next-generation sequencing technologies become less expensive, more GVs will be discovered in the near future. Existing databases, such as NHGRI GWAS Catalog, collect GVs with only genome-wide level significance. However, many true disease susceptibility loci have relatively moderate P values and are not included in these databases. We have developed GWASdb that contains 20 times more data than the GWAS Catalog and includes less significant GVs (P < 1.0 × 10(-3)) manually curated from the literature. In addition, GWASdb provides comprehensive functional annotations for each GV, including genomic mapping information, regulatory effects (transcription factor binding sites, microRNA target sites and splicing sites), amino acid substitutions, evolution, gene expression and disease associations. Furthermore, GWASdb classifies these GVs according to diseases using Disease-Ontology Lite and Human Phenotype Ontology. It can conduct pathway enrichment and PPI network association analysis for these diseases. GWASdb provides an intuitive, multifunctional database for biologists and clinicians to explore GVs and their functional inferences. It is freely available at http://jjwanglab.org/gwasdb and will be updated frequently.


Assuntos
Bases de Dados Genéticas , Doença/genética , Variação Genética , Estudo de Associação Genômica Ampla , Mapeamento Cromossômico , Humanos , Anotação de Sequência Molecular , Interface Usuário-Computador
20.
Nucleic Acids Res ; 39(Web Server issue): W430-6, 2011 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-21586587

RESUMO

Chromatin immunoprecipitation (ChIP) coupled with high-throughput techniques (ChIP-X), such as next generation sequencing (ChIP-Seq) and microarray (ChIP-chip), has been successfully used to map active transcription factor binding sites (TFBS) of a transcription factor (TF). The targeted genes can be activated or suppressed by the TF, or are unresponsive to the TF. Microarray technology has been used to measure the actual expression changes of thousands of genes under the perturbation of a TF, but is unable to determine if the affected genes are direct or indirect targets of the TF. Furthermore, both ChIP-X and microarray methods produce a large number of false positives. Combining microarray expression profiling and ChIP-X data allows more effective TFBS analysis for studying the function of a TF. However, current web servers only provide tools to analyze either ChIP-X or expression data, but not both. Here, we present ChIP-Array, a web server that integrates ChIP-X and expression data from human, mouse, yeast, fruit fly and Arabidopsis. This server will assist biologists to detect direct and indirect target genes regulated by a TF of interest and to aid in the functional characterization of the TF. ChIP-Array is available at http://jjwanglab.hku.hk/ChIP-Array, with free access to academic users.


Assuntos
Imunoprecipitação da Cromatina , Perfilação da Expressão Gênica , Análise de Sequência com Séries de Oligonucleotídeos , Software , Fatores de Transcrição/metabolismo , Animais , Sítios de Ligação , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Camundongos , Elementos Reguladores de Transcrição
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA