Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 43
Filtrar
1.
Genome Res ; 30(12): 1789-1801, 2020 12.
Artículo en Inglés | MEDLINE | ID: mdl-33060171

RESUMEN

The advances of large-scale genomics studies have enabled compilation of cell type-specific, genome-wide DNA functional elements at high resolution. With the growing volume of functional annotation data and sequencing variants, existing variant annotation algorithms lack the efficiency and scalability to process big genomic data, particularly when annotating whole-genome sequencing variants against a huge database with billions of genomic features. Here, we develop VarNote to rapidly annotate genome-scale variants in large and complex functional annotation resources. Equipped with a novel index system and a parallel random-sweep searching algorithm, VarNote shows substantial performance improvements (two to three orders of magnitude) over existing algorithms at different scales. It supports both region-based and allele-specific annotations and introduces advanced functions for the flexible extraction of annotations. By integrating massive base-wise and context-dependent annotations in the VarNote framework, we introduce three efficient and accurate pipelines to prioritize the causal regulatory variants for common diseases, Mendelian disorders, and cancers.


Asunto(s)
Biología Computacional/métodos , Predisposición Genética a la Enfermedad/genética , Algoritmos , Bases de Datos Genéticas , Variación Genética , Genoma Humano , Humanos , Anotación de Secuencia Molecular , Secuenciación Completa del Genoma
2.
Int J Obes (Lond) ; 45(6): 1331-1341, 2021 06.
Artículo en Inglés | MEDLINE | ID: mdl-33753887

RESUMEN

BACKGROUND: Long chain omega-3 polyunsaturated fatty acids (ω-3PUFA) supplementation in animal models of diet-induced obesity has consistently shown to improve insulin sensitivity. The same is not always reported in human studies with insulin resistant (IR) subjects with obesity. OBJECTIVE: We studied whether high-dose ω-3PUFA supplementation for 3 months improves insulin sensitivity and adipose tissue (AT) inflammation in IR subjects with obesity. METHODS: Thirteen subjects (BMI = 39.3 ± 1.6 kg/m2) underwent 80 mU/m2·min euglycemic-hyperinsulinemic clamp with subcutaneous (Sc) AT biopsy before and after 3 months of ω-3PUFA (DHA and EPA, 4 g/daily) supplementation. Cytoadipokine plasma profiles were assessed before and after ω-3PUFA. AT-specific inflammatory gene expression was evaluated on Sc fat biopsies. Microarray analysis was performed on the fat biopsies collected during the program. RESULTS: Palmitic and stearic acid plasma levels were significantly reduced (P < 0.05) after ω-3PUFA. Gene expression of pro-inflammatory markers and adipokines were improved after ω-3PUFA (P < 0.05). Systemic inflammation was decreased after ω-3PUFA, as shown by cytokine assessment (P < 0.05). These changes were associated with a 25% increase in insulin-stimulated glucose disposal (4.7 ± 0.6 mg/kg ffm•min vs. 5.9 ± 0.9 mg/kg ffm•min) despite no change in body weight. Microarray analysis identified 53 probe sets significantly altered post- ω-3PUFA, with Apolipoprotein E (APOE) being one of the most upregulated genes. CONCLUSION: High dose of long chain ω-3PUFA supplementation modulates significant changes in plasma fatty acid profile, AT, and systemic inflammation. These findings are associated with significant improvement of insulin-stimulated glucose disposal. Unbiased microarray analysis of Sc fat biopsy identified APOE as among the most differentially regulated gene after ω-3PUFA supplementation. We speculate that ω-3PUFA increases macrophage-derived APOE mRNA levels with anti-inflammatory properties.


Asunto(s)
Tejido Adiposo , Ácidos Grasos Omega-3 , Inflamación/metabolismo , Obesidad/metabolismo , Tejido Adiposo/efectos de los fármacos , Tejido Adiposo/metabolismo , Adulto , Apolipoproteínas E/genética , Apolipoproteínas E/metabolismo , Glucemia/efectos de los fármacos , Ácidos Grasos Omega-3/administración & dosificación , Ácidos Grasos Omega-3/farmacología , Femenino , Humanos , Resistencia a la Insulina/fisiología , Masculino , Grasa Subcutánea/metabolismo , Transcriptoma/efectos de los fármacos , Transcriptoma/genética
3.
Cardiology ; 146(3): 311-314, 2021.
Artículo en Inglés | MEDLINE | ID: mdl-33735875

RESUMEN

National Cardiovascular Data Registry (NCDR)-based logistic regression model is available for clinicians to predict in-hospital all-cause mortality after a percutaneous coronary intervention (PCI). However, this model has never been used to predict long-term all-cause mortality after PCI. Therefore, we sought to test the ability of the NCDR model to predict the short- and long-term risk of all-cause mortality in patients undergoing PCI. All patients undergoing PCI in the Mayo Clinic Health System were enrolled in the Mayo Clinic CathPCI registry. Patient-level demographic, clinical, and angiographic data from January 2006 to December 2017 were extracted from the registry. Patients who underwent coronary artery bypass graft surgery (CABG) were excluded. The area under the receiver operator characteristic curve (AUC) was calculated to assess the ability of the NCDR model to predict outcomes of interest (6-month, 1-year, 2-year, and 5-year all-cause mortality) after PCI. A total of 17,356 unique patients were included for the final analysis after excluding 165 patients who underwent CABG surgery. The mean age was 66.9 ± 12.5 years, and 71% were men. The 6-month, 1-year, 2-year, and 5-year all-cause mortality rates were 4.2% (n = 737), 5.8% (n = 1,005), 8.06% (n = 1,399), and 14.2% (n = 2,472), respectively. The AUCs of the NCDR model to predict 6-month, 1-year, 2-year, and 5-year all-cause mortality were 0.84 (95% CI: 0.82-0.86), 0.82 (95% CI: 0.80-0.84), 0.80 (95% CI: 0.79-0.81), and 0.78 (95% CI: 0.77-0.79), respectively. The NCDR model was able to accurately predict both short- and long-term all-cause mortality after PCI.


Asunto(s)
Enfermedad de la Arteria Coronaria , Intervención Coronaria Percutánea , Anciano , Mortalidad Hospitalaria , Humanos , Masculino , Sistema de Registros , Medición de Riesgo , Factores de Riesgo , Factores de Tiempo , Resultado del Tratamiento
4.
Nucleic Acids Res ; 46(W1): W114-W120, 2018 07 02.
Artículo en Inglés | MEDLINE | ID: mdl-29771388

RESUMEN

Genome-wide association studies have generated over thousands of susceptibility loci for many human complex traits, and yet for most of these associations the true causal variants remain unknown. Tissue/cell type-specific prediction and prioritization of non-coding regulatory variants will facilitate the identification of causal variants and underlying pathogenic mechanisms for particular complex diseases and traits. By leveraging recent large-scale functional genomics/epigenomics data, we develop an intuitive web server, GWAS4D (http://mulinlab.tmu.edu.cn/gwas4d or http://mulinlab.org/gwas4d), that systematically evaluates GWAS signals and identifies context-specific regulatory variants. The updated web server includes six major features: (i) updates the regulatory variant prioritization method with our new algorithm; (ii) incorporates 127 tissue/cell type-specific epigenomes data; (iii) integrates motifs of 1480 transcriptional regulators from 13 public resources; (iv) uniformly processes Hi-C data and generates significant interactions at 5 kb resolution across 60 tissues/cell types; (v) adds comprehensive non-coding variant functional annotations; (vi) equips a highly interactive visualization function for SNP-target interaction. Using a GWAS fine-mapped set for 161 coronary artery disease risk loci, we demonstrate that GWAS4D is able to efficiently prioritize disease-causal regulatory variants.


Asunto(s)
Enfermedades Genéticas Congénitas , Estudio de Asociación del Genoma Completo , Sitios de Carácter Cuantitativo/genética , Programas Informáticos , Biología Computacional/tendencias , Genómica/métodos , Humanos , Polimorfismo de Nucleótido Simple/genética
5.
Echocardiography ; 37(11): 1792-1802, 2020 11.
Artículo en Inglés | MEDLINE | ID: mdl-33012034

RESUMEN

INTRODUCTION: The right ventricle (RV) strain measured by speckle tracking (RVS) is an echocardiographic parameter used to assess RV function. We compared RVS to RV fractional area change (FAC%), tricuspid annular plane systolic excursion (TAPSE) and Doppler tissue imaging-derived peak systolic velocity (S') in the assessment of right ventricular (RV) systolic function measured using cardiac magnetic resonance imaging (MRI). METHODS: We enrolled consecutive patients who underwent cardiac MRI between Jan 2012 and Dec 2017 and a transthoracic echocardiogram (TTE) within 1 month of the MRI with no interval event. Baseline clinical characteristics and MRI parameters were extracted from chart review. Echocardiographic parameters were measured prospectively. TTE parameters including RVS, TAPSE, S', and FAC% were tested for accuracy to identify impaired RV EF (EF < 45% & <30%) using receiver operator curves. RESULTS: The study cohort included 500 patients with mean age 55 years ± 18 and peak tricuspid regurgitation velocity 2.7 ± 1.4 m/s. The area under ROC for RVS was 0.69 (95% CI 0.63-0.75) and 0.78 (95% CI 0.70-0.88) to predict RVEF < 45% & RVEF < 30%, respectively. The RV FAC% had second highest accuracy of predicting RVEF among all the TTE parameters tested in study. CONCLUSION: Right ventricular strain is the most accurate echocardiographic method to detect impaired right ventricular systolic function when using MRI as the gold standard.


Asunto(s)
Disfunción Ventricular Derecha , Función Ventricular Derecha , Ecocardiografía , Humanos , Imagen por Resonancia Magnética , Persona de Mediana Edad , Sensibilidad y Especificidad , Volumen Sistólico , Disfunción Ventricular Derecha/diagnóstico por imagen
6.
Nucleic Acids Res ; 45(22): e179, 2017 Dec 15.
Artículo en Inglés | MEDLINE | ID: mdl-28981748

RESUMEN

Linnorm is a novel normalization and transformation method for the analysis of single cell RNA sequencing (scRNA-seq) data. Linnorm is developed to remove technical noises and simultaneously preserve biological variations in scRNA-seq data, such that existing statistical methods can be improved. Using real scRNA-seq data, we compared Linnorm with existing normalization methods, including NODES, SAMstrt, SCnorm, scran, DESeq and TMM. Linnorm shows advantages in speed, technical noise removal and preservation of cell heterogeneity, which can improve existing methods in the discovery of novel subtypes, pseudo-temporal ordering of cells, clustering analysis, etc. Linnorm also performs better than existing DEG analysis methods, including BASiCS, NODES, SAMstrt, Seurat and DESeq2, in false positive rate control and accuracy.


Asunto(s)
Algoritmos , Bioestadística/métodos , Perfilación de la Expresión Génica/métodos , Análisis de Secuencia de ARN/métodos , Análisis de la Célula Individual/métodos , Análisis por Conglomerados , Modelos Lineales , ARN/clasificación , ARN/genética , Reproducibilidad de los Resultados
7.
Nucleic Acids Res ; 45(10): 5653-5665, 2017 Jun 02.
Artículo en Inglés | MEDLINE | ID: mdl-28472449

RESUMEN

Competing endogenous RNAs (ceRNAs) are RNA molecules that sequester shared microRNAs (miRNAs) thereby affecting the expression of other targets of the miRNAs. Whether genetic variants in ceRNA can affect its biological function and disease development is still an open question. Here we identified a large number of genetic variants that are associated with ceRNA's function using Geuvaids RNA-seq data for 462 individuals from the 1000 Genomes Project. We call these loci competing endogenous RNA expression quantitative trait loci or 'cerQTL', and found that a large number of them were unexplored in conventional eQTL mapping. We identified many cerQTLs that have undergone recent positive selection in different human populations, and showed that single nucleotide polymorphisms in gene 3΄UTRs at the miRNA seed binding regions can simultaneously regulate gene expression changes in both cis and trans by the ceRNA mechanism. We also discovered that cerQTLs are significantly enriched in traits/diseases associated variants reported from genome-wide association studies in the miRNA binding sites, suggesting that disease susceptibilities could be attributed to ceRNA regulation. Further in vitro functional experiments demonstrated that a cerQTL rs11540855 can regulate ceRNA function. These results provide a comprehensive catalog of functional non-coding regulatory variants that may be responsible for ceRNA crosstalk at the post-transcriptional level.


Asunto(s)
Regulación de la Expresión Génica , Redes Reguladoras de Genes , Genoma Humano , MicroARNs/genética , Sitios de Carácter Cuantitativo , ARN no Traducido/genética , Regiones no Traducidas 3' , Emparejamiento Base , Sitios de Unión , Mapeo Cromosómico , Estudio de Asociación del Genoma Completo , Humanos , MicroARNs/metabolismo , Polimorfismo de Nucleótido Simple , ARN no Traducido/metabolismo
8.
Nucleic Acids Res ; 45(W1): W215-W221, 2017 07 03.
Artículo en Inglés | MEDLINE | ID: mdl-28482068

RESUMEN

Cancer therapies have experienced rapid progress in recent years, with a number of novel small-molecule kinase inhibitors and monoclonal antibodies now being widely used to treat various types of human cancers. During cancer treatments, mutations can have important effects on drug sensitivity. However, the relationship between tumor genomic profiles and the effectiveness of cancer drugs remains elusive. We introduce Mutation To Cancer Therapy Scan (mTCTScan) web server (http://jjwanglab.org/mTCTScan) that can systematically analyze mutations affecting cancer drug sensitivity based on individual genomic profiles. The platform was developed by leveraging the latest knowledge on mutation-cancer drug sensitivity associations and the results from large-scale chemical screening using human cancer cell lines. Using an evidence-based scoring scheme based on current integrative evidences, mTCTScan is able to prioritize mutations according to their associations with cancer drugs and preclinical compounds. It can also show related drugs/compounds with sensitivity classification by considering the context of the entire genomic profile. In addition, mTCTScan incorporates comprehensive filtering functions and cancer-related annotations to better interpret mutation effects and their association with cancer drugs. This platform will greatly benefit both researchers and clinicians for interrogating mechanisms of mutation-dependent drug response, which will have a significant impact on cancer precision medicine.


Asunto(s)
Resistencia a Antineoplásicos/genética , Mutación , Programas Informáticos , Antineoplásicos/farmacología , Línea Celular Tumoral , Genómica , Humanos , Internet , Anotación de Secuencia Molecular , Neoplasias/genética
9.
Nucleic Acids Res ; 44(D1): D869-76, 2016 Jan 04.
Artículo en Inglés | MEDLINE | ID: mdl-26615194

RESUMEN

Genome-wide association studies (GWASs), now as a routine approach to study single-nucleotide polymorphism (SNP)-trait association, have uncovered over ten thousand significant trait/disease associated SNPs (TASs). Here, we updated GWASdb (GWASdb v2, http://jjwanglab.org/gwasdb) which provides comprehensive data curation and knowledge integration for GWAS TASs. These updates include: (i) Up to August 2015, we collected 2479 unique publications from PubMed and other resources; (ii) We further curated moderate SNP-trait associations (P-value < 1.0 × 10(-3)) from each original publication, and generated a total of 252,530 unique TASs in all GWASdb v2 collected studies; (iii) We manually mapped 1610 GWAS traits to 501 Human Phenotype Ontology (HPO) terms, 435 Disease Ontology (DO) terms and 228 Disease Ontology Lite (DOLite) terms. For each ontology term, we also predicted the putative causal genes; (iv) We curated the detailed sub-populations and related sample size for each study; (v) Importantly, we performed extensive function annotation for each TAS by incorporating gene-based information, ENCODE ChIP-seq assays, eQTL, population haplotype, functional prediction across multiple biological domains, evolutionary signals and disease-related annotation; (vi) Additionally, we compiled a SNP-drug response association dataset for 650 pharmacogenetic studies involving 257 drugs in this update; (vii) Last, we improved the user interface of website.


Asunto(s)
Bases de Datos Genéticas , Estudio de Asociación del Genoma Completo , Polimorfismo de Nucleótido Simple , Ontologías Biológicas , Enfermedad/genética , Genes , Humanos , Anotación de Secuencia Molecular
10.
Bioinformatics ; 32(18): 2729-36, 2016 09 15.
Artículo en Inglés | MEDLINE | ID: mdl-27273672

RESUMEN

MOTIVATION: Prediction and prioritization of human non-coding regulatory variants is critical for understanding the regulatory mechanisms of disease pathogenesis and promoting personalized medicine. Existing tools utilize functional genomics data and evolutionary information to evaluate the pathogenicity or regulatory functions of non-coding variants. However, different algorithms lead to inconsistent and even conflicting predictions. Combining multiple methods may increase accuracy in regulatory variant prediction. RESULTS: Here, we compiled an integrative resource for predictions from eight different tools on functional annotation of non-coding variants. We further developed a composite strategy to integrate multiple predictions and computed the composite likelihood of a given variant being regulatory variant. Benchmarked by multiple independent causal variants datasets, we demonstrated that our composite model significantly improves the prediction performance. AVAILABILITY AND IMPLEMENTATION: We implemented our model and scoring procedure as a tool, named PRVCS, which is freely available to academic and non-profit usage at http://jjwanglab.org/PRVCS CONTACT: wang.junwen@mayo.edu, jliu@stat.harvard.edu, or limx54@gmail.com SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Algoritmos , Modelos Teóricos , Anotación de Secuencia Molecular , Programas Informáticos , Evolución Biológica , Variación Genética , Humanos , ARN no Traducido
11.
Nucleic Acids Res ; 43(W1): W264-9, 2015 Jul 01.
Artículo en Inglés | MEDLINE | ID: mdl-25916854

RESUMEN

Transcription factors (TFs) play an important role in gene regulation. The interconnections among TFs, chromatin interactions, epigenetic marks and cis-regulatory elements form a complex gene transcription apparatus. Our previous work, ChIP-Array, combined TF binding and transcriptome data to construct gene regulatory networks (GRNs). Here we present an enhanced version, ChIP-Array 2, to integrate additional types of omics data including long-range chromatin interaction, open chromatin region and histone modification data to dissect more comprehensive GRNs involving diverse regulatory components. Moreover, we substantially extended our motif database for human, mouse, rat, fruit fly, worm, yeast and Arabidopsis, and curated large amount of omics data for users to select as input or backend support. With ChIP-Array 2, we compiled a library containing regulatory networks of 18 TFs/chromatin modifiers in mouse embryonic stem cell (mESC). The web server and the mESC library are publicly free and accessible athttp://jjwanglab.org/chip-array.


Asunto(s)
Redes Reguladoras de Genes , Programas Informáticos , Animales , Cromatina/metabolismo , Inmunoprecipitación de Cromatina , Células Madre Embrionarias/metabolismo , Perfilación de la Expresión Génica , Genómica , Histonas/metabolismo , Humanos , Internet , Ratones , Análisis de Secuencia por Matrices de Oligonucleótidos , Ratas , Factores de Transcripción/metabolismo
12.
Nucleic Acids Res ; 42(15): e121, 2014 Sep.
Artículo en Inglés | MEDLINE | ID: mdl-25034693

RESUMEN

Conventionally, overall gene expressions from microarrays are used to infer gene networks, but it is challenging to account splicing isoforms. High-throughput RNA Sequencing has made splice variant profiling practical. However, its true merit in quantifying splicing isoforms and isoform-specific exon expressions is not well explored in inferring gene networks. This study demonstrates SpliceNet, a method to infer isoform-specific co-expression networks from exon-level RNA-Seq data, using large dimensional trace. It goes beyond differentially expressed genes and infers splicing isoform network changes between normal and diseased samples. It eases the sample size bottleneck; evaluations on simulated data and lung cancer-specific ERBB2 and MAPK signaling pathways, with varying number of samples, evince the merit in handling high exon to sample size ratio datasets. Inferred network rewiring of well established Bcl-x and EGFR centered networks from lung adenocarcinoma expression data is in good agreement with literature. Gene level evaluations demonstrate a substantial performance of SpliceNet over canonical correlation analysis, a method that is currently applied to exon level RNA-Seq data. SpliceNet can also be applied to exon array data. SpliceNet is distributed as an R package available at http://www.jjwanglab.org/SpliceNet.


Asunto(s)
Empalme Alternativo , Perfilación de la Expresión Génica/métodos , Redes Reguladoras de Genes , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Neoplasias/genética , Isoformas de Proteínas/genética , Análisis de Secuencia de ARN/métodos , Carcinoma de Pulmón de Células no Pequeñas/genética , Humanos , Neoplasias Pulmonares/genética , Isoformas de Proteínas/metabolismo , Transducción de Señal , Programas Informáticos
13.
Nucleic Acids Res ; 42(Web Server issue): W130-6, 2014 Jul.
Artículo en Inglés | MEDLINE | ID: mdl-24875471

RESUMEN

Interactions among transcriptional factors (TFs), cofactors and other proteins or enzymes can affect transcriptional regulatory capabilities of eukaryotic organisms. Post-translational modifications (PTMs) cooperate with TFs and epigenetic alterations to constitute a hierarchical complexity in transcriptional gene regulation. While clearly implicated in biological processes, our understanding of these complex regulatory mechanisms is still limited and incomplete. Various online software have been proposed for uncovering transcriptional and epigenetic regulatory networks, however, there is a lack of effective web-based software capable of constructing underlying interactive organizations between post-translational and transcriptional regulatory components. Here, we present an open web server, post-translational hierarchical gene regulatory network (PTHGRN) to unravel relationships among PTMs, TFs, epigenetic modifications and gene expression. PTHGRN utilizes a graphical Gaussian model with partial least squares regression-based methodology, and is able to integrate protein-protein interactions, ChIP-seq and gene expression data and to capture essential regulation features behind high-throughput data. The server provides an integrative platform for users to analyze ready-to-use public high-throughput Omics resources or upload their own data for systems biology study. Users can choose various parameters in the method, build network topologies of interests and dissect their associations with biological functions. Application of the software to stem cell and breast cancer demonstrates that it is an effective tool for understanding regulatory mechanisms in biological complex systems. PTHGRN web server is publically available at web site http://www.byanbioinfo.org/pthgrn.


Asunto(s)
Inmunoprecipitación de Cromatina , Perfilación de la Expresión Génica , Redes Reguladoras de Genes , Mapeo de Interacción de Proteínas , Programas Informáticos , Animales , Neoplasias de la Mama/genética , Neoplasias de la Mama/metabolismo , Diferenciación Celular/genética , Células Madre Embrionarias/citología , Células Madre Embrionarias/metabolismo , Femenino , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Internet , Células MCF-7 , Ratones , Procesamiento Proteico-Postraduccional , Ratas , Factores de Transcripción/metabolismo
14.
Hum Mutat ; 36(5): 496-503, 2015 May.
Artículo en Inglés | MEDLINE | ID: mdl-25676918

RESUMEN

With the rapid advances in high-throughput sequencing technologies, exome sequencing and targeted region sequencing have become routine approaches for identifying mutations of inherited disorders in both genetics research and molecular diagnosis. There is an imminent need for comprehensive and easy-to-use downstream analysis tools to isolate causal mutations in exome sequencing studies. We have developed a user-friendly online framework, wKGGSeq, to provide systematic annotation, filtration, prioritization, and visualization functions for characterizing causal mutation(s) in exome sequencing studies of inherited disorders. wKGGSeq provides: (1) a novel strategy-based procedure for downstream analysis of a large amount of exome sequencing data and (2) a disease-targeted analysis procedure to facilitate clinical diagnosis of well-studied genetic diseases. In addition, it is also equipped with abundant online annotation functions for sequence variants. We demonstrate that wKGGSeq either outperforms or is comparable to two popular tools in several real exome sequencing samples. This tool will greatly facilitate the downstream analysis of exome sequencing data and can play a useful role for researchers and clinicians in identifying causal mutations of inherited disorders. The wKGGSeq is freely available at http://statgenpro.psychiatry.hku.hk/wkggseq or http://jjwanglab.org/wkggseq, and will be updated frequently.


Asunto(s)
Biología Computacional/métodos , Internet , Programas Informáticos , Bases de Datos Genéticas , Exoma , Estudios de Asociación Genética/métodos , Enfermedades Genéticas Congénitas/diagnóstico , Enfermedades Genéticas Congénitas/genética , Genómica , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Anotación de Secuencia Molecular , Navegador Web
15.
Bioinformatics ; 30(17): 2498-500, 2014 Sep 01.
Artículo en Inglés | MEDLINE | ID: mdl-24833803

RESUMEN

UNLABELLED: Recent advances in high-throughput sequencing technologies have enabled us to sequence large number of cancer samples to reveal novel insights into oncogenetic mechanisms. However, the presence of intratumoral heterogeneity, normal cell contamination and insufficient sequencing depth, together pose a challenge for detecting somatic mutations. Here we propose a fast and an accurate somatic single-nucleotide variations (SNVs) detection program, FaSD-somatic. The performance of FaSD-somatic is extensively assessed on various types of cancer against several state-of-the-art somatic SNV detection programs. Benchmarked by somatic SNVs from either existing databases or de novo higher-depth sequencing data, FaSD-somatic has the best overall performance. Furthermore, FaSD-somatic is efficient, it finishes somatic SNV calling within 14 h on 50X whole genome sequencing data in paired samples. AVAILABILITY AND IMPLEMENTATION: The program, datasets and supplementary files are available at http://jjwanglab.org/FaSD-somatic/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Algoritmos , Variación Genética , Secuenciación de Nucleótidos de Alto Rendimiento , Neoplasias/genética , Bases de Datos de Ácidos Nucleicos , Genómica , Humanos
16.
Bioinformatics ; 30(8): 1190-1192, 2014 04 15.
Artículo en Inglés | MEDLINE | ID: mdl-24389658

RESUMEN

ChIP-seq technology provides an accurate characterization of transcription or epigenetic factors binding on genomic sequences. With integration of such ChIP-based and other high-throughput information, it would be dedicated to dissecting cross-interactions among multilevel regulators, genes and biological functions. Here, we devised an integrative web server CMGRN (constructing multilevel gene regulatory networks), to unravel hierarchical interactive networks at different regulatory levels. The newly developed method used the Bayesian network modeling to infer causal interrelationships among transcription factors or epigenetic modifications by using ChIP-seq data. Moreover, it used Bayesian hierarchical model with Gibbs sampling to incorporate binding signals of these regulators and gene expression profile together for reconstructing gene regulatory networks. The example applications indicate that CMGRN provides an effective web-based framework that is able to integrate heterogeneous high-throughput data and to reveal hierarchical 'regulome' and the associated gene expression programs. AVAILABILITY: http://bioinfo.icts.hkbu.edu.hk/cmgrn; http://www.byanbioinfo.org/cmgrn CONTACT: yanbinai6017@gmail.com or junwen@hku.hk Supplementary Information: Supplementary data are available at Bioinformatics online.


Asunto(s)
Biología Computacional/métodos , Redes de Comunicación de Computadores , Redes Reguladoras de Genes , Genómica/métodos , Teorema de Bayes , Inmunoprecipitación de Cromatina , Epigénesis Genética , Expresión Génica , Internet , Análisis de Secuencia por Matrices de Oligonucleótidos , Programas Informáticos , Factores de Transcripción/metabolismo
17.
Mol Cell Proteomics ; 12(11): 3379-87, 2013 Nov.
Artículo en Inglés | MEDLINE | ID: mdl-23924514

RESUMEN

MicroRNAs (miRNAs) regulate gene expression through translational repression and RNA degradation. Recently developed high-throughput proteomic methods measure gene expression changes at protein level and therefore can reveal the direct effects of miRNAs' translational repression. Here, we present a web server, ProteoMirExpress, that integrates proteomic and mRNA expression data together to infer miRNA-centered regulatory networks. With both types of high-throughput data from the users, ProteoMirExpress is able to discover not only miRNA targets that have decreased mRNA, but also subgroups of targets with suppressed proteins whose mRNAs are not significantly changed or with decreased mRNA whose proteins are not significantly changed, which are usually ignored by most current methods. Furthermore, both direct and indirect targets of miRNAs can be detected. Therefore, ProteoMirExpress provides more comprehensive miRNA-centered regulatory networks. We used several published data to assess the quality of our inferred networks and prove the value of our server. ProteoMirExpress is available online, with free access to academic users.


Asunto(s)
MicroARNs/genética , MicroARNs/metabolismo , Proteoma/genética , Proteoma/metabolismo , Proteómica/métodos , ARN Mensajero/genética , ARN Mensajero/metabolismo , Perfilación de la Expresión Génica/estadística & datos numéricos , Redes Reguladoras de Genes , Células HeLa , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Secuenciación de Nucleótidos de Alto Rendimiento/estadística & datos numéricos , Humanos , Proteómica/estadística & datos numéricos , Programas Informáticos
19.
Nucleic Acids Res ; 40(Database issue): D1047-54, 2012 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-22139925

RESUMEN

Recent advances in genome-wide association studies (GWAS) have enabled us to identify thousands of genetic variants (GVs) that are associated with human diseases. As next-generation sequencing technologies become less expensive, more GVs will be discovered in the near future. Existing databases, such as NHGRI GWAS Catalog, collect GVs with only genome-wide level significance. However, many true disease susceptibility loci have relatively moderate P values and are not included in these databases. We have developed GWASdb that contains 20 times more data than the GWAS Catalog and includes less significant GVs (P < 1.0 × 10(-3)) manually curated from the literature. In addition, GWASdb provides comprehensive functional annotations for each GV, including genomic mapping information, regulatory effects (transcription factor binding sites, microRNA target sites and splicing sites), amino acid substitutions, evolution, gene expression and disease associations. Furthermore, GWASdb classifies these GVs according to diseases using Disease-Ontology Lite and Human Phenotype Ontology. It can conduct pathway enrichment and PPI network association analysis for these diseases. GWASdb provides an intuitive, multifunctional database for biologists and clinicians to explore GVs and their functional inferences. It is freely available at http://jjwanglab.org/gwasdb and will be updated frequently.


Asunto(s)
Bases de Datos Genéticas , Enfermedad/genética , Variación Genética , Estudio de Asociación del Genoma Completo , Mapeo Cromosómico , Humanos , Anotación de Secuencia Molecular , Interfaz Usuario-Computador
20.
Nucleic Acids Res ; 39(Web Server issue): W430-6, 2011 Jul.
Artículo en Inglés | MEDLINE | ID: mdl-21586587

RESUMEN

Chromatin immunoprecipitation (ChIP) coupled with high-throughput techniques (ChIP-X), such as next generation sequencing (ChIP-Seq) and microarray (ChIP-chip), has been successfully used to map active transcription factor binding sites (TFBS) of a transcription factor (TF). The targeted genes can be activated or suppressed by the TF, or are unresponsive to the TF. Microarray technology has been used to measure the actual expression changes of thousands of genes under the perturbation of a TF, but is unable to determine if the affected genes are direct or indirect targets of the TF. Furthermore, both ChIP-X and microarray methods produce a large number of false positives. Combining microarray expression profiling and ChIP-X data allows more effective TFBS analysis for studying the function of a TF. However, current web servers only provide tools to analyze either ChIP-X or expression data, but not both. Here, we present ChIP-Array, a web server that integrates ChIP-X and expression data from human, mouse, yeast, fruit fly and Arabidopsis. This server will assist biologists to detect direct and indirect target genes regulated by a TF of interest and to aid in the functional characterization of the TF. ChIP-Array is available at http://jjwanglab.hku.hk/ChIP-Array, with free access to academic users.


Asunto(s)
Inmunoprecipitación de Cromatina , Perfilación de la Expresión Génica , Análisis de Secuencia por Matrices de Oligonucleótidos , Programas Informáticos , Factores de Transcripción/metabolismo , Animales , Sitios de Unión , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Ratones , Elementos Reguladores de la Transcripción
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA