Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 4 de 4
Filtrar
1.
BMC Bioinformatics ; 15: 280, 2014 Aug 15.
Artículo en Inglés | MEDLINE | ID: mdl-25128017

RESUMEN

BACKGROUND: Chromatin immunoprecipitation (ChIP) followed by next-generation sequencing (ChIP-Seq) has been widely used to identify genomic loci of transcription factor (TF) binding and histone modifications. ChIP-Seq data analysis involves multiple steps from read mapping and peak calling to data integration and interpretation. It remains challenging and time-consuming to process large amounts of ChIP-Seq data derived from different antibodies or experimental designs using the same approach. To address this challenge, there is a need for a comprehensive analysis pipeline with flexible settings to accelerate the utilization of this powerful technology in epigenetics research. RESULTS: We have developed a highly integrative pipeline, termed HiChIP for systematic analysis of ChIP-Seq data. HiChIP incorporates several open source software packages selected based on internal assessments and published comparisons. It also includes a set of tools developed in-house. This workflow enables the analysis of both paired-end and single-end ChIP-Seq reads, with or without replicates for the characterization and annotation of both punctate and diffuse binding sites. The main functionality of HiChIP includes: (a) read quality checking; (b) read mapping and filtering; (c) peak calling and peak consistency analysis; and (d) result visualization. In addition, this pipeline contains modules for generating binding profiles over selected genomic features, de novo motif finding from transcription factor (TF) binding sites and functional annotation of peak associated genes. CONCLUSIONS: HiChIP is a comprehensive analysis pipeline that can be configured to analyze ChIP-Seq data derived from varying antibodies and experiment designs. Using public ChIP-Seq data we demonstrate that HiChIP is a fast and reliable pipeline for processing large amounts of ChIP-Seq data.


Asunto(s)
Inmunoprecipitación de Cromatina/métodos , Genómica/métodos , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Programas Informáticos , Animales , Sitios de Unión , Mapeo Cromosómico , Interpretación Estadística de Datos , Humanos , Ratones , Anotación de Secuencia Molecular , Factores de Transcripción/metabolismo
2.
Proteins ; 74(3): 566-82, 2009 Feb 15.
Artículo en Inglés | MEDLINE | ID: mdl-18655063

RESUMEN

Protein function prediction is a central problem in bioinformatics, increasing in importance recently due to the rapid accumulation of biological data awaiting interpretation. Sequence data represents the bulk of this new stock and is the obvious target for consideration as input, as newly sequenced organisms often lack any other type of biological characterization. We have previously introduced PFP (Protein Function Prediction) as our sequence-based predictor of Gene Ontology (GO) functional terms. PFP interprets the results of a PSI-BLAST search by extracting and scoring individual functional attributes, searching a wide range of E-value sequence matches, and utilizing conventional data mining techniques to fill in missing information. We have shown it to be effective in predicting both specific and low-resolution functional attributes when sufficient data is unavailable. Here we describe (1) significant improvements to the PFP infrastructure, including the addition of prediction significance and confidence scores, (2) a thorough benchmark of performance and comparisons to other related prediction methods, and (3) applications of PFP predictions to genome-scale data. We applied PFP predictions to uncharacterized protein sequences from 15 organisms. Among these sequences, 60-90% could be annotated with a GO molecular function term at high confidence (>or=80%). We also applied our predictions to the protein-protein interaction network of the Malaria plasmodium (Plasmodium falciparum). High confidence GO biological process predictions (>or=90%) from PFP increased the number of fully enriched interactions in this dataset from 23% of interactions to 94%. Our benchmark comparison shows significant performance improvement of PFP relative to GOtcha, InterProScan, and PSI-BLAST predictions. This is consistent with the performance of PFP as the overall best predictor in both the AFP-SIG '05 and CASP7 function (FN) assessments. PFP is available as a web service at http://dragon.bio.purdue.edu/pfp/.


Asunto(s)
Proteínas/genética , Análisis de Secuencia de Proteína/métodos , Algoritmos , Biología Computacional , Bases de Datos de Proteínas , Genes , Mapeo de Interacción de Proteínas , Proteínas/química , Proteoma/análisis , Programas Informáticos
3.
Protein Sci ; 15(6): 1550-6, 2006 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-16672240

RESUMEN

The impetus for the recent development and emergence of automated function prediction methods is an exponentially growing flood of new experimental data, the interpretation of which is hindered by a shortage of reliable annotations for proteins that lack experimental characterization or significant homologs in current databases. Here we introduce PFP, an automated function prediction server that provides the most probable annotations for a query sequence in each of the three branches of the Gene Ontology: biological process, molecular function, and cellular component. Rather than utilizing precise pattern matching to identify functional motifs in the sequences and structures of these proteins, we designed PFP to increase the coverage of function annotation by lowering resolution of predictions when a detailed function is not predictable. To do this we extend a traditional PSI-BLAST search by extracting and scoring annotations (GO terms) individually, including annotations from distantly related sequences, and applying a novel data mining tool, the Function Association Matrix, to score strongly associated pairs of annotations. We show that PFP can correctly assign function using only weakly similar sequences with a significantly better accuracy and coverage than a standard PSI-BLAST search, improving it more than fivefold. The most descriptive annotations predicted by PFP (GO depth > or = 8) can identify a significant subgraph in the GO with > 60% accuracy and approximately 100% coverage for our benchmark set. We also provide examples of the superb performance of PFP in an assessment of automated function prediction servers at the Automated Function Prediction Special Interest Group meeting at ISMB 2005 (AFP-SIG '05).


Asunto(s)
Algoritmos , Biología Computacional/métodos , Proteínas/química , Proteínas/metabolismo , Bases de Datos de Proteínas
4.
Int J Rheum Dis ; 13(4): 284-7, 2010 Oct.
Artículo en Inglés | MEDLINE | ID: mdl-21199462

RESUMEN

Citrullinated peptides in autoimmune diseases have been extensively studied in the last two decades. It is suggested that citrullination and the anti-citrullinated peptide antibodies (ACPA) plays a critical role in initiating inflammatory responses in autoimmune diseases, such as rheumatoid arthritis (RA). The most commonly accepted molecular mechanism for citrullinated peptides/proteins in RA is that the modified antigen resulting from cell damage or uncontrolled apoptosis could evoke an immune response leading to autoantibodies against these peptide or the whole protein. Citrullination of arginine is catalyzed by the enzyme peptidylarginine-deiminase (PAD) in the presence of calcium, changing the positively charged arginine to a polar but neutral citrulline. These citrullinated peptides/proteins and the relevant antibodies (ACPA) are important, not only in initiation of RA, but also in the diagnosis of the disease. In this evidence-based clinical review, we summarize recently published data on peptide citrullination and ACPA gauging the ability of anti-cyclic citrullinated peptide (anti-CCP) antibodies for diagnosis of RA. We also recapitulate results of studies elucidating the mechanism underlying the disease.


Asunto(s)
Artritis Reumatoide/inmunología , Autoanticuerpos/sangre , Citrulina/inmunología , Péptidos/inmunología , Arginina/inmunología , Artritis Reumatoide/diagnóstico , Artritis Reumatoide/genética , Biomarcadores/sangre , Epítopos , Predisposición Genética a la Enfermedad , Humanos , Péptidos Cíclicos/inmunología , Valor Predictivo de las Pruebas , Pronóstico , Medición de Riesgo , Factores de Riesgo
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA