Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 8 de 8
Filtrar
1.
Hum Mutat ; 39(11): 1690-1701, 2018 11.
Artículo en Inglés | MEDLINE | ID: mdl-30311374

RESUMEN

Effective exchange of information about genetic variants is currently hampered by the lack of readily available globally unique variant identifiers that would enable aggregation of information from different sources. The ClinGen Allele Registry addresses this problem by providing (1) globally unique "canonical" variant identifiers (CAids) on demand, either individually or in large batches; (2) access to variant-identifying information in a searchable Registry; (3) links to allele-related records in many commonly used databases; and (4) services for adding links to information about registered variants in external sources. A core element of the Registry is a canonicalization service, implemented using in-memory sequence alignment-based index, which groups variant identifiers denoting the same nucleotide variant and assigns unique and dereferenceable CAids. More than 650 million distinct variants are currently registered, including those from gnomAD, ExAC, dbSNP, and ClinVar, including a small number of variants registered by Registry users. The Registry is accessible both via a web interface and programmatically via well-documented Hypertext Transfer Protocol (HTTP) Representational State Transfer Application Programming Interface (REST-APIs). For programmatic interoperability, the Registry content is accessible in the JavaScript Object Notation for Linked Data (JSON-LD) format. We present several use cases and demonstrate how the linked information may provide raw material for reasoning about variant's pathogenicity.


Asunto(s)
Bases de Datos Genéticas , Variación Genética/genética , Alelos , Humanos , Sistema de Registros , Programas Informáticos
2.
BMC Bioinformatics ; 15 Suppl 7: S2, 2014.
Artículo en Inglés | MEDLINE | ID: mdl-25080362

RESUMEN

BACKGROUND: Interactions between the epigenome and structural genomic variation are potentially bi-directional. In one direction, structural variants may cause epigenomic changes in cis. In the other direction, specific local epigenomic states such as DNA hypomethylation associate with local genomic instability. METHODS: To study these interactions, we have developed several tools and exposed them to the scientific community using the Software-as-a-Service model via the Genboree Workbench. One key tool is Breakout, an algorithm for fast and accurate detection of structural variants from mate pair sequencing data. RESULTS: By applying Breakout and other Genboree Workbench tools we map breakpoints in breast and prostate cancer cell lines and tumors, discriminate between polymorphic breakpoints of germline origin and those of somatic origin, and analyze both types of breakpoints in the context of the Human Epigenome Atlas, ENCODE databases, and other sources of epigenomic profiles. We confirm previous findings that genomic instability in human germline associates with hypomethylation of DNA, binding sites of Suz12, a key member of the PRC2 Polycomb complex, and with PRC2-associated histone marks H3K27me3 and H3K9me3. Breakpoints in germline and in breast cancer associate with distal regulatory of active gene transcription. Breast cancer cell lines and tumors show distinct patterns of structural mutability depending on their ER, PR, or HER2 status. CONCLUSIONS: The patterns of association that we detected suggest that cell-type specific epigenomes may determine cell-type specific patterns of selective structural mutability of the genome.


Asunto(s)
Algoritmos , Metilación de ADN , Epigenómica/métodos , Genoma Humano , Programas Informáticos , ADN/genética , ADN/metabolismo , Epigénesis Genética , Inestabilidad Genómica , Células Germinativas/metabolismo , Histonas/metabolismo , Humanos , Neoplasias/genética
3.
BMC Bioinformatics ; 13: 8, 2012 Jan 12.
Artículo en Inglés | MEDLINE | ID: mdl-22239737

RESUMEN

BACKGROUND: Whole exome capture sequencing allows researchers to cost-effectively sequence the coding regions of the genome. Although the exome capture sequencing methods have become routine and well established, there is currently a lack of tools specialized for variant calling in this type of data. RESULTS: Using statistical models trained on validated whole-exome capture sequencing data, the Atlas2 Suite is an integrative variant analysis pipeline optimized for variant discovery on all three of the widely used next generation sequencing platforms (SOLiD, Illumina, and Roche 454). The suite employs logistic regression models in conjunction with user-adjustable cutoffs to accurately separate true SNPs and INDELs from sequencing and mapping errors with high sensitivity (96.7%). CONCLUSION: We have implemented the Atlas2 Suite and applied it to 92 whole exome samples from the 1000 Genomes Project. The Atlas2 Suite is available for download at http://sourceforge.net/projects/atlas2/. In addition to a command line version, the suite has been integrated into the Genboree Workbench, allowing biomedical scientists with minimal informatics expertise to remotely call, view, and further analyze variants through a simple web interface. The existing genomic databases displayed via the Genboree browser also streamline the process from variant discovery to functional genomics analysis, resulting in an off-the-shelf toolkit for the broader community.


Asunto(s)
Exoma , Genoma Humano , Programas Informáticos , Genómica , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Mutación INDEL , Sistemas de Lectura Abierta , Polimorfismo de Nucleótido Simple
4.
BMC Bioinformatics ; 13 Suppl 13: S11, 2012.
Artículo en Inglés | MEDLINE | ID: mdl-23320832

RESUMEN

BACKGROUND: Microbial metagenomic analyses rely on an increasing number of publicly available tools. Installation, integration, and maintenance of the tools poses significant burden on many researchers and creates a barrier to adoption of microbiome analysis, particularly in translational settings. METHODS: To address this need we have integrated a rich collection of microbiome analysis tools into the Genboree Microbiome Toolset and exposed them to the scientific community using the Software-as-a-Service model via the Genboree Workbench. The Genboree Microbiome Toolset provides an interactive environment for users at all bioinformatic experience levels in which to conduct microbiome analysis. The Toolset drives hypothesis generation by providing a wide range of analyses including alpha diversity and beta diversity, phylogenetic profiling, supervised machine learning, and feature selection. RESULTS: We validate the Toolset in two studies of the gut microbiota, one involving obese and lean twins, and the other involving children suffering from the irritable bowel syndrome. CONCLUSIONS: By lowering the barrier to performing a comprehensive set of microbiome analyses, the Toolset empowers investigators to translate high-volume sequencing data into valuable biomedical discoveries.


Asunto(s)
Metagenómica/métodos , ARN Ribosómico 16S/genética , Análisis de Secuencia de ARN/métodos , Niño , Biología Computacional , Tracto Gastrointestinal/microbiología , Humanos , Síndrome del Colon Irritable/microbiología , Metagenoma , Obesidad/genética , Filogenia , Programas Informáticos
5.
BMC Genomics ; 13 Suppl 6: S19, 2012.
Artículo en Inglés | MEDLINE | ID: mdl-23134663

RESUMEN

BACKGROUND: Until recently, sequencing has primarily been carried out in large genome centers which have invested heavily in developing the computational infrastructure that enables genomic sequence analysis. The recent advancements in next generation sequencing (NGS) have led to a wide dissemination of sequencing technologies and data, to highly diverse research groups. It is expected that clinical sequencing will become part of diagnostic routines shortly. However, limited accessibility to computational infrastructure and high quality bioinformatic tools, and the demand for personnel skilled in data analysis and interpretation remains a serious bottleneck. To this end, the cloud computing and Software-as-a-Service (SaaS) technologies can help address these issues. RESULTS: We successfully enabled the Atlas2 Cloud pipeline for personal genome analysis on two different cloud service platforms: a community cloud via the Genboree Workbench, and a commercial cloud via the Amazon Web Services using Software-as-a-Service model. We report a case study of personal genome analysis using our Atlas2 Genboree pipeline. We also outline a detailed cost structure for running Atlas2 Amazon on whole exome capture data, providing cost projections in terms of storage, compute and I/O when running Atlas2 Amazon on a large data set. CONCLUSIONS: We find that providing a web interface and an optimized pipeline clearly facilitates usage of cloud computing for personal genome analysis, but for it to be routinely used for large scale projects there needs to be a paradigm shift in the way we develop tools, in standard operating procedures, and in funding mechanisms.


Asunto(s)
Genoma Humano , Programas Informáticos , Bases de Datos Genéticas , Humanos , Internet , Interfaz Usuario-Computador
6.
Proteomics ; 9(17): 4192-208, 2009 Sep.
Artículo en Inglés | MEDLINE | ID: mdl-19722191

RESUMEN

Wolbachia is an intracellular endosymbiont of Brugia malayi parasite whose presence is essential for the survival of the parasite. Treatment of B. malayi-infected jirds with tetracycline eliminates Wolbachia, which affects parasite survival and fitness. In the present study we have tried to identify parasite proteins that are affected when Wolbachia is targeted by tetracycline. For this Wolbachia depleted parasites (B. malayi) were obtained by tetracycline treatment of infected Mongolian jirds (Meriones unguiculatus) and their protein profile after 2-DE separation was compared with that of untreated parasites harboring Wolbachia. Approximately 100 protein spots could be visualized followed by CBB staining of 2-D gel and included for comparative analysis. Of these, 54 showed differential expressions, while two new protein spots emerged (of 90.3 and 64.4 kDa). These proteins were subjected to further analysis by MALDI-TOF for their identification using Brugia coding sequence database composed of both genomic and EST sequences. Our study unravels two crucial findings: (i) the parasite or Wolbachia proteins, which disappeared/down-regulated appear be essential for parasite survival and may be used as drug targets and (ii) tetracycline treatment interferes with the regulatory machinery vital for parasites cellular integrity and defense and thus could possibly be a molecular mechanism for the killing of filarial parasite. This is the first proteomic study substantiating the wolbachial genome integrity with its nematode host and providing functional genomic data of human lymphatic filarial parasite B. malayi.


Asunto(s)
Brugia Malayi/metabolismo , Brugia Malayi/microbiología , Proteínas del Helminto/metabolismo , Parásitos/metabolismo , Parásitos/microbiología , Tetraciclina/farmacología , Wolbachia/efectos de los fármacos , Animales , Anticuerpos Antihelmínticos , Brugia Malayi/efectos de los fármacos , Brugia Malayi/inmunología , Electroforesis en Gel Bidimensional , Electroforesis en Gel de Poliacrilamida , Immunoblotting , Parásitos/efectos de los fármacos , Reacción en Cadena de la Polimerasa , Proteoma/análisis , Reproducibilidad de los Resultados
7.
Genome Med ; 9(1): 3, 2017 01 12.
Artículo en Inglés | MEDLINE | ID: mdl-28081714

RESUMEN

BACKGROUND: The success of the clinical use of sequencing based tests (from single gene to genomes) depends on the accuracy and consistency of variant interpretation. Aiming to improve the interpretation process through practice guidelines, the American College of Medical Genetics and Genomics (ACMG) and the Association for Molecular Pathology (AMP) have published standards and guidelines for the interpretation of sequence variants. However, manual application of the guidelines is tedious and prone to human error. Web-based tools and software systems may not only address this problem but also document reasoning and supporting evidence, thus enabling transparency of evidence-based reasoning and resolution of discordant interpretations. RESULTS: In this report, we describe the design, implementation, and initial testing of the Clinical Genome Resource (ClinGen) Pathogenicity Calculator, a configurable system and web service for the assessment of pathogenicity of Mendelian germline sequence variants. The system allows users to enter the applicable ACMG/AMP-style evidence tags for a specific allele with links to supporting data for each tag and generate guideline-based pathogenicity assessment for the allele. Through automation and comprehensive documentation of evidence codes, the system facilitates more accurate application of the ACMG/AMP guidelines, improves standardization in variant classification, and facilitates collaborative resolution of discordances. The rules of reasoning are configurable with gene-specific or disease-specific guideline variations (e.g. cardiomyopathy-specific frequency thresholds and functional assays). The software is modular, equipped with robust application program interfaces (APIs), and available under a free open source license and as a cloud-hosted web service, thus facilitating both stand-alone use and integration with existing variant curation and interpretation systems. The Pathogenicity Calculator is accessible at http://calculator.clinicalgenome.org . CONCLUSIONS: By enabling evidence-based reasoning about the pathogenicity of genetic variants and by documenting supporting evidence, the Calculator contributes toward the creation of a knowledge commons and more accurate interpretation of sequence variants in research and clinical care.


Asunto(s)
Enfermedad/genética , Variación Genética , Genoma Humano , Programas Informáticos , Alelos , Biología Computacional , Genética Médica , Guías como Asunto , Humanos , Mutación
8.
Nat Commun ; 6: 6370, 2015 Feb 18.
Artículo en Inglés | MEDLINE | ID: mdl-25691256

RESUMEN

Tissue-specific expression of lincRNAs suggests developmental and cell-type-specific functions, yet tissue specificity was established for only a small fraction of lincRNAs. Here, by analysing 111 reference epigenomes from the NIH Roadmap Epigenomics project, we determine tissue-specific epigenetic regulation for 3,753 (69% examined) lincRNAs, with 54% active in one of the 14 cell/tissue clusters and an additional 15% in two or three clusters. A larger fraction of lincRNA TSSs is marked in a tissue-specific manner by H3K4me1 than by H3K4me3. The tissue-specific lincRNAs are strongly linked to tissue-specific pathways and undergo distinct chromatin state transitions during cellular differentiation. Polycomb-regulated lincRNAs reside in the bivalent state in embryonic stem cells and many of them undergo H3K27me3-mediated silencing at early stages of differentiation. The exquisitely tissue-specific epigenetic regulation of lincRNAs and the assignment of a majority of them to specific tissue types will inform future studies of this newly discovered class of genes.


Asunto(s)
Diferenciación Celular , Epigénesis Genética , Epigenómica , ARN Largo no Codificante/metabolismo , Elementos Reguladores de la Transcripción , Células Madre Embrionarias/fisiología , Humanos , Especificidad de Órganos , Fenotipo , Proteínas del Grupo Polycomb/fisiología
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA