Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 11 de 11
Filtrar
1.
Genet Med ; 22(5): 974-978, 2020 05.
Artigo em Inglês | MEDLINE | ID: mdl-31965078

RESUMO

PURPOSE: Exome sequencing (ES) is increasingly used for the diagnosis of rare genetic disease. However, some pathogenic sequence variants within the exome go undetected due to the technical difficulty of identifying them. Mobile element insertions (MEIs) are a known cause of genetic disease in humans but have been historically difficult to detect via ES and similar targeted sequencing methods. METHODS: We developed and applied a novel MEI detection method prospectively to samples received for clinical ES beginning in November 2017. Positive MEI findings were confirmed by an orthogonal method and reported back to the ordering provider. In this study, we examined 89,874 samples from 38,871 cases. RESULTS: Diagnostic MEIs were present in 0.03% (95% binomial test confidence interval: 0.02-0.06%) of all cases and account for 0.15% (95% binomial test confidence interval: 0.08-0.25%) of cases with a molecular diagnosis. One diagnostic MEI was a novel founder event. Most patients with pathogenic MEIs had prior genetic testing, three of whom had previous negative DNA sequencing analysis of the diagnostic gene. CONCLUSION: MEI detection from ES is a valuable diagnostic tool, reveals molecular findings that may be undetected by other sequencing assays, and increases diagnostic yield by 0.15%.


Assuntos
Exoma , Testes Genéticos , Exoma/genética , Humanos , Análise de Sequência de DNA , Sequenciamento do Exoma
2.
BMC Genomics ; 18(1): 332, 2017 04 27.
Artigo em Inglês | MEDLINE | ID: mdl-28449639

RESUMO

BACKGROUND: The benefit of increasing genomic sequence data to the scientific community depends on easy-to-use, scalable bioinformatics support. CloVR-Comparative combines commonly used bioinformatics tools into an intuitive, automated, and cloud-enabled analysis pipeline for comparative microbial genomics. RESULTS: CloVR-Comparative runs on annotated complete or draft genome sequences that are uploaded by the user or selected via a taxonomic tree-based user interface and downloaded from NCBI. CloVR-Comparative runs reference-free multiple whole-genome alignments to determine unique, shared and core coding sequences (CDSs) and single nucleotide polymorphisms (SNPs). Output includes short summary reports and detailed text-based results files, graphical visualizations (phylogenetic trees, circular figures), and a database file linked to the Sybil comparative genome browser. Data up- and download, pipeline configuration and monitoring, and access to Sybil are managed through CloVR-Comparative web interface. CloVR-Comparative and Sybil are distributed as part of the CloVR virtual appliance, which runs on local computers or the Amazon EC2 cloud. Representative datasets (e.g. 40 draft and complete Escherichia coli genomes) are processed in <36 h on a local desktop or at a cost of <$20 on EC2. CONCLUSIONS: CloVR-Comparative allows anybody with Internet access to run comparative genomics projects, while eliminating the need for on-site computational resources and expertise.


Assuntos
Computação em Nuvem , Genômica/métodos , Software , Automação , Genoma Microbiano/genética , Alinhamento de Sequência , Análise de Sequência
3.
Nucleic Acids Res ; 38(Database issue): D408-14, 2010 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-19843611

RESUMO

Pathema (http://pathema.jcvi.org) is one of the eight Bioinformatics Resource Centers (BRCs) funded by the National Institute of Allergy and Infectious Disease (NIAID) designed to serve as a core resource for the bio-defense and infectious disease research community. Pathema strives to support basic research and accelerate scientific progress for understanding, detecting, diagnosing and treating an established set of six target NIAID Category A-C pathogens: Category A priority pathogens; Bacillus anthracis and Clostridium botulinum, and Category B priority pathogens; Burkholderia mallei, Burkholderia pseudomallei, Clostridium perfringens and Entamoeba histolytica. Each target pathogen is represented in one of four distinct clade-specific Pathema web resources and underlying databases developed to target the specific data and analysis needs of each scientific community. All publicly available complete genome projects of phylogenetically related organisms are also represented, providing a comprehensive collection of organisms for comparative analyses. Pathema facilitates the scientific exploration of genomic and related data through its integration with web-based analysis tools, customized to obtain, display, and compute results relevant to ongoing pathogen research. Pathema serves the bio-defense and infectious disease research community by disseminating data resulting from pathogen genome sequencing projects and providing access to the results of inter-genomic comparisons for these organisms.


Assuntos
Infecções Bacterianas/microbiologia , Doenças Transmissíveis/microbiologia , Biologia Computacional/métodos , Bases de Dados Genéticas , Sequência de Aminoácidos , Animais , Infecções Bacterianas/diagnóstico , Biologia Computacional/tendências , Genoma Bacteriano , Humanos , Armazenamento e Recuperação da Informação/métodos , Internet , Dados de Sequência Molecular , National Institute of Allergy and Infectious Diseases (U.S.) , Homologia de Sequência de Aminoácidos , Software , Estados Unidos
4.
BMC Bioinformatics ; 12: 356, 2011 Aug 30.
Artigo em Inglês | MEDLINE | ID: mdl-21878105

RESUMO

BACKGROUND: Next-generation sequencing technologies have decentralized sequence acquisition, increasing the demand for new bioinformatics tools that are easy to use, portable across multiple platforms, and scalable for high-throughput applications. Cloud computing platforms provide on-demand access to computing infrastructure over the Internet and can be used in combination with custom built virtual machines to distribute pre-packaged with pre-configured software. RESULTS: We describe the Cloud Virtual Resource, CloVR, a new desktop application for push-button automated sequence analysis that can utilize cloud computing resources. CloVR is implemented as a single portable virtual machine (VM) that provides several automated analysis pipelines for microbial genomics, including 16S, whole genome and metagenome sequence analysis. The CloVR VM runs on a personal computer, utilizes local computer resources and requires minimal installation, addressing key challenges in deploying bioinformatics workflows. In addition CloVR supports use of remote cloud computing resources to improve performance for large-scale sequence processing. In a case study, we demonstrate the use of CloVR to automatically process next-generation sequencing data on multiple cloud computing platforms. CONCLUSION: The CloVR VM and associated architecture lowers the barrier of entry for utilizing complex analysis protocols on both local single- and multi-core computers and cloud systems for high throughput data processing.


Assuntos
Computadores , Internet , Análise de Sequência de DNA , Software , Biologia Computacional , Genômica , Sequenciamento de Nucleotídeos em Larga Escala
5.
Bioinformatics ; 26(12): 1488-92, 2010 Jun 15.
Artigo em Inglês | MEDLINE | ID: mdl-20413634

RESUMO

MOTIVATION: The growth of sequence data has been accompanied by an increasing need to analyze data on distributed computer clusters. The use of these systems for routine analysis requires scalable and robust software for data management of large datasets. Software is also needed to simplify data management and make large-scale bioinformatics analysis accessible and reproducible to a wide class of target users. RESULTS: We have developed a workflow management system named Ergatis that enables users to build, execute and monitor pipelines for computational analysis of genomics data. Ergatis contains preconfigured components and template pipelines for a number of common bioinformatics tasks such as prokaryotic genome annotation and genome comparisons. Outputs from many of these components can be loaded into a Chado relational database. Ergatis was designed to be accessible to a broad class of users and provides a user friendly, web-based interface. Ergatis supports high-throughput batch processing on distributed compute clusters and has been used for data management in a number of genome annotation and comparative genomics projects. AVAILABILITY: Ergatis is an open-source project and is freely available at http://ergatis.sourceforge.net.


Assuntos
Biologia Computacional/métodos , Internet , Software , Bases de Dados Genéticas , Bases de Dados de Proteínas , Fluxo de Trabalho
6.
PLoS Genet ; 4(4): e1000046, 2008 Apr 11.
Artigo em Inglês | MEDLINE | ID: mdl-18404212

RESUMO

We present the genome sequences of a new clinical isolate of the important human pathogen, Aspergillus fumigatus, A1163, and two closely related but rarely pathogenic species, Neosartorya fischeri NRRL181 and Aspergillus clavatus NRRL1. Comparative genomic analysis of A1163 with the recently sequenced A. fumigatus isolate Af293 has identified core, variable and up to 2% unique genes in each genome. While the core genes are 99.8% identical at the nucleotide level, identity for variable genes can be as low 40%. The most divergent loci appear to contain heterokaryon incompatibility (het) genes associated with fungal programmed cell death such as developmental regulator rosA. Cross-species comparison has revealed that 8.5%, 13.5% and 12.6%, respectively, of A. fumigatus, N. fischeri and A. clavatus genes are species-specific. These genes are significantly smaller in size than core genes, contain fewer exons and exhibit a subtelomeric bias. Most of them cluster together in 13 chromosomal islands, which are enriched for pseudogenes, transposons and other repetitive elements. At least 20% of A. fumigatus-specific genes appear to be functional and involved in carbohydrate and chitin catabolism, transport, detoxification, secondary metabolism and other functions that may facilitate the adaptation to heterogeneous environments such as soil or a mammalian host. Contrary to what was suggested previously, their origin cannot be attributed to horizontal gene transfer (HGT), but instead is likely to involve duplication, diversification and differential gene loss (DDL). The role of duplication in the origin of lineage-specific genes is further underlined by the discovery of genomic islands that seem to function as designated "gene dumps" and, perhaps, simultaneously, as "gene factories".


Assuntos
Aspergillus fumigatus/genética , Ilhas Genômicas , Alérgenos/genética , Aspergillus/classificação , Aspergillus/genética , Aspergillus/fisiologia , Aspergillus fumigatus/classificação , Aspergillus fumigatus/patogenicidade , Aspergillus fumigatus/fisiologia , Cromossomos Fúngicos/genética , Eurotiales/classificação , Eurotiales/genética , Eurotiales/fisiologia , Evolução Molecular , Proteínas Fúngicas/genética , Proteínas Fúngicas/imunologia , Genoma Fúngico , Humanos , Filogenia , Especificidade da Espécie , Virulência/genética
7.
Biochem J ; 394(Pt 3): 665-74, 2006 Mar 15.
Artigo em Inglês | MEDLINE | ID: mdl-16336194

RESUMO

YZGD from Paenibacillus thiaminolyticus is a novel bifunctional enzyme with both PLPase (pyridoxal phosphatase) and Nudix (nucleoside diphosphate x) hydrolase activities. The PLPase activity is catalysed by the HAD (haloacid dehalogenase) superfamily motif of the enzyme, and the Nudix hydrolase activity is catalysed by the conserved Nudix signature sequence within a separate portion of the enzyme, as confirmed by site-directed mutagenesis. YZGD's phosphatase activity is very specific, with pyridoxal phosphate being the only natural substrate, while YZGD's Nudix activity is just the opposite, with YZGD being the most versatile Nudix hydrolase characterized to date. YZGD's Nudix substrates include the CDP-alcohols (CDP-ethanol, CDP-choline and CDP-glycerol), the ADP-coenzymes (NADH, NAD and FAD), ADP-sugars, TDP-glucose and, to a lesser extent, UDP- and GDP-sugars. Regardless of the Nudix substrate, one of the products is always a nucleoside monophosphate, suggesting a role in nucleotide salvage. Both the PLPase and Nudix hydrolase activities require a bivalent metal cation, but while PLPase activity is supported by Co2+, Mg2+, Zn2+ and Mn2+, the Nudix hydrolase activity is Mn2+-specific. YZGD's phosphatase activity is optimal at an acidic pH (pH 5), while YZGD's Nudix activities are optimal at an alkaline pH (pH 8.5). YZGD is the first enzyme reported to be a member of both the HAD and Nudix hydrolase superfamilies, the first PLPase to be recognized as a member of the HAD superfamily and the first Nudix hydrolase capable of hydrolysing ADP-x, CDP-x and TDP-x substrates with comparable substrate specificity.


Assuntos
Bactérias/enzimologia , Monoéster Fosfórico Hidrolases/classificação , Monoéster Fosfórico Hidrolases/metabolismo , Pirofosfatases/classificação , Pirofosfatases/metabolismo , Difosfato de Adenosina/metabolismo , Álcoois/metabolismo , Sequência de Aminoácidos , Metabolismo dos Carboidratos , Clonagem Molecular , Estabilidade Enzimática , Regulação Bacteriana da Expressão Gênica , Regulação Enzimológica da Expressão Gênica , Concentração de Íons de Hidrogênio , Cinética , Dados de Sequência Molecular , Mutagênese Sítio-Dirigida , Monoéster Fosfórico Hidrolases/química , Monoéster Fosfórico Hidrolases/genética , Fosforilação , Estrutura Terciária de Proteína , Pirofosfatases/química , Pirofosfatases/genética , Homologia de Sequência de Aminoácidos , Especificidade por Substrato , Nudix Hidrolases
8.
Sci Transl Med ; 7(283): 283ra53, 2015 Apr 15.
Artigo em Inglês | MEDLINE | ID: mdl-25877891

RESUMO

Massively parallel sequencing approaches are beginning to be used clinically to characterize individual patient tumors and to select therapies based on the identified mutations. A major question in these analyses is the extent to which these methods identify clinically actionable alterations and whether the examination of the tumor tissue alone is sufficient or whether matched normal DNA should also be analyzed to accurately identify tumor-specific (somatic) alterations. To address these issues, we comprehensively evaluated 815 tumor-normal paired samples from patients of 15 tumor types. We identified genomic alterations using next-generation sequencing of whole exomes or 111 targeted genes that were validated with sensitivities >95% and >99%, respectively, and specificities >99.99%. These analyses revealed an average of 140 and 4.3 somatic mutations per exome and targeted analysis, respectively. More than 75% of cases had somatic alterations in genes associated with known therapies or current clinical trials. Analyses of matched normal DNA identified germline alterations in cancer-predisposing genes in 3% of patients with apparently sporadic cancers. In contrast, a tumor-only sequencing approach could not definitively identify germline changes in cancer-predisposing genes and led to additional false-positive findings comprising 31% and 65% of alterations identified in targeted and exome analyses, respectively, including in potentially actionable genes. These data suggest that matched tumor-normal sequencing analyses are essential for precise identification and interpretation of somatic and germline alterations and have important implications for the diagnostic and therapeutic management of cancer patients.


Assuntos
Análise Mutacional de DNA , Genômica , Mutação , Neoplasias/genética , Medicina de Precisão , Biologia Computacional , Exoma , Reações Falso-Positivas , Biblioteca Gênica , Predisposição Genética para Doença , Mutação em Linhagem Germinativa , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Polimorfismo de Nucleotídeo Único , Estudos Retrospectivos
9.
mBio ; 5(6): e01721, 2014 Nov 04.
Artigo em Inglês | MEDLINE | ID: mdl-25370488

RESUMO

UNLABELLED: For centuries, cholera has been one of the most feared diseases. The causative agent Vibrio cholerae is a waterborne Gram-negative enteric pathogen eliciting a severe watery diarrheal disease. In October 2010, the seventh pandemic reached Haiti, a country that had not experienced cholera for more than a century. By using whole-genome sequence typing and mapping strategies of 116 serotype O1 strains from global sources, including 44 Haitian genomes, we present a detailed reconstructed evolutionary history of the seventh pandemic with a focus on the Haitian outbreak. We catalogued subtle genomic alterations at the nucleotide level in the genome core and architectural rearrangements from whole-genome map comparisons. Isolates closely related to the Haitian isolates caused several recent outbreaks in southern Asia. This study provides evidence for a single-source introduction of cholera from Nepal into Haiti followed by rapid, extensive, and continued clonal expansion. The phylogeographic patterns in both southern Asia and Haiti argue for the rapid dissemination of V. cholerae across the landscape necessitating real-time surveillance efforts to complement the whole-genome epidemiological analysis. As eradication efforts move forward, phylogeographic knowledge will be important for identifying persistent sources and monitoring success at regional levels. The results of molecular and epidemiological analyses of this outbreak suggest that an indigenous Haitian source of V. cholerae is unlikely and that an indigenous source has not contributed to the genomic evolution of this clade. IMPORTANCE: In this genomic epidemiology study, we have applied high-resolution whole-genome-based sequence typing methodologies on a comprehensive set of genome sequences that have become available in the aftermath of the Haitian cholera epidemic. These sequence resources enabled us to reassess the degree of genomic heterogeneity within the Vibrio cholerae O1 serotype and to refine boundaries and evolutionary relationships. The established phylogenomic framework showed how outbreak isolates fit into the global phylogeographic patterns compared to a comprehensive globally and temporally diverse strain collection and provides strong molecular evidence that points to a nonindigenous source of the 2010 Haitian cholera outbreak and refines epidemiological standards used in outbreak investigations for outbreak inclusion/exclusion following the concept of genomic epidemiology. The generated phylogenomic data have major public health relevance in translating sequence-based information to assist in future diagnostic, epidemiological, surveillance, and forensic studies of cholera.


Assuntos
Cólera/epidemiologia , Cólera/microbiologia , Epidemias , Genoma Bacteriano , Tipagem Molecular , Vibrio cholerae O1/classificação , Vibrio cholerae O1/genética , Cólera/transmissão , Genótipo , Haiti/epidemiologia , Epidemiologia Molecular , Nepal , Filogeografia , Análise de Sequência de DNA , Vibrio cholerae O1/isolamento & purificação
10.
Genome Announc ; 1(2): e0013412, 2013 Feb 28.
Artigo em Inglês | MEDLINE | ID: mdl-23516226

RESUMO

First identified in 1982, Escherichia coli O157:H7 is the dominant enterohemorrhagic serotype underlying food-borne human infections in North America. Here, we report the genomes of twenty-six strains derived from patients and the bovine reservoir. These resources enable detailed whole-genome comparisons and permit investigations of genotypic and phenotypic plasticity.

11.
Stand Genomic Sci ; 4(2): 244-51, 2011 Apr 29.
Artigo em Inglês | MEDLINE | ID: mdl-21677861

RESUMO

The Institute for Genome Sciences (IGS) has developed a prokaryotic annotation pipeline that is used for coding gene/RNA prediction and functional annotation of Bacteria and Archaea. The fully automated pipeline accepts one or many genomic sequences as input and produces output in a variety of standard formats. Functional annotation is primarily based on similarity searches and motif finding combined with a hierarchical rule based annotation system. The output annotations can also be loaded into a relational database and accessed through visualization tools.

SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA