Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 25
Filtrar
1.
Hum Mutat ; 43(6): 674-681, 2022 06.
Artigo em Inglês | MEDLINE | ID: mdl-35165961

RESUMO

A major challenge in validating genetic causes for patients with rare diseases (RDs) is the difficulty in identifying other RD patients with overlapping phenotypes and variants in the same candidate gene. This process, known as matchmaking, requires robust data sharing solutions to be effective. In 2014 we launched PhenomeCentral, a RD data repository capable of collecting computer-readable genotypic and phenotypic data for the purposes of RD matchmaking. Over the past 7 years PhenomeCentral's features have been expanded and its data set has consistently grown. There are currently 1615 users registered on PhenomeCentral, which have contributed over 12,000 patient cases. Most of these cases contain detailed phenotypic terms, with a significant portion also providing genomic sequence data or other forms of clinical information. Matchmaking within PhenomeCentral, and with connections to other data repositories in the Matchmaker Exchange, have collectively resulted in over 60,000 matches, which have facilitated multiple gene discoveries. The collection of deep phenotypic and genotypic data has also positioned PhenomeCentral well to support next generation of matchmaking initiatives that utilize genome sequencing data, ensuring that PhenomeCentral will remain a useful tool in solving undiagnosed RD cases in the years to come.


Assuntos
Disseminação de Informação , Doenças Raras , Genômica/métodos , Genótipo , Humanos , Disseminação de Informação/métodos , Fenótipo , Doenças Raras/diagnóstico , Doenças Raras/genética
2.
Hum Mutat ; 43(6): 800-811, 2022 06.
Artigo em Inglês | MEDLINE | ID: mdl-35181971

RESUMO

Despite recent progress in the understanding of the genetic etiologies of rare diseases (RDs), a significant number remain intractable to diagnostic and discovery efforts. Broad data collection and sharing of information among RD researchers is therefore critical. In 2018, the Care4Rare Canada Consortium launched the project C4R-SOLVE, a subaim of which was to collect, harmonize, and share both retrospective and prospective Canadian clinical and multiomic data. Here, we introduce Genomics4RD, an integrated web-accessible platform to share Canadian phenotypic and multiomic data between researchers, both within Canada and internationally, for the purpose of discovering the mechanisms that cause RDs. Genomics4RD has been designed to standardize data collection and processing, and to help users systematically collect, prioritize, and visualize participant information. Data storage, authorization, and access procedures have been developed in collaboration with policy experts and stakeholders to ensure the trusted and secure access of data by external researchers. The breadth and standardization of data offered by Genomics4RD allows researchers to compare candidate disease genes and variants between participants (i.e., matchmaking) for discovery purposes, while facilitating the development of computational approaches for multiomic data analyses and enabling clinical translation efforts for new genetic technologies in the future.


Assuntos
Doenças Raras , Canadá , Estudos de Associação Genética , Humanos , Fenótipo , Estudos Prospectivos , Doenças Raras/diagnóstico , Doenças Raras/genética , Estudos Retrospectivos
3.
Hum Mutat ; 43(6): 717-733, 2022 06.
Artigo em Inglês | MEDLINE | ID: mdl-35178824

RESUMO

Rare disease patients are more likely to receive a rapid molecular diagnosis nowadays thanks to the wide adoption of next-generation sequencing. However, many cases remain undiagnosed even after exome or genome analysis, because the methods used missed the molecular cause in a known gene, or a novel causative gene could not be identified and/or confirmed. To address these challenges, the RD-Connect Genome-Phenome Analysis Platform (GPAP) facilitates the collation, discovery, sharing, and analysis of standardized genome-phenome data within a collaborative environment. Authorized clinicians and researchers submit pseudonymised phenotypic profiles encoded using the Human Phenotype Ontology, and raw genomic data which is processed through a standardized pipeline. After an optional embargo period, the data are shared with other platform users, with the objective that similar cases in the system and queries from peers may help diagnose the case. Additionally, the platform enables bidirectional discovery of similar cases in other databases from the Matchmaker Exchange network. To facilitate genome-phenome analysis and interpretation by clinical researchers, the RD-Connect GPAP provides a powerful user-friendly interface and leverages tens of information sources. As a result, the resource has already helped diagnose hundreds of rare disease patients and discover new disease causing genes.


Assuntos
Genômica , Doenças Raras , Exoma , Estudos de Associação Genética , Genômica/métodos , Humanos , Fenótipo , Doenças Raras/diagnóstico , Doenças Raras/genética
4.
Genet Med ; 24(1): 100-108, 2022 01.
Artigo em Inglês | MEDLINE | ID: mdl-34906465

RESUMO

PURPOSE: Matchmaking has emerged as a useful strategy for building evidence toward causality of novel disease genes in patients with undiagnosed rare diseases. The Matchmaker Exchange (MME) is a collaborative initiative that facilitates international data sharing for matchmaking purposes; however, data on user experience is limited. METHODS: Patients enrolled as part of the Finding of Rare Disease Genes in Canada (FORGE) and Care4Rare Canada research programs had their exome sequencing data reanalyzed by a multidisciplinary research team over a 2-year period. Compelling variants in genes not previously associated with a human phenotype were submitted through the MME node PhenomeCentral, and outcomes were collected. RESULTS: In this study, 194 novel candidate genes were submitted to the MME, resulting in 1514 matches, and 15% of the genes submitted resulted in collaborations. Most submissions resulted in at least 1 match, and most matches were with GeneMatcher (82%), where additional email exchange was required to evaluate the match because of the lack of phenotypic or inheritance information. CONCLUSION: Matchmaking through the MME is an effective way to investigate novel candidate genes; however, it is a labor-intensive process. Engagement from the community to contribute phenotypic, genotypic, and inheritance data will ensure that matchmaking continues to be a useful approach in the future.


Assuntos
Bases de Dados Genéticas , Disseminação de Informação , Doenças Raras , Canadá , Estudos de Associação Genética , Humanos , Disseminação de Informação/métodos , Fenótipo , Doenças Raras/diagnóstico , Doenças Raras/genética
5.
Genet Med ; 24(6): 1336-1348, 2022 06.
Artigo em Inglês | MEDLINE | ID: mdl-35305867

RESUMO

PURPOSE: This study aimed to provide comprehensive diagnostic and candidate analyses in a pediatric rare disease cohort through the Genomic Answers for Kids program. METHODS: Extensive analyses of 960 families with suspected genetic disorders included short-read exome sequencing and short-read genome sequencing (srGS); PacBio HiFi long-read genome sequencing (HiFi-GS); variant calling for single nucleotide variants (SNV), structural variant (SV), and repeat variants; and machine-learning variant prioritization. Structured phenotypes, prioritized variants, and pedigrees were stored in PhenoTips database, with data sharing through controlled access the database of Genotypes and Phenotypes. RESULTS: Diagnostic rates ranged from 11% in patients with prior negative genetic testing to 34.5% in naive patients. Incorporating SVs from genome sequencing added up to 13% of new diagnoses in previously unsolved cases. HiFi-GS yielded increased discovery rate with >4-fold more rare coding SVs compared with srGS. Variants and genes of unknown significance remain the most common finding (58% of nondiagnostic cases). CONCLUSION: Computational prioritization is efficient for diagnostic SNVs. Thorough identification of non-SNVs remains challenging and is partly mitigated using HiFi-GS sequencing. Importantly, community research is supported by sharing real-time data to accelerate gene validation and by providing HiFi variant (SNV/SV) resources from >1000 human alleles to facilitate implementation of new sequencing platforms for rare disease diagnoses.


Assuntos
Genômica , Doenças Raras , Criança , Genoma , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Linhagem , Doenças Raras/diagnóstico , Doenças Raras/genética , Análise de Sequência de DNA
6.
Am J Hum Genet ; 103(3): 389-399, 2018 09 06.
Artigo em Inglês | MEDLINE | ID: mdl-30173820

RESUMO

Recently, to speed up the differential-diagnosis process based on symptoms and signs observed from an affected individual in the diagnosis of rare diseases, researchers have developed and implemented phenotype-driven differential-diagnosis systems. The performance of those systems relies on the quantity and quality of underlying databases of disease-phenotype associations (DPAs). Although such databases are often developed by manual curation, they inherently suffer from limited coverage. To address this problem, we propose a text-mining approach to increase the coverage of DPA databases and consequently improve the performance of differential-diagnosis systems. Our analysis showed that a text-mining approach using one million case reports obtained from PubMed could increase the coverage of manually curated DPAs in Orphanet by 125.6%. We also present PubCaseFinder (see Web Resources), a new phenotype-driven differential-diagnosis system in a freely available web application. By utilizing automatically extracted DPAs from case reports in addition to manually curated DPAs, PubCaseFinder improves the performance of automated differential diagnosis. Moreover, PubCaseFinder helps clinicians search for relevant case reports by using phenotype-based comparisons and confirm the results with detailed contextual information.


Assuntos
Doenças Raras/diagnóstico , Doenças Raras/genética , Mineração de Dados/métodos , Bases de Dados Genéticas , Diagnóstico Diferencial , Humanos , Fenótipo
7.
Nucleic Acids Res ; 45(D1): D865-D876, 2017 01 04.
Artigo em Inglês | MEDLINE | ID: mdl-27899602

RESUMO

Deep phenotyping has been defined as the precise and comprehensive analysis of phenotypic abnormalities in which the individual components of the phenotype are observed and described. The three components of the Human Phenotype Ontology (HPO; www.human-phenotype-ontology.org) project are the phenotype vocabulary, disease-phenotype annotations and the algorithms that operate on these. These components are being used for computational deep phenotyping and precision medicine as well as integration of clinical data into translational research. The HPO is being increasingly adopted as a standard for phenotypic abnormalities by diverse groups such as international rare disease organizations, registries, clinical labs, biomedical resources, and clinical software tools and will thereby contribute toward nascent efforts at global data exchange for identifying disease etiologies. This update article reviews the progress of the HPO project since the debut Nucleic Acids Research database article in 2014, including specific areas of expansion such as common (complex) disease, new algorithms for phenotype driven genomic discovery and diagnostics, integration of cross-species mapping efforts with the Mammalian Phenotype Ontology, an improved quality control pipeline, and the addition of patient-friendly terminology.


Assuntos
Ontologias Biológicas , Biologia Computacional , Genômica , Fenótipo , Algoritmos , Biologia Computacional/métodos , Estudos de Associação Genética/métodos , Genômica/métodos , Humanos , Medicina de Precisão/métodos , Doenças Raras/diagnóstico , Doenças Raras/etiologia , Software , Pesquisa Translacional Biomédica/métodos
8.
Genome Res ; 23(3): 519-29, 2013 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-23204306

RESUMO

High-throughput RNA sequencing (RNA-seq) promises to revolutionize our understanding of genes and their role in human disease by characterizing the RNA content of tissues and cells. The realization of this promise, however, is conditional on the development of effective computational methods for the identification and quantification of transcripts from incomplete and noisy data. In this article, we introduce iReckon, a method for simultaneous determination of the isoforms and estimation of their abundances. Our probabilistic approach incorporates multiple biological and technical phenomena, including novel isoforms, intron retention, unspliced pre-mRNA, PCR amplification biases, and multimapped reads. iReckon utilizes regularized expectation-maximization to accurately estimate the abundances of known and novel isoforms. Our results on simulated and real data demonstrate a superior ability to discover novel isoforms with a significantly reduced number of false-positive predictions, and our abundance accuracy prediction outmatches that of other state-of-the-art tools. Furthermore, we have applied iReckon to two cancer transcriptome data sets, a triple-negative breast cancer patient sample and the MCF7 breast cancer cell line, and show that iReckon is able to reconstruct the complex splicing changes that were not previously identified. QT-PCR validations of the isoforms detected in the MCF7 cell line confirmed all of iReckon's predictions and also showed strong agreement (r(2) = 0.94) with the predicted abundances.


Assuntos
Algoritmos , Simulação por Computador , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Splicing de RNA , Análise de Sequência de RNA/métodos , Feminino , Humanos , Células MCF-7 , Precursores de RNA/genética , Precursores de RNA/metabolismo , RNA Mensageiro/genética , RNA Mensageiro/metabolismo , Transcriptoma , Proteína Supressora de Tumor p53/genética , Proteína Supressora de Tumor p53/metabolismo
9.
Genet Med ; 18(6): 608-17, 2016 06.
Artigo em Inglês | MEDLINE | ID: mdl-26562225

RESUMO

PURPOSE: Medical diagnosis and molecular or biochemical confirmation typically rely on the knowledge of the clinician. Although this is very difficult in extremely rare diseases, we hypothesized that the recording of patient phenotypes in Human Phenotype Ontology (HPO) terms and computationally ranking putative disease-associated sequence variants improves diagnosis, particularly for patients with atypical clinical profiles. METHODS: Using simulated exomes and the National Institutes of Health Undiagnosed Diseases Program (UDP) patient cohort and associated exome sequence, we tested our hypothesis using Exomiser. Exomiser ranks candidate variants based on patient phenotype similarity to (i) known disease-gene phenotypes, (ii) model organism phenotypes of candidate orthologs, and (iii) phenotypes of protein-protein association neighbors. RESULTS: Benchmarking showed Exomiser ranked the causal variant as the top hit in 97% of known disease-gene associations and ranked the correct seeded variant in up to 87% when detectable disease-gene associations were unavailable. Using UDP data, Exomiser ranked the causative variant(s) within the top 10 variants for 11 previously diagnosed variants and achieved a diagnosis for 4 of 23 cases undiagnosed by clinical evaluation. CONCLUSION: Structured phenotyping of patients and computational analysis are effective adjuncts for diagnosing patients with genetic disorders.Genet Med 18 6, 608-617.


Assuntos
Sequenciamento do Exoma/métodos , Exoma/genética , Doenças Raras/genética , Doenças Raras/fisiopatologia , Animais , Biologia Computacional , Bases de Dados Genéticas , Modelos Animais de Doenças , Estudos de Associação Genética , Variação Genética , Humanos , Camundongos , National Institutes of Health (U.S.) , Pacientes , Fenótipo , Doenças Raras/diagnóstico , Doenças Raras/epidemiologia , Estados Unidos , Peixe-Zebra
10.
Hum Mutat ; 36(10): 989-97, 2015 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-26239817

RESUMO

Genomic matchmaking databases (GMDs) allow participants to submit genomic and phenotypic data with the goal of identifying previously uncharacterized disease-associated genes by "matching" to other comparable cases. Current estimates suggest that there are at least 3,000 Mendelian disease-associated genes that have not yet been characterized as such, but the true number may be substantially higher. Therefore, GMDs are addressing a pressing medical need, and it is important to ask how they should be designed and how much data they should strive to contain in order to identify a certain number of these genes. In this work, we argue that genomic matchmaking has similarities to the so-called "birthday paradox," which refers to the observation that within a group of just 23 persons, two people will have the same birthday with probability greater than 50%. We develop a series of simulations to provide a rough estimate of the number of cases required and to explore the influence of parameters such as genetic heterogeneity, mode of inheritance, background variation, precision of phenotypic descriptions, disease prevalence, and the accuracy of bioinformatics pathogenicity prediction programs on the performance of genomic matchmaking.


Assuntos
Doença/genética , Modelos Genéticos , Simulação por Computador , Epidemiologia , Heterogeneidade Genética , Genoma Humano , Humanos
11.
Hum Mutat ; 36(10): 922-7, 2015 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-26255989

RESUMO

Despite the increasing prevalence of clinical sequencing, the difficulty of identifying additional affected families is a key obstacle to solving many rare diseases. There may only be a handful of similar patients worldwide, and their data may be stored in diverse clinical and research databases. Computational methods are necessary to enable finding similar patients across the growing number of patient repositories and registries. We present the Matchmaker Exchange Application Programming Interface (MME API), a protocol and data format for exchanging phenotype and genotype profiles to enable matchmaking among patient databases, facilitate the identification of additional cohorts, and increase the rate with which rare diseases can be researched and diagnosed. We designed the API to be straightforward and flexible in order to simplify its adoption on a large number of data types and workflows. We also provide a public test data set, curated from the literature, to facilitate implementation of the API and development of new matching algorithms. The initial version of the API has been successfully implemented by three members of the Matchmaker Exchange and was immediately able to reproduce previously identified matches and generate several new leads currently being validated. The API is available at https://github.com/ga4gh/mme-apis.


Assuntos
Biologia Computacional/métodos , Disseminação de Informação/métodos , Doenças Raras/genética , Algoritmos , Bases de Dados Genéticas , Predisposição Genética para Doença , Genótipo , Humanos , Fenótipo , Doenças Raras/patologia , Navegador
12.
Hum Mutat ; 36(10): 931-40, 2015 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-26251998

RESUMO

The discovery of disease-causing mutations typically requires confirmation of the variant or gene in multiple unrelated individuals, and a large number of rare genetic diseases remain unsolved due to difficulty identifying second families. To enable the secure sharing of case records by clinicians and rare disease scientists, we have developed the PhenomeCentral portal (https://phenomecentral.org). Each record includes a phenotypic description and relevant genetic information (exome or candidate genes). PhenomeCentral identifies similar patients in the database based on semantic similarity between clinical features, automatically prioritized genes from whole-exome data, and candidate genes entered by the users, enabling both hypothesis-free and hypothesis-driven matchmaking. Users can then contact other submitters to follow up on promising matches. PhenomeCentral incorporates data for over 1,000 patients with rare genetic diseases, contributed by the FORGE and Care4Rare Canada projects, the US NIH Undiagnosed Diseases Program, the EU Neuromics and ANDDIrare projects, as well as numerous independent clinicians and scientists. Though the majority of these records have associated exome data, most lack a molecular diagnosis. PhenomeCentral has already been used to identify causative mutations for several patients, and its ability to find matching patients and diagnose these diseases will grow with each additional patient that is entered.


Assuntos
Predisposição Genética para Doença/genética , Disseminação de Informação/métodos , Doenças Raras/genética , Bases de Dados Genéticas , Variação Genética , Genótipo , Humanos , Fenótipo , Software , Interface Usuário-Computador , Navegador
13.
Hum Mutat ; 36(10): 915-21, 2015 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-26295439

RESUMO

There are few better examples of the need for data sharing than in the rare disease community, where patients, physicians, and researchers must search for "the needle in a haystack" to uncover rare, novel causes of disease within the genome. Impeding the pace of discovery has been the existence of many small siloed datasets within individual research or clinical laboratory databases and/or disease-specific organizations, hoping for serendipitous occasions when two distant investigators happen to learn they have a rare phenotype in common and can "match" these cases to build evidence for causality. However, serendipity has never proven to be a reliable or scalable approach in science. As such, the Matchmaker Exchange (MME) was launched to provide a robust and systematic approach to rare disease gene discovery through the creation of a federated network connecting databases of genotypes and rare phenotypes using a common application programming interface (API). The core building blocks of the MME have been defined and assembled. Three MME services have now been connected through the API and are available for community use. Additional databases that support internal matching are anticipated to join the MME network as it continues to grow.


Assuntos
Predisposição Genética para Doença/genética , Disseminação de Informação/métodos , Doenças Raras/genética , Sistemas de Gerenciamento de Base de Dados , Bases de Dados Genéticas , Estudos de Associação Genética , Humanos , Software
14.
Nat Methods ; 9(5): 473-6, 2012 Mar 18.
Artigo em Inglês | MEDLINE | ID: mdl-22426492

RESUMO

We trained Segway, a dynamic Bayesian network method, simultaneously on chromatin data from multiple experiments, including positions of histone modifications, transcription-factor binding and open chromatin, all derived from a human chronic myeloid leukemia cell line. In an unsupervised fashion, we identified patterns associated with transcription start sites, gene ends, enhancers, transcriptional regulator CTCF-binding regions and repressed regions. Software and genome browser tracks are at http://noble.gs.washington.edu/proj/segway/.


Assuntos
Cromatina/fisiologia , Genoma Humano , Histonas/fisiologia , Sítio de Iniciação de Transcrição , Teorema de Bayes , Cromatina/genética , Histonas/genética , Humanos , Células K562 , Dados de Sequência Molecular , Regiões Promotoras Genéticas , Sequências Reguladoras de Ácido Nucleico , Fatores de Transcrição/genética , Fatores de Transcrição/fisiologia
15.
Bioinformatics ; 29(15): 1843-50, 2013 Aug 01.
Artigo em Inglês | MEDLINE | ID: mdl-23736532

RESUMO

MOTIVATION: The prioritization and identification of disease-causing mutations is one of the most significant challenges in medical genomics. Currently available methods address this problem for non-synonymous single nucleotide variants (SNVs) and variation in promoters/enhancers; however, recent research has implicated synonymous (silent) exonic mutations in a number of disorders. RESULTS: We have curated 33 such variants from literature and developed the Silent Variant Analyzer (SilVA), a machine-learning approach to separate these from among a large set of rare polymorphisms. We evaluate SilVA's performance on in silico 'infection' experiments, in which we implant known disease-causing mutations into a human genome, and show that for 15 of 33 disorders, we rank the implanted mutation among the top five most deleterious ones. Furthermore, we apply the SilVA method to two additional datasets: synonymous variants associated with Meckel syndrome, and a collection of silent variants clinically observed and stratified by a molecular diagnostics laboratory, and show that SilVA is able to accurately predict the harmfulness of silent variants in these datasets. AVAILABILITY: SilVA is open source and is freely available from the project website: http://compbio.cs.toronto.edu/silva CONTACT: silva-snv@cs.toronto.edu SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Inteligência Artificial , Doença/genética , Genoma Humano , Mutação , Transtornos da Motilidade Ciliar/genética , Simulação por Computador , Encefalocele/genética , Éxons , Genômica/métodos , Humanos , Doenças Renais Policísticas/genética , Polimorfismo Genético , Retinose Pigmentar
16.
Orphanet J Rare Dis ; 18(1): 109, 2023 05 09.
Artigo em Inglês | MEDLINE | ID: mdl-37161573

RESUMO

BACKGROUND: Many patients with rare diseases are still lacking a timely diagnosis and approved therapies for their condition despite the tremendous efforts of the research community, biopharmaceutical, medical device industries, and patient support groups. The development of clinical research networks for rare diseases offers a tremendous opportunity for patients and multi-disciplinary teams to collaborate, share expertise, gain better understanding on specific rare diseases, and accelerate clinical research and innovation. Clinical Research Networks have been developed at a national or continental level, but global collaborative efforts to connect them are still lacking. The International Rare Diseases Research Consortium set a Task Force on Clinical Research Networks for Rare Diseases with the objective to analyse the structure and attributes of these networks and to identify the barriers and needs preventing their international collaboration. The Task Force created a survey and sent it to pre-identified clinical research networks located worldwide. RESULTS: A total of 34 responses were received. The survey analysis demonstrated that clinical research networks are diverse in their membership composition and emphasize community partnerships including patient groups, health care providers and researchers. The sustainability of the networks is mostly supported by public funding. Activities and research carried out at the networks span the research continuum from basic to clinical to translational research studies. Key elements and infrastructures conducive to collaboration are well adopted by the networks, but barriers to international interoperability are clearly identified. These hurdles can be grouped into five categories: funding limitation; lack of harmonization in regulatory and contracting process; need for common tools and data standards; need for a governance framework and coordination structures; and lack of awareness and robust interactions between networks. CONCLUSIONS: Through this analysis, the Task Force identified key elements that should support both developing and established clinical research networks for rare diseases in implementing the appropriate structures to achieve international interoperability worldwide. A global roadmap of actions and a specific research agenda, as suggested by this group, provides a platform to identify common goals between these networks.


Assuntos
Produtos Biológicos , Doenças Raras , Humanos , Comitês Consultivos , Pessoal de Saúde , Pesquisa Translacional Biomédica
17.
BMC Bioinformatics ; 12: 415, 2011 Oct 26.
Artigo em Inglês | MEDLINE | ID: mdl-22029426

RESUMO

BACKGROUND: As genome-wide experiments and annotations become more prevalent, researchers increasingly require tools to help interpret data at this scale. Many functional genomics experiments involve partitioning the genome into labeled segments, such that segments sharing the same label exhibit one or more biochemical or functional traits. For example, a collection of ChlP-seq experiments yields a compendium of peaks, each labeled with one or more associated DNA-binding proteins. Similarly, manually or automatically generated annotations of functional genomic elements, including cis-regulatory modules and protein-coding or RNA genes, can also be summarized as genomic segmentations. RESULTS: We present a software toolkit called Segtools that simplifies and automates the exploration of genomic segmentations. The software operates as a series of interacting tools, each of which provides one mode of summarization. These various tools can be pipelined and summarized in a single HTML page. We describe the Segtools toolkit and demonstrate its use in interpreting a collection of human histone modification data sets and Plasmodium falciparum local chromatin structure data sets. CONCLUSIONS: Segtools provides a convenient, powerful means of interpreting a genomic segmentation.


Assuntos
Genoma Humano , Plasmodium falciparum/genética , Software , Expressão Gênica , Genômica , Código das Histonas , Histonas/metabolismo , Humanos
18.
Bioinformatics ; 26(11): 1458-9, 2010 Jun 01.
Artigo em Inglês | MEDLINE | ID: mdl-20435580

RESUMO

SUMMARY: We present a format for efficient storage of multiple tracks of numeric data anchored to a genome. The format allows fast random access to hundreds of gigabytes of data, while retaining a small disk space footprint. We have also developed utilities to load data into this format. We show that retrieving data from this format is more than 2900 times faster than a naive approach using wiggle files. AVAILABILITY AND IMPLEMENTATION: Reference implementation in Python and C components available at http://noble.gs.washington.edu/proj/genomedata/ under the GNU General Public License.


Assuntos
Genoma , Genômica/métodos , Bases de Dados Genéticas , Software
20.
Cell Genom ; 1(2)2021 Nov 10.
Artigo em Inglês | MEDLINE | ID: mdl-35072136

RESUMO

The Global Alliance for Genomics and Health (GA4GH) aims to accelerate biomedical advances by enabling the responsible sharing of clinical and genomic data through both harmonized data aggregation and federated approaches. The decreasing cost of genomic sequencing (along with other genome-wide molecular assays) and increasing evidence of its clinical utility will soon drive the generation of sequence data from tens of millions of humans, with increasing levels of diversity. In this perspective, we present the GA4GH strategies for addressing the major challenges of this data revolution. We describe the GA4GH organization, which is fueled by the development efforts of eight Work Streams and informed by the needs of 24 Driver Projects and other key stakeholders. We present the GA4GH suite of secure, interoperable technical standards and policy frameworks and review the current status of standards, their relevance to key domains of research and clinical care, and future plans of GA4GH. Broad international participation in building, adopting, and deploying GA4GH standards and frameworks will catalyze an unprecedented effort in data sharing that will be critical to advancing genomic medicine and ensuring that all populations can access its benefits.

SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA