Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 41
Filtrar
Más filtros

Banco de datos
País/Región como asunto
Tipo del documento
Intervalo de año de publicación
1.
J Pathol ; 254(2): 109-120, 2021 06.
Artículo en Inglés | MEDLINE | ID: mdl-33779999

RESUMEN

The description of genetic alterations in tumours is of increasing importance. In human genetics, and in pathology reports, sequence alterations are given using the human genome variation society (HGVS) guidelines for the description of such variants. However, there is less adherence to these guidelines for sequence variations in histone genes. Due to early cleavage of the N-terminal methionine in most histones, the description of histone sequence alterations follows their own nomenclature and differs from the HGVS-compliant numbering by omitting this first amino acid. Next generation sequencing reports, however, follow the HGVS guidelines and as a result, an unambiguous description of sequence variants in histones cannot be provided. The coexistence of these two nomenclatures leads to confusions for pathologists, oncologists, and researchers. This review provides an overview of tumour entities with sequence alterations of the H3-3A gene (HGNC ID = HGNC:4764), highlights the problems associated with the coexistence of these two nomenclatures, and proposes a standard for the reporting of histone sequence variants that allows an unambiguous description of these variants according to HGVS principles. We hope that scientific journals will adopt the new notation, and that both geneticists and pathologists will include it in their reports. © 2021 The Authors. The Journal of Pathology published by John Wiley & Sons, Ltd. on behalf of The Pathological Society of Great Britain and Ireland.


Asunto(s)
Variación Genética , Genoma Humano/genética , Histonas/genética , Neoplasias/clasificación , Terminología como Asunto , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Mutación , Neoplasias/genética , Patólogos , Análisis de Secuencia de ADN
2.
Hum Mutat ; 42(1): 3-7, 2021 01.
Artículo en Inglés | MEDLINE | ID: mdl-33252176

RESUMEN

Documenting variation in our genomes is important for research and clinical care. Accuracy in the description of DNA variants is therefore essential. To address this issue, the Human Variome Project convened a committee to evaluate the feasibility of requiring authors to verify that all variants submitted for publication complied with a widely accepted standard for description. After a pilot study of two journals, the committee agreed that requiring authors to verify that variants complied with Human Genome Variation Society nomenclature is a reasonable step toward standardizing the worldwide inventory of human variation.


Asunto(s)
ADN , Genoma Humano , Publicaciones Periódicas como Asunto , Terminología como Asunto , ADN/genética , Variación Genética , Proyecto Genoma Humano , Humanos , Publicaciones Periódicas como Asunto/normas , Proyectos Piloto , Publicaciones/normas
3.
Hum Mutat ; 39(1): 61-68, 2018 01.
Artículo en Inglés | MEDLINE | ID: mdl-28967166

RESUMEN

The Human Genome Variation Society (HGVS) variant nomenclature is widely used to describe sequence variants in scientific publications, clinical reports, and databases. However, the HGVS recommendations are complex and this often results in inaccurate variant descriptions being reported. The open-source hgvs Python package (https://github.com/biocommons/hgvs) provides a programmatic interface for parsing, manipulating, formatting, and validating of variants according to the HGVS recommendations, but does not provide a user-friendly Web interface. We have developed a Web-based variant validation tool, VariantValidator (https://variantvalidator.org/), which utilizes the hgvs Python package and provides additional functionality to assist users who wish to accurately describe and report sequence-level variations that are compliant with the HGVS recommendations. VariantValidator was designed to ensure that users are guided through the intricacies of the HGVS nomenclature, for example, if the user makes a mistake, VariantValidator automatically corrects the mistake if it can, or provides helpful guidance if it cannot. In addition, VariantValidator has the facility to interconvert genomic variant descriptions in HGVS and Variant Call Format with a degree of accuracy that surpasses most competing solutions.


Asunto(s)
Biología Computacional/métodos , Variación Genética , Programas Informáticos , Mapeo Cromosómico/métodos , Bases de Datos Genéticas , Exones , Humanos , Intrones , Reproducibilidad de los Resultados , Análisis de Secuencia de ADN/métodos , Interfaz Usuario-Computador , Navegador Web
4.
Hum Mutat ; 39(12): 1803-1813, 2018 12.
Artículo en Inglés | MEDLINE | ID: mdl-30129167

RESUMEN

The Human Genome Variation Society (HGVS) nomenclature guidelines encourage the accurate and standard description of DNA, RNA, and protein sequence variants in public variant databases and the scientific literature. Inconsistent application of the HGVS guidelines can lead to misinterpretation of variants in clinical settings. Reliable software tools are essential to ensure consistent application of the HGVS guidelines when reporting and interpreting variants. We present the hgvs Python package, a comprehensive tool for manipulating sequence variants according to the HGVS nomenclature guidelines. Distinguishing features of the hgvs package include: (1) parsing, formatting, validating, and normalizing variants on genome, transcript, and protein sequences; (2) projecting variants between aligned sequences, including those with gapped alignments; (3) flexible installation using remote or local data (fully local installations eliminate network dependencies); (4) extensive automated tests; and (5) open source development by a community from eight organizations worldwide. This report summarizes recent and significant updates to the hgvs package since its original release in 2014, and presents results of extensive validation using clinical relevant variants from ClinVar and HGMD.


Asunto(s)
Biología Computacional/métodos , Bases de Datos Genéticas , Variación Genética , Genoma Humano , Guías como Asunto , Humanos , Sociedades Médicas , Programas Informáticos
6.
Hum Mutat ; 37(6): 532-9, 2016 06.
Artículo en Inglés | MEDLINE | ID: mdl-26919551

RESUMEN

Locus specific databases (LSDBs) make a key contribution to our understanding of heritable and acquired human disorders, disease susceptibility, and adverse drug reactions. As data have accumulated in LSDBs, a greater reliance on their use has arisen in clinical practice. Even though LSDBs have existed in recognizable form for only a quarter of a century, their origin lies in the manual cataloging of data that began around 50 years ago. Analysis and recording of sequence variation in the globin genes, and the proteins which they encode, can confidently be said to be the foundation for what we now refer to as LSDBs. Their growth over the years has primarily been underpinned by software developments and the advent of the World Wide Web. However, it is also important to recognize the evolution of reporting standards and reference sequences, without which accurate and consistent reporting of sequence variants would be impossible. Nowadays, LSDBs exist for many human protein-coding genes and the focus of efforts has moved toward minor tidying up of the variant reporting nomenclature and processes for assuring the completeness, correctness, and consistency of the data. The next 25 years will doubtless witness further developments in the evolution of LSDBs.


Asunto(s)
Bases de Datos Genéticas/normas , Biología Computacional , Predisposición Genética a la Enfermedad , Variación Genética , Humanos , Navegador Web
7.
Hum Mutat ; 37(6): 564-9, 2016 06.
Artículo en Inglés | MEDLINE | ID: mdl-26931183

RESUMEN

The consistent and unambiguous description of sequence variants is essential to report and exchange information on the analysis of a genome. In particular, DNA diagnostics critically depends on accurate and standardized description and sharing of the variants detected. The sequence variant nomenclature system proposed in 2000 by the Human Genome Variation Society has been widely adopted and has developed into an internationally accepted standard. The recommendations are currently commissioned through a Sequence Variant Description Working Group (SVD-WG) operating under the auspices of three international organizations: the Human Genome Variation Society (HGVS), the Human Variome Project (HVP), and the Human Genome Organization (HUGO). Requests for modifications and extensions go through the SVD-WG following a standard procedure including a community consultation step. Version numbers are assigned to the nomenclature system to allow users to specify the version used in their variant descriptions. Here, we present the current recommendations, HGVS version 15.11, and briefly summarize the changes that were made since the 2000 publication. Most focus has been on removing inconsistencies and tightening definitions allowing automatic data processing. An extensive version of the recommendations is available online, at http://www.HGVS.org/varnomen.


Asunto(s)
Variación Genética , Proyecto Genoma Humano/organización & administración , Terminología como Asunto , Genoma Humano , Guías como Asunto , Humanos , Análisis de Secuencia de ADN
8.
Nucleic Acids Res ; 42(Database issue): D873-8, 2014 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-24285302

RESUMEN

Locus Reference Genomic (LRG; http://www.lrg-sequence.org/) records contain internationally recognized stable reference sequences designed specifically for reporting clinically relevant sequence variants. Each LRG is contained within a single file consisting of a stable 'fixed' section and a regularly updated 'updatable' section. The fixed section contains stable genomic DNA sequence for a genomic region, essential transcripts and proteins for variant reporting and an exon numbering system. The updatable section contains mapping information, annotation of all transcripts and overlapping genes in the region and legacy exon and amino acid numbering systems. LRGs provide a stable framework that is vital for reporting variants, according to Human Genome Variation Society (HGVS) conventions, in genomic DNA, transcript or protein coordinates. To enable translation of information between LRG and genomic coordinates, LRGs include mapping to the human genome assembly. LRGs are compiled and maintained by the National Center for Biotechnology Information (NCBI) and European Bioinformatics Institute (EBI). LRG reference sequences are selected in collaboration with the diagnostic and research communities, locus-specific database curators and mutation consortia. Currently >700 LRGs have been created, of which >400 are publicly available. The aim is to create an LRG for every locus with clinical implications.


Asunto(s)
Bases de Datos Genéticas , Variación Genética , Genoma Humano , Exones , Sitios Genéticos , Genómica/normas , Humanos , Internet , Proteínas/genética , ARN Mensajero/química , Estándares de Referencia
9.
Hum Mutat ; 36(10): 957-64, 2015 Oct.
Artículo en Inglés | MEDLINE | ID: mdl-26224250

RESUMEN

Biomedical data sharing is desirable, but problematic. Data "discovery" approaches-which establish the existence rather than the substance of data-precisely connect data owners with data seekers, and thereby promote data sharing. Cafe Variome (http://www.cafevariome.org) was therefore designed to provide a general-purpose, Web-based, data discovery tool that can be quickly installed by any genotype-phenotype data owner, or network of data owners, to make safe or sensitive content appropriately discoverable. Data fields or content of any type can be accommodated, from simple ID and label fields through to extensive genotype and phenotype details based on ontologies. The system provides a "shop window" in front of data, with main interfaces being a simple search box and a powerful "query-builder" that enable very elaborate queries to be formulated. After a successful search, counts of records are reported grouped by "openAccess" (data may be directly accessed), "linkedAccess" (a source link is provided), and "restrictedAccess" (facilitated data requests and subsequent provision of approved records). An administrator interface provides a wide range of options for system configuration, enabling highly customized single-site or federated networks to be established. Current uses include rare disease data discovery, patient matchmaking, and a Beacon Web service.


Asunto(s)
Bases de Datos Bibliográficas , Difusión de la Información/métodos , Enfermedades Raras/genética , Predisposición Genética a la Enfermedad , Genotipo , Humanos , Fenotipo , Programas Informáticos , Interfaz Usuario-Computador , Navegador Web
10.
Pediatr Nephrol ; 30(11): 1893-901, 2015 Nov.
Artículo en Inglés | MEDLINE | ID: mdl-25384529

RESUMEN

A recent review identified 60 common inherited renal diseases caused by DNA variants in 132 different genes. These diseases can be diagnosed with DNA sequencing, but each gene probably also has a thousand normal variants. Many more normal variants have been characterised by individual laboratories than are reported in the literature or found in publicly accessible collections. At present, testing laboratories must assess each novel change they identify for pathogenicity, even when this has been done elsewhere previously, and the distinction between normal and disease-associated variants is particularly an issue with the recent surge in exomic sequencing and gene discovery projects. The Human Variome Project recommends the establishment of gene-specific DNA variant databases to facilitate the sharing of DNA variants and decisions about likely disease causation. Databases improve diagnostic accuracy and testing efficiency, and reduce costs. They also help with genotype-phenotype correlations and predictive algorithms. The Human Variome Project advocates databases that use standardised descriptions, are up-to-date, include clinical information and are freely available. Currently, the genes affected in the most common inherited renal diseases correspond to 350 different variant databases, many of which are incomplete or have insufficient clinical details for genotype-phenotype correlations. Assistance is needed from nephrologists to maximise the usefulness of these databases for the diagnosis and management of inherited renal disease.


Asunto(s)
Bases de Datos de Ácidos Nucleicos/normas , Enfermedades Renales/genética , Predisposición Genética a la Enfermedad/genética , Humanos , Mutación
11.
Hum Mutat ; 35(2): 202-7, 2014 Feb.
Artículo en Inglés | MEDLINE | ID: mdl-24186831

RESUMEN

Triggered by the sequencing of the human genome, personalized medicine has been one of the fastest growing research areas in the last decade. Multiple software and hardware technologies have been developed by several projects, culminating in the exponential growth of genetic data. Considering the technological developments in this field, it is now fairly easy and inexpensive to obtain genetic profiles for unique individuals, such as those performed by several genetic analysis companies. The availability of computational tools that simplify genetic data analysis and the disclosure of biomedical evidences are of utmost importance. We present Variobox, a desktop tool to annotate, analyze, and compare human genes. Variobox obtains variant annotation data from WAVe, protein metadata annotations from Protein Data Bank, and sequences are obtained from Locus Reference Genomic or RefSeq databases. To explore the data, Variobox provides an advanced sequence visualization that enables agile navigation through genetic regions. DNA sequencing data can be compared with reference sequences retrieved from LRG or RefSeq records, identifying and automatically annotating new potential variants. These features and data, ranging from patient sequences to HGVS-compliant variant descriptions, are combined in an intuitive interface to analyze genes and variants. Variobox is a Java application, available at http://bioinformatics.ua.pt/variobox.


Asunto(s)
Biología Computacional/métodos , Bases de Datos Genéticas , Variación Genética , Genoma Humano , Anotación de Secuencia Molecular , Secuencia de Aminoácidos , Secuencia de Bases , Humanos , Medicina de Precisión , Reproducibilidad de los Resultados , Programas Informáticos
12.
Orphanet J Rare Dis ; 19(1): 272, 2024 Jul 19.
Artículo en Inglés | MEDLINE | ID: mdl-39030555

RESUMEN

A paper published in Orphanet Journal of Rare Diseases proposes a new classification of osteogenesis imperfecta (OI) based upon underlying pathological mechanisms. The proposed numbering of OI types conflicts with the currently used numbering and is likely to lead to confusion. In addition, classification of OI according to underlying pathogenic mechanisms is not novel.


Asunto(s)
Osteogénesis Imperfecta , Osteogénesis Imperfecta/clasificación , Osteogénesis Imperfecta/patología , Humanos
13.
medRxiv ; 2024 May 29.
Artículo en Inglés | MEDLINE | ID: mdl-38854034

RESUMEN

The Global Alliance for Genomics and Health (GA4GH) Phenopacket Schema was released in 2022 and approved by ISO as a standard for sharing clinical and genomic information about an individual, including phenotypic descriptions, numerical measurements, genetic information, diagnoses, and treatments. A phenopacket can be used as an input file for software that supports phenotype-driven genomic diagnostics and for algorithms that facilitate patient classification and stratification for identifying new diseases and treatments. There has been a great need for a collection of phenopackets to test software pipelines and algorithms. Here, we present phenopacket-store. Version 0.1.12 of phenopacket-store includes 4916 phenopackets representing 277 Mendelian and chromosomal diseases associated with 236 genes, and 2872 unique pathogenic alleles curated from 605 different publications. This represents the first large-scale collection of case-level, standardized phenotypic information derived from case reports in the literature with detailed descriptions of the clinical data and will be useful for many purposes, including the development and testing of software for prioritizing genes and diseases in diagnostic genomics, machine learning analysis of clinical phenotype data, patient stratification, and genotype-phenotype correlations. This corpus also provides best-practice examples for curating literature-derived data using the GA4GH Phenopacket Schema.

14.
BMC Bioinformatics ; 13: 254, 2012 Oct 03.
Artículo en Inglés | MEDLINE | ID: mdl-23031277

RESUMEN

BACKGROUND: Sharing of data about variation and the associated phenotypes is a critical need, yet variant information can be arbitrarily complex, making a single standard vocabulary elusive and re-formatting difficult. Complex standards have proven too time-consuming to implement. RESULTS: The GEN2PHEN project addressed these difficulties by developing a comprehensive data model for capturing biomedical observations, Observ-OM, and building the VarioML format around it. VarioML pairs a simplified open specification for describing variants, with a toolkit for adapting the specification into one's own research workflow. Straightforward variant data can be captured, federated, and exchanged with no overhead; more complex data can be described, without loss of compatibility. The open specification enables push-button submission to gene variant databases (LSDBs) e.g., the Leiden Open Variation Database, using the Cafe Variome data publishing service, while VarioML bidirectionally transforms data between XML and web-application code formats, opening up new possibilities for open source web applications building on shared data. A Java implementation toolkit makes VarioML easily integrated into biomedical applications. VarioML is designed primarily for LSDB data submission and transfer scenarios, but can also be used as a standard variation data format for JSON and XML document databases and user interface components. CONCLUSIONS: VarioML is a set of tools and practices improving the availability, quality, and comprehensibility of human variation information. It enables researchers, diagnostic laboratories, and clinics to share that information with ease, clarity, and without ambiguity.


Asunto(s)
Bases de Datos Genéticas , Enfermedad/genética , Variación Genética , Difusión de la Información/métodos , Sistemas de Computación , Humanos
15.
Hum Mutat ; 33(2): 298-305, 2012 Feb.
Artículo en Inglés | MEDLINE | ID: mdl-22052659

RESUMEN

Information about genetic variation has been collected for some 20 years into registries, known as locus specific databases (LSDBs), which nowadays often contain information in addition to the actual genetic variation. Several issues have to be taken into account when considering establishing and maintaining LSDBs and these have been discussed previously in a number of articles describing guidelines and recommendations. This information is widely scattered and, for a newcomer, it would be difficult to obtain the latest information and guidance. Here, a sequence of steps essential for establishing an LSDB is discussed together with guidelines for each step. Curators need to collect information from various sources, code it in systematic way, and distribute to the research and clinical communities. In doing this, ethical issues have to be taken into account. To facilitate integration of information to, for example, analyze genotype-phenotype correlations, systematic data representation using established nomenclatures, data models, and ontologies is essential. LSDB curation and maintenance comprises a number of tasks that can be managed by following logical steps. These resources are becoming ever more important and new curators are essential to ensure that we will have expertly curated databases for all disease-related genes in the near future.


Asunto(s)
Bases de Datos Genéticas/normas , Sitios Genéticos , Biología Computacional/métodos , Biología Computacional/normas , Sistemas de Administración de Bases de Datos , Bases de Datos Genéticas/ética , Documentación , Genes , Estudios de Asociación Genética , Guías como Asunto , Humanos , Difusión de la Información
16.
Hum Mutat ; 33(2): 291-7, 2012 Feb.
Artículo en Inglés | MEDLINE | ID: mdl-21990126

RESUMEN

Gene variant databases or Locus-Specific DataBases (LSDBs) are used to collect and display information on sequence variants on a gene-by-gene basis. Their most frequent use is in relation to DNA-based diagnostics, giving clinicians and scientists easy access to an up-to-date overview of all gene variants identified worldwide and whether they influence the function of the gene ("pathogenic or not"). While literature on gene variant databases is extensive, little has been published on the process of database curation itself. Based on our extensive experience as LSDB curators and our contributions to database curation courses, we discuss the subject of database curation. We describe the tasks involved, the steps to take, and the issues that might occur. Our overview is a first step toward establishing overall guidelines for database curation and ultimately covers one aspect of establishing quality-assured gene variant databases.


Asunto(s)
Bases de Datos Genéticas/normas , Genes , Variación Genética , Biología Computacional/métodos , Biología Computacional/normas , Humanos , Internet , Programas Informáticos
17.
Hum Mutat ; 33(10): 1494-6, 2012 Oct.
Artículo en Inglés | MEDLINE | ID: mdl-22623360

RESUMEN

The joint Open PHACTS/GEN2PHEN workshop on "Solving Bottlenecks in Data Sharing in the Life Sciences" was held in Volendam, the Netherlands, on September 19 and 20, 2011, and was attended by representatives from academia, industry, publishing, and funding agencies. The aim of the workshop was to explore the issues that influence the extent to which data in the life sciences are shared, and to explore sustainability scenarios that would enable and promote "open" data sharing. Several key challenges were identified and solutions to each of these were proposed.


Asunto(s)
Disciplinas de las Ciencias Biológicas/organización & administración , Difusión de la Información , Disciplinas de las Ciencias Biológicas/legislación & jurisprudencia , Humanos
18.
Dis Model Mech ; 15(5)2022 05 01.
Artículo en Inglés | MEDLINE | ID: mdl-35575034

RESUMEN

Osteogenesis imperfecta (OI) is a heterogeneous family of collagen type I-related diseases characterized by bone fragility. OI is most commonly caused by single-nucleotide substitutions that replace glycine residues or exon splicing defects in the COL1A1 and COL1A2 genes that encode the α1(I) and α2(I) collagen chains. Mutant collagen is partially retained intracellularly, impairing cell homeostasis. Upon secretion, it assembles in disorganized fibrils, altering mineralization. OI is characterized by a wide range of clinical outcomes, even in the presence of identical sequence variants. Given the heterotrimeric nature of collagen I, its amino acid composition and the peculiarity of its folding, several causes may underlie the phenotypic variability of OI. A deep analysis of entries regarding glycine and splice site collagen substitution of the largest publicly available patient database reveals a higher risk of lethal phenotype for carriers of variants in α1(I) than in α2(I) chain. However, splice site variants are predominantly associated with lethal phenotype when they occur in COL1A2. In addition, lethality is increased when mutations occur in regions of importance for extracellular matrix interactions. Both extracellular and intracellular determinants of OI clinical severity are discussed in light of the findings from in vitro and in vivo OI models. Combined with meticulous tracking of clinical cases via a publicly available database, the available OI animal models have proven to be a unique tool to shed light on new modulators of phenotype determination for this rare heterogeneous disease.


Asunto(s)
Osteogénesis Imperfecta , Animales , Variación Biológica Poblacional , Colágeno/metabolismo , Cadena alfa 1 del Colágeno Tipo I , Glicina/genética , Humanos , Mutación/genética , Osteogénesis Imperfecta/genética , Fenotipo
19.
Hum Mutat ; 32(7): 729-34, 2011 Jul.
Artículo en Inglés | MEDLINE | ID: mdl-21394829

RESUMEN

DNA sequence variation is the underlying basis of common human traits and rarer single-gene disorders. Understanding the variome, the variants in an individual's genome, is essential to enable the ultimate goals of personalized medicine. This critical research field has grown dramatically in recent years, mostly due to the spread and development of genotyping technologies. Despite these activities being promoted by the Human Genome Variation Society and projects such as the Human Variome Project or the European GEN2PHEN Project, variome data-integration systems are far from being widely used in the research community workflow. Most of ongoing research is focused on improving locus-specific databases. Although the quality and manual curation of LSDBs adds true value to this domain, they are often narrow, heterogeneous, and independent systems. This hampers data harmonization and interoperability between systems, stifling the aggregation of data from LSDBs and related data sources. A new platform entitled Web Analysis of the Variome, WAVe, is introduced. It offers direct and programmatic access to multiple locus-specific databases, with the integration of genetic variation datasets and enrichment with relevant information. WAVe's agile and innovative Web interface is accessible at http://bioinformatics.ua.pt/WAVe.


Asunto(s)
Bases de Datos Genéticas , Variación Genética , Medicina de Precisión , Programas Informáticos , Estudios de Asociación Genética , Genoma Humano , Genotipo , Humanos , Individualidad , Internet , Integración de Sistemas
20.
Leukemia ; 35(11): 3040-3043, 2021 11.
Artículo en Inglés | MEDLINE | ID: mdl-34615987

RESUMEN

Gene fusions have been discussed in the scientific literature since they were first detected in cancer cells in the early 1980s. There is currently no standardized way to denote the genes involved in fusions, but in the majority of publications the gene symbols in question are listed either separated by a hyphen (-) or by a forward slash (/). Both types of designation suffer from important shortcomings. HGNC has worked with the scientific community to determine a new, instantly recognizable and unique separator-a double colon (::)-to be used in the description of fusion genes, and advocates its usage in all databases and articles describing gene fusions.


Asunto(s)
Bases de Datos Genéticas , Genómica/métodos , Guías como Asunto/normas , Leucemia/genética , Proteínas de Fusión Oncogénica/clasificación , Proteínas de Fusión Oncogénica/genética , Terminología como Asunto , Consenso , Humanos , Leucemia/patología
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA