Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 133
Filtrar
1.
Environ Sci Technol ; 58(17): 7256-7269, 2024 Apr 30.
Artículo en Inglés | MEDLINE | ID: mdl-38641325

RESUMEN

Through investigating the combined impact of the environmental exposures experienced by an individual throughout their lifetime, exposome research provides opportunities to understand and mitigate negative health outcomes. While current exposome research is driven by epidemiological studies that identify associations between exposures and effects, new frameworks integrating more substantial population-level metadata, including electronic health and administrative records, will shed further light on characterizing environmental exposure risks. Molecular biology offers methods and concepts to study the biological and health impacts of exposomes in experimental and computational systems. Of particular importance is the growing use of omics readouts in epidemiological and clinical studies. This paper calls for the adoption of mechanistic molecular biology approaches in exposome research as an essential step in understanding the genotype and exposure interactions underlying human phenotypes. A series of recommendations are presented to make the necessary and appropriate steps to move from exposure association to causation, with a huge potential to inform precision medicine and population health. This includes establishing hypothesis-driven laboratory testing within the exposome field, supported by appropriate methods to read across from model systems research to human.


Asunto(s)
Exposición a Riesgos Ambientales , Exposoma , Humanos , Biología Molecular
2.
Genome Med ; 16(1): 33, 2024 Feb 19.
Artículo en Inglés | MEDLINE | ID: mdl-38373998

RESUMEN

Polygenic scores (PGS) can be used for risk stratification by quantifying individuals' genetic predisposition to disease, and many potentially clinically useful applications have been proposed. Here, we review the latest potential benefits of PGS in the clinic and challenges to implementation. PGS could augment risk stratification through combined use with traditional risk factors (demographics, disease-specific risk factors, family history, etc.), to support diagnostic pathways, to predict groups with therapeutic benefits, and to increase the efficiency of clinical trials. However, there exist challenges to maximizing the clinical utility of PGS, including FAIR (Findable, Accessible, Interoperable, and Reusable) use and standardized sharing of the genomic data needed to develop and recalculate PGS, the equitable performance of PGS across populations and ancestries, the generation of robust and reproducible PGS calculations, and the responsible communication and interpretation of results. We outline how these challenges may be overcome analytically and with more diverse data as well as highlight sustained community efforts to achieve equitable, impactful, and responsible use of PGS in healthcare.


Asunto(s)
Comunicación , Predisposición Genética a la Enfermedad , Humanos , Genómica , Herencia Multifactorial , Factores de Riesgo , Estudio de Asociación del Genoma Completo
3.
EMBO J ; 42(23): e115008, 2023 Dec 01.
Artículo en Inglés | MEDLINE | ID: mdl-37964598

RESUMEN

The main goals and challenges for the life science communities in the Open Science framework are to increase reuse and sustainability of data resources, software tools, and workflows, especially in large-scale data-driven research and computational analyses. Here, we present key findings, procedures, effective measures and recommendations for generating and establishing sustainable life science resources based on the collaborative, cross-disciplinary work done within the EOSC-Life (European Open Science Cloud for Life Sciences) consortium. Bringing together 13 European life science research infrastructures, it has laid the foundation for an open, digital space to support biological and medical research. Using lessons learned from 27 selected projects, we describe the organisational, technical, financial and legal/ethical challenges that represent the main barriers to sustainability in the life sciences. We show how EOSC-Life provides a model for sustainable data management according to FAIR (findability, accessibility, interoperability, and reusability) principles, including solutions for sensitive- and industry-related resources, by means of cross-disciplinary training and best practices sharing. Finally, we illustrate how data harmonisation and collaborative work facilitate interoperability of tools, data, solutions and lead to a better understanding of concepts, semantics and functionalities in the life sciences.


Asunto(s)
Disciplinas de las Ciencias Biológicas , Investigación Biomédica , Programas Informáticos , Flujo de Trabajo
4.
PLoS Genet ; 19(9): e1010932, 2023 Sep.
Artículo en Inglés | MEDLINE | ID: mdl-37721944

RESUMEN

The eQTL Catalogue is an open database of uniformly processed human molecular quantitative trait loci (QTLs). We are continuously updating the resource to further increase its utility for interpreting genetic associations with complex traits. Over the past two years, we have increased the number of uniformly processed studies from 21 to 31 and added X chromosome QTLs for 19 compatible studies. We have also implemented Leafcutter to directly identify splice-junction usage QTLs in all RNA sequencing datasets. Finally, to improve the interpretability of transcript-level QTLs, we have developed static QTL coverage plots that visualise the association between the genotype and average RNA sequencing read coverage in the region for all 1.7 million fine mapped associations. To illustrate the utility of these updates to the eQTL Catalogue, we performed colocalisation analysis between vitamin D levels in the UK Biobank and all molecular QTLs in the eQTL Catalogue. Although most GWAS loci colocalised both with eQTLs and transcript-level QTLs, we found that visual inspection could sometimes be used to distinguish primary splicing QTLs from those that appear to be secondary consequences of large-effect gene expression QTLs. While these visually confirmed primary splicing QTLs explain just 6/53 of the colocalising signals, they are significantly less pleiotropic than eQTLs and identify a prioritised causal gene in 4/6 cases.


Asunto(s)
Herencia Multifactorial , Sitios de Carácter Cuantitativo , Humanos , Sitios de Carácter Cuantitativo/genética , Genotipo , Secuencia de Bases , Estudio de Asociación del Genoma Completo , Polimorfismo de Nucleótido Simple
5.
J Biomed Semantics ; 14(1): 6, 2023 06 01.
Artículo en Inglés | MEDLINE | ID: mdl-37264430

RESUMEN

BACKGROUND: The Findable, Accessible, Interoperable and Reusable(FAIR) Principles explicitly require the use of FAIR vocabularies, but what precisely constitutes a FAIR vocabulary remains unclear. Being able to define FAIR vocabularies, identify features of FAIR vocabularies, and provide assessment approaches against the features can guide the development of vocabularies. RESULTS: We differentiate data, data resources and vocabularies used for FAIR, examine the application of the FAIR Principles to vocabularies, align their requirements with the Open Biomedical Ontologies principles, and propose FAIR Vocabulary Features. We also design assessment approaches for FAIR vocabularies by mapping the FVFs with existing FAIR assessment indicators. Finally, we demonstrate how they can be used for evaluating and improving vocabularies using exemplary biomedical vocabularies. CONCLUSIONS: Our work proposes features of FAIR vocabularies and corresponding indicators for assessing the FAIR levels of different types of vocabularies, identifies use cases for vocabulary engineers, and guides the evolution of vocabularies.


Asunto(s)
Ontologías Biológicas , Vocabulario Controlado , Vocabulario
6.
bioRxiv ; 2023 Apr 07.
Artículo en Inglés | MEDLINE | ID: mdl-37066341

RESUMEN

Splicing quantitative trait loci (QTLs) have been implicated as a common mechanism underlying complex trait associations. However, utilising splicing QTLs in target discovery and prioritisation has been challenging due to extensive data normalisation which often renders the direction of the genetic effect as well as its magnitude difficult to interpret. This is further complicated by the fact that strong expression QTLs often manifest as weak splicing QTLs and vice versa, making it difficult to uniquely identify the underlying molecular mechanism at each locus. We find that these ambiguities can be mitigated by visualising the association between the genotype and average RNA sequencing read coverage in the region. Here, we generate these QTL coverage plots for 1.7 million molecular QTL associations in the eQTL Catalogue identified with five quantification methods. We illustrate the utility of these QTL coverage plots by performing colocalisation between vitamin D levels in the UK Biobank and all molecular QTLs in the eQTL Catalogue. We find that while visually confirmed splicing QTLs explain just 6/53 of the colocalising signals, they are significantly less pleiotropic than eQTLs and identify a prioritised causal gene in 4/6 cases. All our association summary statistics and QTL coverage plots are freely available at https://www.ebi.ac.uk/eqtl/.

7.
Nature ; 616(7955): 123-131, 2023 04.
Artículo en Inglés | MEDLINE | ID: mdl-36991119

RESUMEN

The use of omic modalities to dissect the molecular underpinnings of common diseases and traits is becoming increasingly common. But multi-omic traits can be genetically predicted, which enables highly cost-effective and powerful analyses for studies that do not have multi-omics1. Here we examine a large cohort (the INTERVAL study2; n = 50,000 participants) with extensive multi-omic data for plasma proteomics (SomaScan, n = 3,175; Olink, n = 4,822), plasma metabolomics (Metabolon HD4, n = 8,153), serum metabolomics (Nightingale, n = 37,359) and whole-blood Illumina RNA sequencing (n = 4,136), and use machine learning to train genetic scores for 17,227 molecular traits, including 10,521 that reach Bonferroni-adjusted significance. We evaluate the performance of genetic scores through external validation across cohorts of individuals of European, Asian and African American ancestries. In addition, we show the utility of these multi-omic genetic scores by quantifying the genetic control of biological pathways and by generating a synthetic multi-omic dataset of the UK Biobank3 to identify disease associations using a phenome-wide scan. We highlight a series of biological insights with regard to genetic mechanisms in metabolism and canonical pathway associations with disease; for example, JAK-STAT signalling and coronary atherosclerosis. Finally, we develop a portal ( https://www.omicspred.org/ ) to facilitate public access to all genetic scores and validation results, as well as to serve as a platform for future extensions and enhancements of multi-omic genetic scores.


Asunto(s)
Enfermedad de la Arteria Coronaria , Multiómica , Humanos , Enfermedad de la Arteria Coronaria/genética , Enfermedad de la Arteria Coronaria/metabolismo , Metabolómica/métodos , Fenotipo , Proteómica/métodos , Aprendizaje Automático , Negro o Afroamericano/genética , Asiático/genética , Pueblo Europeo/genética , Reino Unido , Conjuntos de Datos como Asunto , Internet , Reproducibilidad de los Resultados , Estudios de Cohortes , Proteoma/análisis , Proteoma/metabolismo , Metaboloma , Plasma/metabolismo , Bases de Datos Factuales
8.
Front Physiol ; 14: 1076533, 2023.
Artículo en Inglés | MEDLINE | ID: mdl-36776967

RESUMEN

As a model organism, Drosophila is uniquely placed to contribute to our understanding of how brains control complex behavior. Not only does it have complex adaptive behaviors, but also a uniquely powerful genetic toolkit, increasingly complete dense connectomic maps of the central nervous system and a rapidly growing set of transcriptomic profiles of cell types. But this also poses a challenge: Given the massive amounts of available data, how are researchers to Find, Access, Integrate and Reuse (FAIR) relevant data in order to develop an integrated anatomical and molecular picture of circuits, inform hypothesis generation, and find reagents for experiments to test these hypotheses? The Virtual Fly Brain (virtualflybrain.org) web application & API provide a solution to this problem, using FAIR principles to integrate 3D images of neurons and brain regions, connectomics, transcriptomics and reagent expression data covering the whole CNS in both larva and adult. Users can search for neurons, neuroanatomy and reagents by name, location, or connectivity, via text search, clicking on 3D images, search-by-image, and queries by type (e.g., dopaminergic neuron) or properties (e.g., synaptic input in the antennal lobe). Returned results include cross-registered 3D images that can be explored in linked 2D and 3D browsers or downloaded under open licenses, and extensive descriptions of cell types and regions curated from the literature. These solutions are potentially extensible to cover similar atlasing and data integration challenges in vertebrates.

9.
BMC Biol ; 21(1): 22, 2023 02 03.
Artículo en Inglés | MEDLINE | ID: mdl-36737727

RESUMEN

BACKGROUND: Microphthalmia, anophthalmia, and coloboma (MAC) spectrum disease encompasses a group of eye malformations which play a role in childhood visual impairment. Although the predominant cause of eye malformations is known to be heritable in nature, with 80% of cases displaying loss-of-function mutations in the ocular developmental genes OTX2 or SOX2, the genetic abnormalities underlying the remaining cases of MAC are incompletely understood. This study intended to identify the novel genes and pathways required for early eye development. Additionally, pathways involved in eye formation during embryogenesis are also incompletely understood. This study aims to identify the novel genes and pathways required for early eye development through systematic forward screening of the mammalian genome. RESULTS: Query of the International Mouse Phenotyping Consortium (IMPC) database (data release 17.0, August 01, 2022) identified 74 unique knockout lines (genes) with genetically associated eye defects in mouse embryos. The vast majority of eye abnormalities were small or absent eyes, findings most relevant to MAC spectrum disease in humans. A literature search showed that 27 of the 74 lines had previously published knockout mouse models, of which only 15 had ocular defects identified in the original publications. These 12 previously published gene knockouts with no reported ocular abnormalities and the 47 unpublished knockouts with ocular abnormalities identified by the IMPC represent 59 genes not previously associated with early eye development in mice. Of these 59, we identified 19 genes with a reported human eye phenotype. Overall, mining of the IMPC data yielded 40 previously unimplicated genes linked to mammalian eye development. Bioinformatic analysis showed that several of the IMPC genes colocalized to several protein anabolic and pluripotency pathways in early eye development. Of note, our analysis suggests that the serine-glycine pathway producing glycine, a mitochondrial one-carbon donator to folate one-carbon metabolism (FOCM), is essential for eye formation. CONCLUSIONS: Using genome-wide phenotype screening of single-gene knockout mouse lines, STRING analysis, and bioinformatic methods, this study identified genes heretofore unassociated with MAC phenotypes providing models to research novel molecular and cellular mechanisms involved in eye development. These findings have the potential to hasten the diagnosis and treatment of this congenital blinding disease.


Asunto(s)
Anoftalmos , Coloboma , Anomalías del Ojo , Microftalmía , Humanos , Ratones , Animales , Anomalías del Ojo/genética , Anoftalmos/genética , Microftalmía/genética , Coloboma/genética , Ratones Noqueados , Desarrollo Embrionario/genética , Fenotipo , Ojo , Mamíferos
10.
Bioinformatics ; 39(1)2023 01 01.
Artículo en Inglés | MEDLINE | ID: mdl-36511598

RESUMEN

MOTIVATION: Since early 2020, the coronavirus disease 2019 (COVID-19) pandemic has confronted the biomedical community with an unprecedented challenge. The rapid spread of COVID-19 and ease of transmission seen worldwide is due to increased population flow and international trade. Front-line medical care, treatment research and vaccine development also require rapid and informative interpretation of the literature and COVID-19 data produced around the world, with 177 500 papers published between January 2020 and November 2021, i.e. almost 8500 papers per month. To extract knowledge and enable interoperability across resources, we developed the COVID-19 Vocabulary (COVoc), an application ontology related to the research on this pandemic. The main objective of COVoc development was to enable seamless navigation from biomedical literature to core databases and tools of ELIXIR, a European-wide intergovernmental organization for life sciences. RESULTS: This collaborative work provided data integration into SIB Literature services, an application ontology (COVoc) and a triage service named COVTriage and based on annotation processing to search for COVID-related information across pre-defined aspects with daily updates. Thanks to its interoperability potential, COVoc lends itself to wider applications, hopefully through further connections with other novel COVID-19 ontologies as has been established with Coronavirus Infectious Disease Ontology. AVAILABILITY AND IMPLEMENTATION: The data at https://github.com/EBISPOT/covoc and the service at https://candy.hesge.ch/COVTriage.


Asunto(s)
COVID-19 , Humanos , COVID-19/diagnóstico , Triaje , Comercio , Internacionalidad
11.
Nucleic Acids Res ; 51(D1): D1360-D1366, 2023 01 06.
Artículo en Inglés | MEDLINE | ID: mdl-36399494

RESUMEN

PDCM Finder (www.cancermodels.org) is a cancer research platform that aggregates clinical, genomic and functional data from patient-derived xenografts, organoids and cell lines. It was launched in April 2022 as a successor of the PDX Finder portal, which focused solely on patient-derived xenograft models. Currently the portal has over 6200 models across 13 cancer types, including rare paediatric models (17%) and models from minority ethnic backgrounds (33%), making it the largest free to consumer and open access resource of this kind. The PDCM Finder standardises, harmonises and integrates the complex and diverse data associated with PDCMs for the cancer community and displays over 90 million data points across a variety of data types (clinical metadata, molecular and treatment-based). PDCM data is FAIR and underpins the generation and testing of new hypotheses in cancer mechanisms and personalised medicine development.


Asunto(s)
Neoplasias , Humanos , Niño , Neoplasias/genética , Neoplasias/terapia , Organoides , Ensayos Antitumor por Modelo de Xenoinjerto
12.
Nucleic Acids Res ; 51(D1): D1038-D1045, 2023 01 06.
Artículo en Inglés | MEDLINE | ID: mdl-36305825

RESUMEN

The International Mouse Phenotyping Consortium (IMPC; https://www.mousephenotype.org/) web portal makes available curated, integrated and analysed knockout mouse phenotyping data generated by the IMPC project consisting of 85M data points and over 95,000 statistically significant phenotype hits mapped to human diseases. The IMPC portal delivers a substantial reference dataset that supports the enrichment of various domain-specific projects and databases, as well as the wider research and clinical community, where the IMPC genotype-phenotype knowledge contributes to the molecular diagnosis of patients affected by rare disorders. Data from 9,000 mouse lines and 750 000 images provides vital resources enabling the interpretation of the ignorome, and advancing our knowledge on mammalian gene function and the mechanisms underlying phenotypes associated with human diseases. The resource is widely integrated and the lines have been used in over 4,600 publications indicating the value of the data and the materials.


Asunto(s)
Bases de Datos Factuales , Modelos Animales de Enfermedad , Ratones Noqueados , Animales , Humanos , Ratones , Fenotipo
13.
Nucleic Acids Res ; 51(D1): D977-D985, 2023 01 06.
Artículo en Inglés | MEDLINE | ID: mdl-36350656

RESUMEN

The NHGRI-EBI GWAS Catalog (www.ebi.ac.uk/gwas) is a FAIR knowledgebase providing detailed, structured, standardised and interoperable genome-wide association study (GWAS) data to >200 000 users per year from academic research, healthcare and industry. The Catalog contains variant-trait associations and supporting metadata for >45 000 published GWAS across >5000 human traits, and >40 000 full P-value summary statistics datasets. Content is curated from publications or acquired via author submission of prepublication summary statistics through a new submission portal and validation tool. GWAS data volume has vastly increased in recent years. We have updated our software to meet this scaling challenge and to enable rapid release of submitted summary statistics. The scope of the repository has expanded to include additional data types of high interest to the community, including sequencing-based GWAS, gene-based analyses and copy number variation analyses. Community outreach has increased the number of shared datasets from under-represented traits, e.g. cancer, and we continue to contribute to awareness of the lack of population diversity in GWAS. Interoperability of the Catalog has been enhanced through links to other resources including the Polygenic Score Catalog and the International Mouse Phenotyping Consortium, refinements to GWAS trait annotation, and the development of a standard format for GWAS data.


Asunto(s)
Estudio de Asociación del Genoma Completo , Bases del Conocimiento , Animales , Humanos , Ratones , Variaciones en el Número de Copia de ADN , National Human Genome Research Institute (U.S.) , Fenotipo , Polimorfismo de Nucleótido Simple , Programas Informáticos , Estados Unidos
14.
Genome Med ; 14(1): 119, 2022 10 13.
Artículo en Inglés | MEDLINE | ID: mdl-36229886

RESUMEN

BACKGROUND: The diagnostic rate of Mendelian disorders in sequencing studies continues to increase, along with the pace of novel disease gene discovery. However, variant interpretation in novel genes not currently associated with disease is particularly challenging and strategies combining gene functional evidence with approaches that evaluate the phenotypic similarities between patients and model organisms have proven successful. A full spectrum of intolerance to loss-of-function variation has been previously described, providing evidence that gene essentiality should not be considered as a simple and fixed binary property. METHODS: Here we further dissected this spectrum by assessing the embryonic stage at which homozygous loss-of-function results in lethality in mice from the International Mouse Phenotyping Consortium, classifying the set of lethal genes into one of three windows of lethality: early, mid, or late gestation lethal. We studied the correlation between these windows of lethality and various gene features including expression across development, paralogy and constraint metrics together with human disease phenotypes. We explored a gene similarity approach for novel gene discovery and investigated unsolved cases from the 100,000 Genomes Project. RESULTS: We found that genes in the early gestation lethal category have distinct characteristics and are enriched for genes linked with recessive forms of inherited metabolic disease. We identified several genes sharing multiple features with known biallelic forms of inborn errors of the metabolism and found signs of enrichment of biallelic predicted pathogenic variants among early gestation lethal genes in patients recruited under this disease category. We highlight two novel gene candidates with phenotypic overlap between the patients and the mouse knockouts. CONCLUSIONS: Information on the developmental period at which embryonic lethality occurs in the knockout mouse may be used for novel disease gene discovery that helps to prioritise variants in unsolved rare disease cases.


Asunto(s)
Embrión de Mamíferos , Genes Letales , Animales , Femenino , Homocigoto , Humanos , Ratones , Ratones Noqueados , Fenotipo , Embarazo
15.
Pain ; 163(6): 1139-1157, 2022 06 01.
Artículo en Inglés | MEDLINE | ID: mdl-35552317

RESUMEN

ABSTRACT: Identifying the genetic determinants of pain is a scientific imperative given the magnitude of the global health burden that pain causes. Here, we report a genetic screen for nociception, performed under the auspices of the International Mouse Phenotyping Consortium. A biased set of 110 single-gene knockout mouse strains was screened for 1 or more nociception and hypersensitivity assays, including chemical nociception (formalin) and mechanical and thermal nociception (von Frey filaments and Hargreaves tests, respectively), with or without an inflammatory agent (complete Freund's adjuvant). We identified 13 single-gene knockout strains with altered nocifensive behavior in 1 or more assays. All these novel mouse models are openly available to the scientific community to study gene function. Two of the 13 genes (Gria1 and Htr3a) have been previously reported with nociception-related phenotypes in genetically engineered mouse strains and represent useful benchmarking standards. One of the 13 genes (Cnrip1) is known from human studies to play a role in pain modulation and the knockout mouse reported herein can be used to explore this function further. The remaining 10 genes (Abhd13, Alg6, BC048562, Cgnl1, Cp, Mmp16, Oxa1l, Tecpr2, Trim14, and Trim2) reveal novel pathways involved in nociception and may provide new knowledge to better understand genetic mechanisms of inflammatory pain and to serve as models for therapeutic target validation and drug development.


Asunto(s)
Nocicepción , Dolor , Animales , Adyuvante de Freund/toxicidad , Ratones , Ratones Noqueados , Dolor/genética , Dimensión del Dolor
16.
BMC Genomics ; 23(1): 156, 2022 Feb 22.
Artículo en Inglés | MEDLINE | ID: mdl-35193494

RESUMEN

BACKGROUND: Patient-derived xenografts (PDX) mice models play an important role in preclinical trials and personalized medicine. Sharing data on the models is highly valuable for numerous reasons - ethical, economical, research cross validation etc. The EurOPDX Consortium was established 8 years ago to share such information and avoid duplicating efforts in developing new PDX mice models and unify approaches to support preclinical research. EurOPDX Data Portal is the unified data sharing platform adopted by the Consortium. MAIN BODY: In this paper we describe the main features of the EurOPDX Data Portal ( https://dataportal.europdx.eu/ ), its architecture and possible utilization by researchers who look for PDX mice models for their research. The Portal offers a catalogue of European models accessible on a cooperative basis. The models are searchable by metadata, and a detailed view provides molecular profiles (gene expression, mutation, copy number alteration) and treatment studies. The Portal displays the data in multiple tools (PDX Finder, cBioPortal, and GenomeCruzer in future), which are populated from a common database displaying strictly mutually consistent views. (SHORT) CONCLUSION: EurOPDX Data Portal is an entry point to the EurOPDX Research Infrastructure offering PDX mice models for collaborative research, (meta)data describing their features and deep molecular data analysis according to users' interests.


Asunto(s)
Neoplasias , Animales , Xenoinjertos , Humanos , Difusión de la Información , Ratones , Neoplasias/genética , Medicina de Precisión , Ensayos Antitumor por Modelo de Xenoinjerto
17.
Mamm Genome ; 33(1): 135-142, 2022 03.
Artículo en Inglés | MEDLINE | ID: mdl-34524473

RESUMEN

Most current biomedical and protein research focuses only on a small proportion of genes, which results in a lost opportunity to identify new gene-disease associations and explore new opportunities for therapeutic intervention. The International Mouse Phenotyping Consortium (IMPC) focuses on elucidating gene function at scale for poorly characterized and/or under-studied genes. A key component of the IMPC initiative is the implementation of a broad phenotyping pipeline, which is facilitating the discovery of pleiotropy. Characterizing pleiotropy is essential to identify gene-disease associations, and it is of particular importance when elucidating the genetic causes of syndromic disorders. Here we show how the IMPC is effectively uncovering pleiotropy and how the new mouse models and gene function hypotheses generated by the IMPC are increasing our understanding of the mammalian genome, forming the basis of new research and identifying new gene-disease associations.


Asunto(s)
Genoma , Mamíferos , Animales , Modelos Animales de Enfermedad , Ratones , Morbilidad , Fenotipo
18.
Nucleic Acids Res ; 50(D1): D980-D987, 2022 01 07.
Artículo en Inglés | MEDLINE | ID: mdl-34791407

RESUMEN

The European Genome-phenome Archive (EGA - https://ega-archive.org/) is a resource for long term secure archiving of all types of potentially identifiable genetic, phenotypic, and clinical data resulting from biomedical research projects. Its mission is to foster hosted data reuse, enable reproducibility, and accelerate biomedical and translational research in line with the FAIR principles. Launched in 2008, the EGA has grown quickly, currently archiving over 4,500 studies from nearly one thousand institutions. The EGA operates a distributed data access model in which requests are made to the data controller, not to the EGA, therefore, the submitter keeps control on who has access to the data and under which conditions. Given the size and value of data hosted, the EGA is constantly improving its value chain, that is, how the EGA can contribute to enhancing the value of human health data by facilitating its submission, discovery, access, and distribution, as well as leading the design and implementation of standards and methods necessary to deliver the value chain. The EGA has become a key GA4GH Driver Project, leading multiple development efforts and implementing new standards and tools, and has been appointed as an ELIXIR Core Data Resource.


Asunto(s)
Confidencialidad/legislación & jurisprudencia , Genoma Humano , Difusión de la Información/métodos , Fenómica/organización & administración , Investigación Biomédica Traslacional/métodos , Conjuntos de Datos como Asunto , Genotipo , Historia del Siglo XX , Historia del Siglo XXI , Humanos , Difusión de la Información/ética , Metadatos/ética , Metadatos/estadística & datos numéricos , Fenómica/historia , Fenotipo
19.
Nucleic Acids Res ; 50(D1): D1216-D1220, 2022 01 07.
Artículo en Inglés | MEDLINE | ID: mdl-34718739

RESUMEN

The European Variation Archive (EVA; https://www.ebi.ac.uk/eva/) is a resource for sharing all types of genetic variation data (SNPs, indels, and structural variants) for all species. The EVA was created in 2014 to provide FAIR access to genetic variation data and has since grown to be a primary resource for genomic variants hosting >3 billion records. The EVA and dbSNP have established a compatible global system to assign unique identifiers to all submitted genetic variants. The EVA is active within the Global Alliance of Genomics and Health (GA4GH), maintaining, contributing and implementing standards such as VCF, Refget and Variant Representation Specification (VRS). In this article, we describe the submission and permanent accessioning services along with the different ways the data can be retrieved by the scientific community.


Asunto(s)
Biología Computacional , Bases de Datos Genéticas , Variación Genética/genética , Programas Informáticos , Animales , Variación Estructural del Genoma/genética , Genómica , Humanos , Mutación INDEL/genética , Anotación de Secuencia Molecular , Polimorfismo de Nucleótido Simple/genética
20.
Cell Genom ; 1(1)2021 Oct 13.
Artículo en Inglés | MEDLINE | ID: mdl-34870259

RESUMEN

Genome sequencing has recently become a viable genotyping technology for use in genome-wide association studies (GWASs), offering the potential to analyze a broader range of genome-wide variation, including rare variants. To survey current standards, we assessed the content and quality of reporting of statistical methods, analyses, results, and datasets in 167 exome- or genome-wide-sequencing-based GWAS publications published from 2014 to 2020; 81% of publications included tests of aggregate association across multiple variants, with multiple test models frequently used. We observed a lack of standardized terms and incomplete reporting of datasets, particularly for variants analyzed in aggregate tests. We also find a lower frequency of sharing of summary statistics compared with array-based GWASs. Reporting standards and increased data sharing are required to ensure sequencing-based association study data are findable, interoperable, accessible, and reusable (FAIR). To support that, we recommend adopting the standard terminology of sequencing-based GWAS (seqGWAS). Further, we recommend that single-variant analyses be reported following the same standards and conventions as standard array-based GWASs and be shared in the GWAS Catalog. We also provide initial recommended standards for aggregate analyses metadata and summary statistics.

SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...