Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 5 de 5
Filtrar
1.
Nat Med ; 26(10): 1654-1662, 2020 10.
Artículo en Inglés | MEDLINE | ID: mdl-32839619

RESUMEN

Differences in cardiac and aortic structure and function are associated with cardiovascular diseases and a wide range of other types of disease. Here we analyzed cardiovascular magnetic resonance images from a population-based study, the UK Biobank, using an automated machine-learning-based analysis pipeline. We report a comprehensive range of structural and functional phenotypes for the heart and aorta across 26,893 participants, and explore how these phenotypes vary according to sex, age and major cardiovascular risk factors. We extended this analysis with a phenome-wide association study, in which we tested for correlations of a wide range of non-imaging phenotypes of the participants with imaging phenotypes. We further explored the associations of imaging phenotypes with early-life factors, mental health and cognitive function using both observational analysis and Mendelian randomization. Our study illustrates how population-based cardiac and aortic imaging phenotypes can be used to better define cardiovascular disease risks as well as heart-brain health interactions, highlighting new opportunities for studying disease mechanisms and developing image-based biomarkers.


Asunto(s)
Aorta/anatomía & histología , Aorta/fisiología , Corazón/anatomía & histología , Corazón/fisiología , Fenómica , Factores de Edad , Anatomía Transversal , Aorta/diagnóstico por imagen , Aorta/patología , Bancos de Muestras Biológicas/estadística & datos numéricos , Enfermedades Cardiovasculares/diagnóstico por imagen , Enfermedades Cardiovasculares/genética , Enfermedades Cardiovasculares/patología , Enfermedades Cardiovasculares/fisiopatología , Femenino , Predisposición Genética a la Enfermedad , Estudio de Asociación del Genoma Completo , Corazón/diagnóstico por imagen , Pruebas de Función Cardíaca , Humanos , Procesamiento de Imagen Asistido por Computador/métodos , Aprendizaje Automático , Imagen por Resonancia Magnética/estadística & datos numéricos , Masculino , Miocardio/patología , Fenómica/métodos , Fenotipo , Polimorfismo de Nucleótido Simple , Factores Sexuales , Relación Estructura-Actividad , Reino Unido/epidemiología
2.
JAMA Netw Open ; 2(12): e1917257, 2019 12 02.
Artículo en Inglés | MEDLINE | ID: mdl-31825506

RESUMEN

Importance: Identifying brain regions associated with risk factors for dementia could guide mechanistic understanding of risk factors associated with Alzheimer disease (AD). Objectives: To characterize volume changes in brain regions associated with aging and modifiable risk factors for dementia (MRFD) and to test whether volume differences in these regions are associated with cognitive performance. Design, Setting, and Participants: This cross-sectional study used data from UK Biobank participants who underwent T1-weighted structural brain imaging from August 5, 2014, to October 14, 2016. A voxelwise linear model was applied to test for regional gray matter volume differences associated with aging and MRFD (ie, hypertension, diabetes, obesity, and frequent alcohol use). The potential clinical relevance of these associations was explored by comparing their neuroanatomical distributions with the regional brain atrophy found with AD. Mediation models for risk factors, brain volume differences, and cognitive measures were tested. The primary hypothesis was that common, overlapping regions would be found. Primary analysis was conducted on April 1, 2018. Main Outcomes and Measures: Gray matter regions that showed relative atrophy associated with AD, aging, and greater numbers of MRFD. Results: Among 8312 participants (mean [SD] age, 62.4 [7.4] years; 3959 [47.1%] men), aging and 4 major MRFD (ie, hypertension, diabetes, obesity, and frequent alcohol use) had independent negative associations with specific gray matter volumes. These regions overlapped neuroanatomically with those showing lower volumes in participants with AD, including the posterior cingulate cortex, the thalamus, the hippocampus, and the orbitofrontal cortex. Associations between these MRFD and spatial memory were mediated by differences in posterior cingulate cortex volume (ß = 0.0014; SE = 0.0006; P = .02). Conclusions and Relevance: This cross-sectional study identified differences in localized brain gray matter volume associated with aging and MRFD, suggesting regional vulnerabilities. These differences appeared relevant to cognitive performance even among people considered cognitively healthy.


Asunto(s)
Envejecimiento/patología , Encéfalo/patología , Cognición , Demencia/etiología , Imagen por Resonancia Magnética , Adulto , Anciano , Envejecimiento/psicología , Enfermedad de Alzheimer/diagnóstico por imagen , Enfermedad de Alzheimer/etiología , Enfermedad de Alzheimer/patología , Enfermedad de Alzheimer/psicología , Encéfalo/diagnóstico por imagen , Estudios Transversales , Demencia/diagnóstico por imagen , Demencia/patología , Demencia/psicología , Femenino , Sustancia Gris/patología , Humanos , Modelos Lineales , Masculino , Persona de Mediana Edad , Tamaño de los Órganos , Factores de Riesgo
3.
Sci Data ; 6(1): 149, 2019 08 13.
Artículo en Inglés | MEDLINE | ID: mdl-31409798

RESUMEN

Biomedical informatics has traditionally adopted a linear view of the informatics process (collect, store and analyse) in translational medicine (TM) studies; focusing primarily on the challenges in data integration and analysis. However, a data management challenge presents itself with the new lifecycle view of data emphasized by the recent calls for data re-use, long term data preservation, and data sharing. There is currently a lack of dedicated infrastructure focused on the 'manageability' of the data lifecycle in TM research between data collection and analysis. Current community efforts towards establishing a culture for open science prompt the creation of a data custodianship environment for management of TM data assets to support data reuse and reproducibility of research results. Here we present the development of a lifecycle-based methodology to create a metadata management framework based on community driven standards for standardisation, consolidation and integration of TM research data. Based on this framework, we also present the development of a new platform (PlatformTM) focused on managing the lifecycle for translational research data assets.


Asunto(s)
Difusión de la Información , Informática Médica , Investigación Biomédica Traslacional , Humanos , Metadatos , Interfaz Usuario-Computador
4.
BMC Genomics ; 15 Suppl 8: S3, 2014.
Artículo en Inglés | MEDLINE | ID: mdl-25435347

RESUMEN

BACKGROUND: High-throughput transcriptomic data generated by microarray experiments is the most abundant and frequently stored kind of data currently used in translational medicine studies. Although microarray data is supported in data warehouses such as tranSMART, when querying relational databases for hundreds of different patient gene expression records queries are slow due to poor performance. Non-relational data models, such as the key-value model implemented in NoSQL databases, hold promise to be more performant solutions. Our motivation is to improve the performance of the tranSMART data warehouse with a view to supporting Next Generation Sequencing data. RESULTS: In this paper we introduce a new data model better suited for high-dimensional data storage and querying, optimized for database scalability and performance. We have designed a key-value pair data model to support faster queries over large-scale microarray data and implemented the model using HBase, an implementation of Google's BigTable storage system. An experimental performance comparison was carried out against the traditional relational data model implemented in both MySQL Cluster and MongoDB, using a large publicly available transcriptomic data set taken from NCBI GEO concerning Multiple Myeloma. Our new key-value data model implemented on HBase exhibits an average 5.24-fold increase in high-dimensional biological data query performance compared to the relational model implemented on MySQL Cluster, and an average 6.47-fold increase on query performance on MongoDB. CONCLUSIONS: The performance evaluation found that the new key-value data model, in particular its implementation in HBase, outperforms the relational model currently implemented in tranSMART. We propose that NoSQL technology holds great promise for large-scale data management, in particular for high-dimensional biological data such as that demonstrated in the performance evaluation described in this paper. We aim to use this new data model as a basis for migrating tranSMART's implementation to a more scalable solution for Big Data.


Asunto(s)
Sistemas de Administración de Bases de Datos , Bases de Datos Genéticas , Almacenamiento y Recuperación de la Información/métodos , Transcriptoma , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Informática Médica , Mieloma Múltiple/genética , Mieloma Múltiple/metabolismo , Análisis de Secuencia por Matrices de Oligonucleótidos
5.
BMC Bioinformatics ; 15: 351, 2014 Nov 05.
Artículo en Inglés | MEDLINE | ID: mdl-25371114

RESUMEN

BACKGROUND: High-throughput molecular profiling data has been used to improve clinical decision making by stratifying subjects based on their molecular profiles. Unsupervised clustering algorithms can be used for stratification purposes. However, the current speed of the clustering algorithms cannot meet the requirement of large-scale molecular data due to poor performance of the correlation matrix calculation. With high-throughput sequencing technologies promising to produce even larger datasets per subject, we expect the performance of the state-of-the-art statistical algorithms to be further impacted unless efforts towards optimisation are carried out. MapReduce is a widely used high performance parallel framework that can solve the problem. RESULTS: In this paper, we evaluate the current parallel modes for correlation calculation methods and introduce an efficient data distribution and parallel calculation algorithm based on MapReduce to optimise the correlation calculation. We studied the performance of our algorithm using two gene expression benchmarks. In the micro-benchmark, our implementation using MapReduce, based on the R package RHIPE, demonstrates a 3.26-5.83 fold increase compared to the default Snowfall and 1.56-1.64 fold increase compared to the basic RHIPE in the Euclidean, Pearson and Spearman correlations. Though vanilla R and the optimised Snowfall outperforms our optimised RHIPE in the micro-benchmark, they do not scale well with the macro-benchmark. In the macro-benchmark the optimised RHIPE performs 2.03-16.56 times faster than vanilla R. Benefiting from the 3.30-5.13 times faster data preparation, the optimised RHIPE performs 1.22-1.71 times faster than the optimised Snowfall. Both the optimised RHIPE and the optimised Snowfall successfully performs the Kendall correlation with TCGA dataset within 7 hours. Both of them conduct more than 30 times faster than the estimated vanilla R. CONCLUSIONS: The performance evaluation found that the new MapReduce algorithm and its implementation in RHIPE outperforms vanilla R and the conventional parallel algorithms implemented in R Snowfall. We propose that MapReduce framework holds great promise for large molecular data analysis, in particular for high-dimensional genomic data such as that demonstrated in the performance evaluation described in this paper. We aim to use this new algorithm as a basis for optimising high-throughput molecular data correlation calculation for Big Data.


Asunto(s)
Algoritmos , Perfilación de la Expresión Génica/métodos , Análisis por Conglomerados , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Programas Informáticos
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA