Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 49
Filtrar
Más filtros

Bases de datos
Tipo del documento
Intervalo de año de publicación
1.
Cell ; 173(2): 400-416.e11, 2018 04 05.
Artículo en Inglés | MEDLINE | ID: mdl-29625055

RESUMEN

For a decade, The Cancer Genome Atlas (TCGA) program collected clinicopathologic annotation data along with multi-platform molecular profiles of more than 11,000 human tumors across 33 different cancer types. TCGA clinical data contain key features representing the democratized nature of the data collection process. To ensure proper use of this large clinical dataset associated with genomic features, we developed a standardized dataset named the TCGA Pan-Cancer Clinical Data Resource (TCGA-CDR), which includes four major clinical outcome endpoints. In addition to detailing major challenges and statistical limitations encountered during the effort of integrating the acquired clinical data, we present a summary that includes endpoint usage recommendations for each cancer type. These TCGA-CDR findings appear to be consistent with cancer genomics studies independent of the TCGA effort and provide opportunities for investigating cancer biology using clinical correlates at an unprecedented scale.


Asunto(s)
Neoplasias/patología , Bases de Datos Genéticas , Genómica , Humanos , Estimación de Kaplan-Meier , Neoplasias/genética , Neoplasias/mortalidad , Modelos de Riesgos Proporcionales
2.
Cell ; 173(2): 338-354.e15, 2018 04 05.
Artículo en Inglés | MEDLINE | ID: mdl-29625051

RESUMEN

Cancer progression involves the gradual loss of a differentiated phenotype and acquisition of progenitor and stem-cell-like features. Here, we provide novel stemness indices for assessing the degree of oncogenic dedifferentiation. We used an innovative one-class logistic regression (OCLR) machine-learning algorithm to extract transcriptomic and epigenetic feature sets derived from non-transformed pluripotent stem cells and their differentiated progeny. Using OCLR, we were able to identify previously undiscovered biological mechanisms associated with the dedifferentiated oncogenic state. Analyses of the tumor microenvironment revealed unanticipated correlation of cancer stemness with immune checkpoint expression and infiltrating immune cells. We found that the dedifferentiated oncogenic phenotype was generally most prominent in metastatic tumors. Application of our stemness indices to single-cell data revealed patterns of intra-tumor molecular heterogeneity. Finally, the indices allowed for the identification of novel targets and possible targeted therapies aimed at tumor differentiation.


Asunto(s)
Desdiferenciación Celular/genética , Aprendizaje Automático , Neoplasias/patología , Carcinogénesis , Metilación de ADN , Bases de Datos Genéticas , Epigénesis Genética , Humanos , MicroARNs/metabolismo , Metástasis de la Neoplasia , Neoplasias/genética , Células Madre/citología , Células Madre/metabolismo , Transcriptoma , Microambiente Tumoral
3.
Cell ; 158(4): 929-944, 2014 Aug 14.
Artículo en Inglés | MEDLINE | ID: mdl-25109877

RESUMEN

Recent genomic analyses of pathologically defined tumor types identify "within-a-tissue" disease subtypes. However, the extent to which genomic signatures are shared across tissues is still unclear. We performed an integrative analysis using five genome-wide platforms and one proteomic platform on 3,527 specimens from 12 cancer types, revealing a unified classification into 11 major subtypes. Five subtypes were nearly identical to their tissue-of-origin counterparts, but several distinct cancer types were found to converge into common subtypes. Lung squamous, head and neck, and a subset of bladder cancers coalesced into one subtype typified by TP53 alterations, TP63 amplifications, and high expression of immune and proliferation pathway genes. Of note, bladder cancers split into three pan-cancer subtypes. The multiplatform classification, while correlated with tissue-of-origin, provides independent information for predicting clinical outcomes. All data sets are available for data-mining from a unified resource to support further biological discoveries and insights into novel therapeutic strategies.


Asunto(s)
Neoplasias/clasificación , Neoplasias/genética , Análisis por Conglomerados , Humanos , Neoplasias/patología , Transcriptoma
4.
J Rheumatol ; 2024 Jul 01.
Artículo en Inglés | MEDLINE | ID: mdl-38879192

RESUMEN

OBJECTIVE: Psoriatic disease remains underdiagnosed and undertreated. We developed and validated a suite of novel, sensor-based smartphone assessments (Psorcast app) that can be self-administered to measure cutaneous and musculoskeletal signs and symptoms of psoriatic disease. METHODS: Participants with psoriasis (PsO) or psoriatic arthritis (PsA) and healthy controls were recruited between June 5, 2019, and November 10, 2021, at 2 academic medical centers. Concordance and accuracy of digital measures and image-based machine learning models were compared to their analogous clinical measures from trained rheumatologists and dermatologists. RESULTS: Of 104 study participants, 51 (49%) were female and 53 (51%) were male, with a mean age of 42.3 years (SD 12.6). Seventy-nine (76%) participants had PsA, 16 (15.4%) had PsO, and 9 (8.7%) were healthy controls. Digital patient assessment of percent body surface area (BSA) affected with PsO demonstrated very strong concordance (Lin concordance correlation coefficient [CCC] 0.94 [95% CI 0.91-0.96]) with physician-assessed BSA. The in-clinic and remote target plaque physician global assessments showed fair-to-moderate concordance (CCCerythema 0.72 [0.59-0.85]; CCCinduration 0.72 [0.62-0.82]; CCCscaling 0.60 [0.48-0.72]). Machine learning models of hand photos taken by patients accurately identified clinically diagnosed nail PsO with an accuracy of 0.76. The Digital Jar Open assessment categorized physician-assessed upper extremity involvement, considering joint tenderness or enthesitis (AUROC 0.68 [0.47-0.85]). CONCLUSION: The Psorcast digital assessments achieved significant clinical validity, although they require further validation in larger cohorts before use in evidence-based medicine or clinical trial settings. The smartphone software and analysis pipelines from the Psorcast suite are open source and freely available.

5.
BMC Med Inform Decis Mak ; 24(1): 57, 2024 Feb 20.
Artículo en Inglés | MEDLINE | ID: mdl-38378636

RESUMEN

BACKGROUND: The two-way partial AUC has been recently proposed as a way to directly quantify partial area under the ROC curve with simultaneous restrictions on the sensitivity and specificity ranges of diagnostic tests or classifiers. The metric, as originally implemented in the tpAUC R package, is estimated using a nonparametric estimator based on a trimmed Mann-Whitney U-statistic, which becomes computationally expensive in large sample sizes. (Its computational complexity is of order [Formula: see text], where [Formula: see text] and [Formula: see text] represent the number of positive and negative cases, respectively). This is problematic since the statistical methodology for comparing estimates generated from alternative diagnostic tests/classifiers relies on bootstrapping resampling and requires repeated computations of the estimator on a large number of bootstrap samples. METHODS: By leveraging the graphical and probabilistic representations of the AUC, partial AUCs, and two-way partial AUC, we derive a novel estimator for the two-way partial AUC, which can be directly computed from the output of any software able to compute AUC and partial AUCs. We implemented our estimator using the computationally efficient pROC R package, which leverages a nonparametric approach using the trapezoidal rule for the computation of AUC and partial AUC scores. (Its computational complexity is of order [Formula: see text], where [Formula: see text].). We compare the empirical bias and computation time of the proposed estimator against the original estimator provided in the tpAUC package in a series of simulation studies and on two real datasets. RESULTS: Our estimator tended to be less biased than the original estimator based on the trimmed Mann-Whitney U-statistic across all experiments (and showed considerably less bias in the experiments based on small sample sizes). But, most importantly, because the computational complexity of the proposed estimator is of order [Formula: see text], rather than [Formula: see text], it is much faster to compute when sample sizes are large. CONCLUSIONS: The proposed estimator provides an improvement for the computation of two-way partial AUC, and allows the comparison of diagnostic tests/machine learning classifiers in large datasets where repeated computations of the original estimator on bootstrap samples become too expensive to compute.


Asunto(s)
Área Bajo la Curva , Humanos , Simulación por Computador
6.
BMC Neurol ; 23(1): 323, 2023 Sep 12.
Artículo en Inglés | MEDLINE | ID: mdl-37700241

RESUMEN

BACKGROUND: Exercise has various health benefits for people with Parkinson's disease (PD). However, implementing exercise into daily life and long-term adherence remain challenging. To increase a sustainable engagement with physical activity of people with PD, interventions that are motivating, accessible, and scalable are needed. We primarily aim to investigate whether a smartphone app (STEPWISE app) can increase physical activity (i.e., step count) in people with PD over one year. Our second aim is to investigate the potential effects of the intervention on physical fitness, and motor- and non-motor function. Our third aim is to explore whether there is a dose-response relationship between volume of physical activity and our secondary endpoints. METHODS: STEPWISE is a double-blind, randomized controlled trial. We aim to include 452 Dutch people with PD who can walk independently (Hoehn & Yahr stages 1-3) and who do not take more than 7,000 steps per day prior to inclusion. Physical activity levels are measured as step counts on the participant's own smartphone and scaled as percentage of each participant's baseline. Participants are randomly assigned to an active control group with an increase of 5-20% (active controls) or any of the three intervention arms with increases of 25-100% (intermediate dose), 50-200% (large dose), or 100-400% (very large dose). The primary endpoint is change in step count as measured by the STEPWISE smartphone app from baseline to 52 weeks. For our primary aim, we will evaluate the between-group difference in average daily step count change from baseline to 52 weeks. For our second aim, measures of physical fitness, and motor- and non-motor function are included. For our third aim, we will associate 52-week changes in step count with 52-week changes in secondary outcomes. DISCUSSION: This trial evaluates the potential of a smartphone-based intervention to increase activity levels in people with PD. We envision that motivational apps will increase adherence to physical activity recommendations and could permit conduct of remote clinical trials of exercise for people with PD or those at risk of PD. TRIAL REGISTRATION: ClinicalTrials.gov; NCT04848077; 19/04/2021. CLINICALTRIALS: gov/ct2/show/NCT04848077.


Asunto(s)
Aplicaciones Móviles , Enfermedad de Parkinson , Humanos , Teléfono Inteligente , Ejercicio Físico , Aptitud Física , Ensayos Clínicos Controlados Aleatorios como Asunto
7.
Bioinformatics ; 35(14): i568-i576, 2019 07 15.
Artículo en Inglés | MEDLINE | ID: mdl-31510680

RESUMEN

MOTIVATION: Late onset Alzheimer's disease is currently a disease with no known effective treatment options. To better understand disease, new multi-omic data-sets have recently been generated with the goal of identifying molecular causes of disease. However, most analytic studies using these datasets focus on uni-modal analysis of the data. Here, we propose a data driven approach to integrate multiple data types and analytic outcomes to aggregate evidences to support the hypothesis that a gene is a genetic driver of the disease. The main algorithmic contributions of our article are: (i) a general machine learning framework to learn the key characteristics of a few known driver genes from multiple feature sets and identifying other potential driver genes which have similar feature representations, and (ii) A flexible ranking scheme with the ability to integrate external validation in the form of Genome Wide Association Study summary statistics. While we currently focus on demonstrating the effectiveness of the approach using different analytic outcomes from RNA-Seq studies, this method is easily generalizable to other data modalities and analysis types. RESULTS: We demonstrate the utility of our machine learning algorithm on two benchmark multiview datasets by significantly outperforming the baseline approaches in predicting missing labels. We then use the algorithm to predict and rank potential drivers of Alzheimer's. We show that our ranked genes show a significant enrichment for single nucleotide polymorphisms associated with Alzheimer's and are enriched in pathways that have been previously associated with the disease. AVAILABILITY AND IMPLEMENTATION: Source code and link to all feature sets is available at https://github.com/Sage-Bionetworks/EvidenceAggregatedDriverRanking.


Asunto(s)
Algoritmos , Enfermedad de Alzheimer , Estudio de Asociación del Genoma Completo , Enfermedad de Alzheimer/genética , Humanos , Aprendizaje Automático , Programas Informáticos
8.
Crit Care Med ; 46(6): 915-925, 2018 06.
Artículo en Inglés | MEDLINE | ID: mdl-29537985

RESUMEN

OBJECTIVES: To find and validate generalizable sepsis subtypes using data-driven clustering. DESIGN: We used advanced informatics techniques to pool data from 14 bacterial sepsis transcriptomic datasets from eight different countries (n = 700). SETTING: Retrospective analysis. SUBJECTS: Persons admitted to the hospital with bacterial sepsis. INTERVENTIONS: None. MEASUREMENTS AND MAIN RESULTS: A unified clustering analysis across 14 discovery datasets revealed three subtypes, which, based on functional analysis, we termed "Inflammopathic, Adaptive, and Coagulopathic." We then validated these subtypes in nine independent datasets from five different countries (n = 600). In both discovery and validation data, the Adaptive subtype is associated with a lower clinical severity and lower mortality rate, and the Coagulopathic subtype is associated with higher mortality and clinical coagulopathy. Further, these clusters are statistically associated with clusters derived by others in independent single sepsis cohorts. CONCLUSIONS: The three sepsis subtypes may represent a unifying framework for understanding the molecular heterogeneity of the sepsis syndrome. Further study could potentially enable a precision medicine approach of matching novel immunomodulatory therapies with septic patients most likely to benefit.


Asunto(s)
Perfilación de la Expresión Génica , Sepsis/genética , Inmunidad Adaptativa/genética , Adolescente , Adulto , Anciano , Trastornos de la Coagulación Sanguínea/genética , Análisis por Conglomerados , Conjuntos de Datos como Asunto , Femenino , Humanos , Inmunidad Innata/genética , Inflamación/genética , Masculino , Persona de Mediana Edad , Estudios Retrospectivos , Sepsis/microbiología , Adulto Joven
9.
Am J Hum Genet ; 91(4): 660-71, 2012 Oct 05.
Artículo en Inglés | MEDLINE | ID: mdl-23040495

RESUMEN

Full sequencing of individual human genomes has greatly expanded our understanding of human genetic variation and population history. Here, we present a systematic analysis of 50 human genomes from 11 diverse global populations sequenced at high coverage. Our sample includes 12 individuals who have admixed ancestry and who have varying degrees of recent (within the last 500 years) African, Native American, and European ancestry. We found over 21 million single-nucleotide variants that contribute to a 1.75-fold range in nucleotide heterozygosity across diverse human genomes. This heterozygosity ranged from a high of one heterozygous site per kilobase in west African genomes to a low of 0.57 heterozygous sites per kilobase in segments inferred to have diploid Native American ancestry from the genomes of Mexican and Puerto Rican individuals. We show evidence of all three continental ancestries in the genomes of Mexican, Puerto Rican, and African American populations, and the genome-wide statistics are highly consistent across individuals from a population once ancestry proportions have been accounted for. Using a generalized linear model, we identified subtle variations across populations in the proportion of neutral versus deleterious variation and found that genome-wide statistics vary in admixed populations even once ancestry proportions have been factored in. We further infer that multiple periods of gene flow shaped the diversity of admixed populations in the Americas-70% of the European ancestry in today's African Americans dates back to European gene flow happening only 7-8 generations ago.


Asunto(s)
Genoma Humano , Haplotipos/genética , Población/genética , Grupos Raciales/genética , Genética de Población/métodos , Heterocigoto , Humanos , Polimorfismo de Nucleótido Simple
10.
PLoS Genet ; 8(4): e1002641, 2012.
Artículo en Inglés | MEDLINE | ID: mdl-22570615

RESUMEN

African Pygmy groups show a distinctive pattern of phenotypic variation, including short stature, which is thought to reflect past adaptation to a tropical environment. Here, we analyze Illumina 1M SNP array data in three Western Pygmy populations from Cameroon and three neighboring Bantu-speaking agricultural populations with whom they have admixed. We infer genome-wide ancestry, scan for signals of positive selection, and perform targeted genetic association with measured height variation. We identify multiple regions throughout the genome that may have played a role in adaptive evolution, many of which contain loci with roles in growth hormone, insulin, and insulin-like growth factor signaling pathways, as well as immunity and neuroendocrine signaling involved in reproduction and metabolism. The most striking results are found on chromosome 3, which harbors a cluster of selection and association signals between approximately 45 and 60 Mb. This region also includes the positional candidate genes DOCK3, which is known to be associated with height variation in Europeans, and CISH, a negative regulator of cytokine signaling known to inhibit growth hormone-stimulated STAT5 signaling. Finally, pathway analysis for genes near the strongest signals of association with height indicates enrichment for loci involved in insulin and insulin-like growth factor signaling.


Asunto(s)
Evolución Biológica , Estatura/genética , Enanismo , Etnicidad/genética , Adaptación Biológica , África Occidental , Población Negra , Mapeo Cromosómico , Enanismo/genética , Estudios de Asociación Genética , Genoma Humano , Hormona del Crecimiento/genética , Factores de Intercambio de Guanina Nucleótido/genética , Humanos , Factor I del Crecimiento Similar a la Insulina/genética , Proteínas del Tejido Nervioso/genética , Polimorfismo de Nucleótido Simple , Selección Genética , Proteínas Supresoras de la Señalización de Citocinas/genética
11.
J Clin Exp Neuropsychol ; : 1-10, 2024 May 16.
Artículo en Inglés | MEDLINE | ID: mdl-38753819

RESUMEN

INTRODUCTION: Arranging Pictures is a new episodic memory test based on the NIH Toolbox (NIHTB) Picture Sequence Memory measure and optimized for self-administration on a personal smartphone within the Mobile Toolbox (MTB). We describe evidence from three distinct validation studies. METHOD: In Study 1, 92 participants self-administered Arranging Pictures on study-provided smartphones in the lab and were administered external measures of similar and dissimilar constructs by trained examiners to assess validity under controlled circumstances. In Study 2, 1,021 participants completed the external measures in the lab and self-administered Arranging Pictures remotely on their personal smartphones to assess validity in real-world contexts. In Study 3, 141 participants self-administered Arranging Pictures remotely twice with a two-week delay on personal iOS smartphones to assess test-retest reliability and practice effects. RESULTS: Internal consistency was good across samples (ρxx = .80 to .85, p < .001). Test-retest reliability was marginal (ICC = .49, p < .001) and there were significant practice effects after a two-week delay (ΔM = 3.21 (95% CI [2.56, 3.88]). As expected, correlations with convergent measures were significant and moderate to large in magnitude (ρ = .44 to .76, p < .001), while correlations with discriminant measures were small (ρ = .23 to .27, p < .05) or nonsignificant. Scores demonstrated significant negative correlations with age (ρ = -.32 to -.21, p < .001). Mean performance was slightly higher in the iOS compared to the Android group (MiOS = 18.80, NiOS = 635; MAndroid = 17.11, NAndroid = 386; t(757.73) = 4.17, p < .001), but device type did not significantly influence the psychometric properties of the measure. Indicators of potential cheating were mixed; average scores were significantly higher in the remote samples (F(2, 850) = 11.415, p < .001), but there were not significantly more perfect scores. CONCLUSION: The MTB Arranging Pictures measure demonstrated evidence of reliability and validity when self-administered on personal device. Future research should examine the potential for cheating in remote settings and the properties of the measure in clinical samples.

12.
Artículo en Inglés | MEDLINE | ID: mdl-38414411

RESUMEN

OBJECTIVE: We describe the development of a new computer adaptive vocabulary test, Mobile Toolbox (MTB) Word Meaning, and validity evidence from 3 studies. METHOD: Word Meaning was designed to be a multiple-choice synonym test optimized for self-administration on a personal smartphone. The items were first calibrated online in a sample of 7,525 participants to create the computer-adaptive test algorithm for the Word Meaning measure within the MTB app. In Study 1, 92 participants self-administered Word Meaning on study-provided smartphones in the lab and were administered external measures by trained examiners. In Study 2, 1,021 participants completed the external measures in the lab and Word Meaning was self-administered remotely on their personal smartphones. In Study 3, 141 participants self-administered Word Meaning remotely twice with a 2-week delay on personal iPhones. RESULTS: The final bank included 1363 items. Internal consistency was adequate to good across samples (ρxx = 0.78 to 0.81, p < .001). Test-retest reliability was good (ICC = 0.65, p < .001), and the mean theta score was not significantly different upon the second administration. Correlations were moderate to large with measures of similar constructs (ρ = 0.67-0.75, p < .001) and non-significant with measures of dissimilar constructs. Scores demonstrated small to moderate correlations with age (ρ = 0.35 to 0.45, p < .001) and education (ρ = 0.26, p < .001). CONCLUSION: The MTB Word Meaning measure demonstrated evidence of reliability and validity in three samples. Further validation studies in clinical samples are necessary.

13.
PLOS Digit Health ; 2(3): e0000208, 2023 Mar.
Artículo en Inglés | MEDLINE | ID: mdl-36976789

RESUMEN

One of the promising opportunities of digital health is its potential to lead to more holistic understandings of diseases by interacting with the daily life of patients and through the collection of large amounts of real-world data. Validating and benchmarking indicators of disease severity in the home setting is difficult, however, given the large number of confounders present in the real world and the challenges in collecting ground truth data in the home. Here we leverage two datasets collected from patients with Parkinson's disease, which couples continuous wrist-worn accelerometer data with frequent symptom reports in the home setting, to develop digital biomarkers of symptom severity. Using these data, we performed a public benchmarking challenge in which participants were asked to build measures of severity across 3 symptoms (on/off medication, dyskinesia, and tremor). 42 teams participated and performance was improved over baseline models for each subchallenge. Additional ensemble modeling across submissions further improved performance, and the top models validated in a subset of patients whose symptoms were observed and rated by trained clinicians.

14.
BMC Genomics ; 13: 82, 2012 Feb 29.
Artículo en Inglés | MEDLINE | ID: mdl-22375630

RESUMEN

BACKGROUND: The small airway epithelium (SAE), the cell population that covers the human airway surface from the 6th generation of airway branching to the alveoli, is the major site of lung disease caused by smoking. The focus of this study is to provide quantitative assessment of the SAE transcriptome in the resting state and in response to chronic cigarette smoking using massive parallel mRNA sequencing (RNA-Seq). RESULTS: The data demonstrate that 48% of SAE expressed genes are ubiquitous, shared with many tissues, with 52% enriched in this cell population. The most highly expressed gene, SCGB1A1, is characteristic of Clara cells, the cell type unique to the human SAE. Among other genes expressed by the SAE are those related to Clara cell differentiation, secretory mucosal defense, and mucociliary differentiation. The high sensitivity of RNA-Seq permitted quantification of gene expression related to infrequent cell populations such as neuroendocrine cells and epithelial stem/progenitor cells. Quantification of the absolute smoking-induced changes in SAE gene expression revealed that, compared to ubiquitous genes, more SAE-enriched genes responded to smoking with up-regulation, and those with the highest basal expression levels showed most dramatic changes. Smoking had no effect on SAE gene splicing, but was associated with a shift in molecular pattern from Clara cell-associated towards the mucus-secreting cell differentiation pathway with multiple features of cancer-associated molecular phenotype. CONCLUSIONS: These observations provide insights into the unique biology of human SAE by providing quantitative assessment of the global transcriptome under physiological conditions and in response to the stress of chronic cigarette smoking.


Asunto(s)
Perfilación de la Expresión Génica , Mucosa Respiratoria/metabolismo , Transcriptoma , Empalme Alternativo , Humanos , Masculino , Anotación de Secuencia Molecular , Familia de Multigenes , Especificidad de Órganos/genética , Análisis de Secuencia de ARN , Fumar/efectos adversos , Ubiquitinación/genética
15.
BMC Genet ; 13: 49, 2012 Jun 26.
Artículo en Inglés | MEDLINE | ID: mdl-22734698

RESUMEN

BACKGROUND: Populations of the Arabian Peninsula have a complex genetic structure that reflects waves of migrations including the earliest human migrations from Africa and eastern Asia, migrations along ancient civilization trading routes and colonization history of recent centuries. RESULTS: Here, we present a study of genome-wide admixture in this region, using 156 genotyped individuals from Qatar, a country located at the crossroads of these migration patterns. Since haplotypes of these individuals could have originated from many different populations across the world, we have developed a machine learning method "SupportMix" to infer loci-specific genomic ancestry when simultaneously analyzing many possible ancestral populations. Simulations show that SupportMix is not only more accurate than other popular admixture discovery tools but is the first admixture inference method that can efficiently scale for simultaneous analysis of 50-100 putative ancestral populations while being independent of prior demographic information. CONCLUSIONS: By simultaneously using the 55 world populations from the Human Genome Diversity Panel, SupportMix was able to extract the fine-scale ancestry of the Qatar population, providing many new observations concerning the ancestry of the region. For example, as well as recapitulating the three major sub-populations in Qatar, composed of mainly Arabic, Persian, and African ancestry, SupportMix additionally identifies the specific ancestry of the Persian group to populations sampled in Greater Persia rather than from China and the ancestry of the African group to sub-Saharan origin and not Southern African Bantu origin as previously thought.


Asunto(s)
Genética de Población , Genoma Humano , Bases de Datos Factuales , Genotipo , Haplotipos , Migración Humana , Humanos , Desequilibrio de Ligamiento , Cadenas de Markov , Qatar , Máquina de Vectores de Soporte
16.
Hum Biol ; 84(4): 343-64, 2012 Aug.
Artículo en Inglés | MEDLINE | ID: mdl-23249312

RESUMEN

Identifying ancestry along each chromosome in admixed individuals provides a wealth of information for understanding the population genetic history of admixture events and is valuable for admixture mapping and identifying recent targets of selection. We present PCAdmix (available at https://sites.google.com/site/pcadmix/home ), a Principal Components-based algorithm for determining ancestry along each chromosome from a high-density, genome-wide set of phased single-nucleotide polymorphism (SNP) genotypes of admixed individuals. We compare our method to HAPMIX on simulated data from two ancestral populations, and we find high concordance between the methods. Our method also has better accuracy than LAMP when applied to three-population admixture, a situation as yet unaddressed by HAPMIX. Finally, we apply our method to a data set of four Latino populations with European, African, and Native American ancestry. We find evidence of assortative mating in each of the four populations, and we identify regions of shared ancestry that may be recent targets of selection and could serve as candidate regions for admixture-based association mapping.


Asunto(s)
Cromosomas Humanos , Genotipo , Modelos Genéticos , Polimorfismo de Nucleótido Simple , Dinámica Poblacional , Análisis de Componente Principal/métodos , Grupos Raciales/genética , Algoritmos , Simulación por Computador , Genómica , Humanos , Filogeografía , Estados Unidos
17.
Digit Biomark ; 6(1): 1-8, 2022.
Artículo en Inglés | MEDLINE | ID: mdl-35224425

RESUMEN

BACKGROUND: Smartphones can generate objective measures of Parkinson's disease (PD) and supplement traditional in-person rating scales. However, smartphone use in clinical trials has been limited. OBJECTIVE: This study aimed to determine the feasibility of introducing a smartphone research application into a PD clinical trial and to evaluate the resulting measures. METHODS: A smartphone application was introduced part-way into a phase 3 randomized clinical trial of inosine. The application included finger tapping, gait, and cognition tests, and participants were asked to complete an assessment battery at home and in clinic alongside the Movement Disorder Society-Unified Parkinson's Disease Rating Scale (MDS-UPDRS). RESULTS: Of 236 eligible participants in the parent study, 88 (37%) consented to participate, and 59 (27 randomized to inosine and 32 to placebo) completed a baseline smartphone assessment. These 59 participants collectively completed 1,292 batteries of assessments. The proportion of participants who completed at least one smartphone assessment was 61% at 3, 54% at 6, and 35% at 12 months. Finger tapping speed correlated weakly with the part III motor portion (r = -0.16, left hand; r = -0.04, right hand) and total (r = -0.14) MDS-UPDRS. Gait speed correlated better with the same measures (r = -0.25, part III motor; r = -0.34, total). Over 6 months, finger tapping speed, gait speed, and memory scores did not differ between those randomized to active drug or placebo. CONCLUSIONS: Introducing a smartphone application midway into a phase 3 clinical trial was challenging. Measures of bradykinesia and gait speed correlated modestly with traditional outcomes and were consistent with the study's overall findings, which found no benefit of the active drug.

18.
Nat Commun ; 13(1): 7609, 2022 Dec 09.
Artículo en Inglés | MEDLINE | ID: mdl-36494374

RESUMEN

Synthetic health data have the potential to mitigate privacy concerns in supporting biomedical research and healthcare applications. Modern approaches for data generation continue to evolve and demonstrate remarkable potential. Yet there is a lack of a systematic assessment framework to benchmark methods as they emerge and determine which methods are most appropriate for which use cases. In this work, we introduce a systematic benchmarking framework to appraise key characteristics with respect to utility and privacy metrics. We apply the framework to evaluate synthetic data generation methods for electronic health records data from two large academic medical centers with respect to several use cases. The results illustrate that there is a utility-privacy tradeoff for sharing synthetic health data and further indicate that no method is unequivocally the best on all criteria in each use case, which makes it evident why synthetic data generation methods need to be assessed in context.


Asunto(s)
Investigación Biomédica , Registros Electrónicos de Salud , Privacidad , Benchmarking
19.
PLoS One ; 17(8): e0271766, 2022.
Artículo en Inglés | MEDLINE | ID: mdl-35925980

RESUMEN

Ideally, a patient's response to medication can be monitored by measuring changes in performance of some activity. In observational studies, however, any detected association between treatment ("on-medication" vs "off-medication") and the outcome (performance in the activity) might be due to confounders. In particular, causal inferences at the personalized level are especially vulnerable to confounding effects that arise in a cyclic fashion. For quick acting medications, effects can be confounded by circadian rhythms and daily routines. Using the time-of-the-day as a surrogate for these confounders and the performance measurements as captured on a smartphone, we propose a personalized statistical approach to disentangle putative treatment and "time-of-the-day" effects, that leverages conditional independence relations spanned by causal graphical models involving the treatment, time-of-the-day, and outcome variables. Our approach is based on conditional independence tests implemented via standard and temporal linear regression models. Using synthetic data, we investigate when and how residual autocorrelation can affect the standard tests, and how time series modeling (namely, ARIMA and robust regression via HAC covariance matrix estimators) can remedy these issues. In particular, our simulations illustrate that when patients perform their activities in a paired fashion, positive autocorrelation can lead to conservative results for the standard regression approach (i.e., lead to deflated true positive detection), whereas negative autocorrelation can lead to anticonservative behavior (i.e., lead to inflated false positive detection). The adoption of time series methods, on the other hand, leads to well controlled type I error rates. We illustrate the application of our methodology with data from a Parkinson's disease mobile health study.


Asunto(s)
Medicina de Precisión , Telemedicina , Causalidad , Humanos , Modelos Lineales , Teléfono Inteligente
20.
Nat Biotechnol ; 40(4): 480-487, 2022 04.
Artículo en Inglés | MEDLINE | ID: mdl-34373643

RESUMEN

Remote health assessments that gather real-world data (RWD) outside clinic settings require a clear understanding of appropriate methods for data collection, quality assessment, analysis and interpretation. Here we examine the performance and limitations of smartphones in collecting RWD in the remote mPower observational study of Parkinson's disease (PD). Within the first 6 months of study commencement, 960 participants had enrolled and performed at least five self-administered active PD symptom assessments (speeded tapping, gait/balance, phonation or memory). Task performance, especially speeded tapping, was predictive of self-reported PD status (area under the receiver operating characteristic curve (AUC) = 0.8) and correlated with in-clinic evaluation of disease severity (r = 0.71; P < 1.8 × 10-6) when compared with motor Movement Disorder Society-Unified Parkinson's Disease Rating Scale (MDS-UPDRS). Although remote assessment requires careful consideration for accurate interpretation of RWD, our results support the use of smartphones and wearables in objective and personalized disease assessments.


Asunto(s)
Enfermedad de Parkinson , Teléfono Inteligente , Marcha , Humanos , Movimiento , Enfermedad de Parkinson/diagnóstico , Índice de Severidad de la Enfermedad
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA