Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 388
Filtrar
Más filtros

Banco de datos
Tipo del documento
Intervalo de año de publicación
1.
N Engl J Med ; 385(12): 1078-1090, 2021 09 16.
Artículo en Inglés | MEDLINE | ID: mdl-34432976

RESUMEN

BACKGROUND: Preapproval trials showed that messenger RNA (mRNA)-based vaccines against severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) had a good safety profile, yet these trials were subject to size and patient-mix limitations. An evaluation of the safety of the BNT162b2 mRNA vaccine with respect to a broad range of potential adverse events is needed. METHODS: We used data from the largest health care organization in Israel to evaluate the safety of the BNT162b2 mRNA vaccine. For each potential adverse event, in a population of persons with no previous diagnosis of that event, we individually matched vaccinated persons to unvaccinated persons according to sociodemographic and clinical variables. Risk ratios and risk differences at 42 days after vaccination were derived with the use of the Kaplan-Meier estimator. To place these results in context, we performed a similar analysis involving SARS-CoV-2-infected persons matched to uninfected persons. The same adverse events were studied in the vaccination and SARS-CoV-2 infection analyses. RESULTS: In the vaccination analysis, the vaccinated and control groups each included a mean of 884,828 persons. Vaccination was most strongly associated with an elevated risk of myocarditis (risk ratio, 3.24; 95% confidence interval [CI], 1.55 to 12.44; risk difference, 2.7 events per 100,000 persons; 95% CI, 1.0 to 4.6), lymphadenopathy (risk ratio, 2.43; 95% CI, 2.05 to 2.78; risk difference, 78.4 events per 100,000 persons; 95% CI, 64.1 to 89.3), appendicitis (risk ratio, 1.40; 95% CI, 1.02 to 2.01; risk difference, 5.0 events per 100,000 persons; 95% CI, 0.3 to 9.9), and herpes zoster infection (risk ratio, 1.43; 95% CI, 1.20 to 1.73; risk difference, 15.8 events per 100,000 persons; 95% CI, 8.2 to 24.2). SARS-CoV-2 infection was associated with a substantially increased risk of myocarditis (risk ratio, 18.28; 95% CI, 3.95 to 25.12; risk difference, 11.0 events per 100,000 persons; 95% CI, 5.6 to 15.8) and of additional serious adverse events, including pericarditis, arrhythmia, deep-vein thrombosis, pulmonary embolism, myocardial infarction, intracranial hemorrhage, and thrombocytopenia. CONCLUSIONS: In this study in a nationwide mass vaccination setting, the BNT162b2 vaccine was not associated with an elevated risk of most of the adverse events examined. The vaccine was associated with an excess risk of myocarditis (1 to 5 events per 100,000 persons). The risk of this potentially serious adverse event and of many other serious adverse events was substantially increased after SARS-CoV-2 infection. (Funded by the Ivan and Francesca Berkowitz Family Living Laboratory Collaboration at Harvard Medical School and Clalit Research Institute.).


Asunto(s)
Vacunas contra la COVID-19/efectos adversos , COVID-19/complicaciones , Enfermedades Cardiovasculares/etiología , Miocarditis/etiología , Adolescente , Adulto , Anciano , Anciano de 80 o más Años , Apendicitis/etiología , Vacuna BNT162 , Enfermedades Cardiovasculares/epidemiología , Femenino , Herpes Zóster/etiología , Humanos , Israel , Estimación de Kaplan-Meier , Linfadenopatía/etiología , Masculino , Persona de Mediana Edad , Miocarditis/epidemiología , Riesgo , Factores de Riesgo , Adulto Joven
2.
J Am Soc Nephrol ; 34(2): 309-321, 2023 02 01.
Artículo en Inglés | MEDLINE | ID: mdl-36368777

RESUMEN

BACKGROUND: The National Kidney Foundation and American Society of Nephrology Task Force on Reassessing the Inclusion of Race in Diagnosing Kidney Disease recently recommended a new race-free creatinine-based equation for eGFR. The effect on recommended clinical care across race and ethnicity groups is unknown. METHODS: We analyzed nationally representative cross-sectional questionnaires and medical examinations from 44,360 participants collected between 2001 and 2018 by the National Health and Nutrition Examination Survey. We quantified the number and proportion of Black, White, Hispanic, and Asian/Other adults with guideline-recommended changes in care. RESULTS: The new equation, if applied nationally, could assign new CKD diagnoses to 434,000 (95% confidence interval [CI], 350,000 to 517,000) Black adults, reclassify 584,000 (95% CI, 508,000 to 667,000) to more advanced stages of CKD, restrict kidney donation eligibility for 246,000 (95% CI, 189,000 to 303,000), expand nephrologist referrals for 41,800 (95% CI, 19,800 to 63,800), and reduce medication dosing for 222,000 (95% CI, 169,000 to 275,000). Among non-Black adults, these changes may undo CKD diagnoses for 5.51 million (95% CI, 4.86 million to 6.16 million), reclassify 4.59 million (95% CI, 4.28 million to 4.92 million) to less advanced stages of CKD, expand kidney donation eligibility for 3.96 million (95% CI, 3.46 million to 4.46 million), reverse nephrologist referral for 75,800 (95% CI, 35,400 to 116,000), and reverse medication dose reductions for 1.47 million (95% CI, 1.22 million to 1.73 million). The racial and ethnic mix of the populations used to develop eGFR equations has a substantial effect on potential care changes. CONCLUSION: The newly recommended 2021 CKD-EPI creatinine-based eGFR equation may result in substantial changes to recommended care for US patients of all racial and ethnic groups.


Asunto(s)
Insuficiencia Renal Crónica , Adulto , Humanos , Creatinina , Tasa de Filtración Glomerular , Encuestas Nutricionales , Estudios Transversales , Insuficiencia Renal Crónica/diagnóstico
3.
Lancet ; 398(10316): 2093-2100, 2021 12 04.
Artículo en Inglés | MEDLINE | ID: mdl-34756184

RESUMEN

BACKGROUND: Many countries are experiencing a resurgence of COVID-19, driven predominantly by the delta (B.1.617.2) variant of SARS-CoV-2. In response, these countries are considering the administration of a third dose of mRNA COVID-19 vaccine as a booster dose to address potential waning immunity over time and reduced effectiveness against the delta variant. We aimed to use the data repositories of Israel's largest health-care organisation to evaluate the effectiveness of a third dose of the BNT162b2 mRNA vaccine for preventing severe COVID-19 outcomes. METHODS: Using data from Clalit Health Services, which provides mandatory health-care coverage for over half of the Israeli population, individuals receiving a third vaccine dose between July 30, 2020, and Sept 23, 2021, were matched (1:1) to demographically and clinically similar controls who did not receive a third dose. Eligible participants had received the second vaccine dose at least 5 months before the recruitment date, had no previous documented SARS-CoV-2 infection, and had no contact with the health-care system in the 3 days before recruitment. Individuals who are health-care workers, live in long-term care facilities, or are medically confined to their homes were excluded. Primary outcomes were COVID-19-related admission to hospital, severe disease, and COVID-19-related death. The third dose effectiveness for each outcome was estimated as 1 - risk ratio using the Kaplan-Meier estimator. FINDINGS: 1 158 269 individuals were eligible to be included in the third dose group. Following matching, the third dose and control groups each included 728 321 individuals. Participants had a median age of 52 years (IQR 37-68) and 51% were female. The median follow-up time was 13 days (IQR 6-21) in both groups. Vaccine effectiveness evaluated at least 7 days after receipt of the third dose, compared with receiving only two doses at least 5 months ago, was estimated to be 93% (231 events for two doses vs 29 events for three doses; 95% CI 88-97) for admission to hospital, 92% (157 vs 17 events; 82-97) for severe disease, and 81% (44 vs seven events; 59-97) for COVID-19-related death. INTERPRETATION: Our findings suggest that a third dose of the BNT162b2 mRNA vaccine is effective in protecting individuals against severe COVID-19-related outcomes, compared with receiving only two doses at least 5 months ago. FUNDING: The Ivan and Francesca Berkowitz Family Living Laboratory Collaboration at Harvard Medical School and Clalit Research Institute.


Asunto(s)
Vacuna BNT162 , COVID-19/prevención & control , Inmunización Secundaria , Eficacia de las Vacunas , Adulto , Anciano , COVID-19/epidemiología , COVID-19/virología , Femenino , Humanos , Israel/epidemiología , Masculino , Vacunación Masiva , Persona de Mediana Edad , Pandemias/prevención & control , Pronóstico , SARS-CoV-2
5.
Biostatistics ; 22(2): 381-401, 2021 04 10.
Artículo en Inglés | MEDLINE | ID: mdl-31545341

RESUMEN

We propose a computationally and statistically efficient divide-and-conquer (DAC) algorithm to fit sparse Cox regression to massive datasets where the sample size $n_0$ is exceedingly large and the covariate dimension $p$ is not small but $n_0\gg p$. The proposed algorithm achieves computational efficiency through a one-step linear approximation followed by a least square approximation to the partial likelihood (PL). These sequences of linearization enable us to maximize the PL with only a small subset and perform penalized estimation via a fast approximation to the PL. The algorithm is applicable for the analysis of both time-independent and time-dependent survival data. Simulations suggest that the proposed DAC algorithm substantially outperforms the full sample-based estimators and the existing DAC algorithm with respect to the computational speed, while it achieves similar statistical efficiency as the full sample-based estimators. The proposed algorithm was applied to extraordinarily large survival datasets for the prediction of heart failure-specific readmission within 30 days among Medicare heart failure patients.


Asunto(s)
Algoritmos , Medicare , Anciano , Simulación por Computador , Humanos , Análisis de los Mínimos Cuadrados , Modelos de Riesgos Proporcionales , Estados Unidos
6.
Eur Radiol ; 32(4): 2824-2836, 2022 Apr.
Artículo en Inglés | MEDLINE | ID: mdl-34797386

RESUMEN

OBJECTIVES: To describe the imaging findings of intimate partner violence (IPV)-related injury and to evaluate the role of longitudinal imaging review in detecting IPV. METHODS: Radiology studies were reviewed in chronological order and IPV-related injuries were recorded among 400 victims of any type of abuse (group 1) and 288 of physical abuse (group 2) from January 2013 to June 2018. The likelihood of IPV was assessed as low/moderate/high based on the review of (1) current and prior anatomically related studies only and (2) longitudinal imaging history consisting of all prior studies. The first radiological study date with moderate/high suspicion was compared to the self-reported date by the victim. RESULTS: A total of 135 victims (33.8%) in group 1 and 144 victims (50%) in group 2 demonstrated IPV-related injuries. Musculoskeletal injury was most common (58.2% and 44.5% in groups 1 and 2, respectively; most commonly lower/upper extremity fractures), followed by neurologic injury (20.9% and 32.9% in groups 1 and 2, respectively; most commonly facial injury). With longitudinal imaging history, radiologists were able to identify IPV in 31% of group 1 and 46.5% of group 2 patients. Amongst these patients, earlier identification by radiologists was provided compared to the self-reported date in 62.3% of group 1 (median, 64 months) and in 52.6% of group 2 (median, 69.3 months). CONCLUSIONS: Musculoskeletal and neurological injuries were the most common IPV-related injuries. Knowledge of common injuries and longitudinal imaging history may help IPV identification when victims are not forthcoming. KEY POINTS: • Musculoskeletal injuries were the most common type of IPV-related injury, followed by neurological injuries. • With longitudinal imaging history, radiologists were able to better raise the suspicion of IPV compared to the selective review of anatomically related studies only. • With longitudinal imaging history, radiologists were able to identify IPV earlier than the self-reported date by a median of 64 months in any type of abuse, and a median of 69.3 months in physical abuse.


Asunto(s)
Fracturas Óseas , Violencia de Pareja , Diagnóstico por Imagen , Humanos , Radiólogos
7.
Hum Mol Genet ; 28(21): 3625-3636, 2019 11 01.
Artículo en Inglés | MEDLINE | ID: mdl-31518399

RESUMEN

The X-linked neurodevelopmental diseases CDKL5 deficiency disorder (CDD) and Rett syndrome (RTT) are associated with intellectual disability, infantile spasms and seizures. Although mitochondrial dysfunction has been suggested in RTT, less is understood about mitochondrial function in CDD. A comparison of bioenergetics and mitochondrial function between isogenic wild-type and mutant neural progenitor cell (NPC) lines revealed increased oxygen consumption in CDD mutant lines, which is associated with altered mitochondrial function and structure. Transcriptomic analysis revealed differential expression of genes related to mitochondrial and REDOX function in NPCs expressing the mutant CDKL5. Furthermore, a similar increase in oxygen consumption specific to RTT patient-derived isogenic mutant NPCs was observed, though the pattern of mitochondrial functional alterations was distinct from CDKL5 mutant-expressing NPCs. We propose that aberrant neural bioenergetics is a common feature between CDD and RTT disorders. The observed changes in oxidative stress and mitochondrial function may facilitate the development of therapeutic agents for CDD and related disorders.


Asunto(s)
Síndromes Epilépticos/metabolismo , Mitocondrias/metabolismo , Síndrome de Rett/metabolismo , Espasmos Infantiles/metabolismo , Adulto , Células Cultivadas , Preescolar , Metabolismo Energético , Síndromes Epilépticos/genética , Femenino , Humanos , Mitocondrias/genética , Células-Madre Neurales/citología , Células-Madre Neurales/metabolismo , Estrés Oxidativo , Oxígeno/metabolismo , Proteínas Serina-Treonina Quinasas/genética , Proteínas Serina-Treonina Quinasas/metabolismo , Síndrome de Rett/genética , Espasmos Infantiles/genética
9.
N Engl J Med ; 379(22): 2131-2139, 2018 11 29.
Artículo en Inglés | MEDLINE | ID: mdl-30304647

RESUMEN

BACKGROUND: Many patients remain without a diagnosis despite extensive medical evaluation. The Undiagnosed Diseases Network (UDN) was established to apply a multidisciplinary model in the evaluation of the most challenging cases and to identify the biologic characteristics of newly discovered diseases. The UDN, which is funded by the National Institutes of Health, was formed in 2014 as a network of seven clinical sites, two sequencing cores, and a coordinating center. Later, a central biorepository, a metabolomics core, and a model organisms screening center were added. METHODS: We evaluated patients who were referred to the UDN over a period of 20 months. The patients were required to have an undiagnosed condition despite thorough evaluation by a health care provider. We determined the rate of diagnosis among patients who subsequently had a complete evaluation, and we observed the effect of diagnosis on medical care. RESULTS: A total of 1519 patients (53% female) were referred to the UDN, of whom 601 (40%) were accepted for evaluation. Of the accepted patients, 192 (32%) had previously undergone exome sequencing. Symptoms were neurologic in 40% of the applicants, musculoskeletal in 10%, immunologic in 7%, gastrointestinal in 7%, and rheumatologic in 6%. Of the 382 patients who had a complete evaluation, 132 received a diagnosis, yielding a rate of diagnosis of 35%. A total of 15 diagnoses (11%) were made by clinical review alone, and 98 (74%) were made by exome or genome sequencing. Of the diagnoses, 21% led to recommendations regarding changes in therapy, 37% led to changes in diagnostic testing, and 36% led to variant-specific genetic counseling. We defined 31 new syndromes. CONCLUSIONS: The UDN established a diagnosis in 132 of the 382 patients who had a complete evaluation, yielding a rate of diagnosis of 35%. (Funded by the National Institutes of Health Common Fund.).


Asunto(s)
Pruebas Genéticas , Enfermedades Raras/genética , Análisis de Secuencia de ADN , Adulto , Animales , Niño , Diagnóstico Diferencial , Drosophila , Exoma , Femenino , Pruebas Genéticas/economía , Costos de la Atención en Salud/estadística & datos numéricos , Humanos , Masculino , Modelos Animales , National Institutes of Health (U.S.) , Enfermedades Raras/diagnóstico , Síndrome , Estados Unidos
10.
Bioinformatics ; 36(13): 4047-4057, 2020 07 01.
Artículo en Inglés | MEDLINE | ID: mdl-31860066

RESUMEN

MOTIVATION: The advent of in vivo automated techniques for single-cell lineaging, sequencing and analysis of gene expression has begun to dramatically increase our understanding of organismal development. We applied novel meta-analysis and visualization techniques to the EPIC single-cell-resolution developmental gene expression dataset for Caenorhabditis elegans from Bao, Murray, Waterston et al. to gain insights into regulatory mechanisms governing the timing of development. RESULTS: Our meta-analysis of the EPIC dataset revealed that a simple linear combination of the expression levels of the developmental genes is strongly correlated with the developmental age of the organism, irrespective of the cell division rate of different cell lineages. We uncovered a pattern of collective sinusoidal oscillation in gene activation, in multiple dominant frequencies and in multiple orthogonal axes of gene expression, pointing to the existence of a coordinated, multi-frequency global timing mechanism. We developed a novel method based on Fisher's Discriminant Analysis to identify gene expression weightings that maximally separate traits of interest, and found that remarkably, simple linear gene expression weightings are capable of producing sinusoidal oscillations of any frequency and phase, adding to the growing body of evidence that oscillatory mechanisms likely play an important role in the timing of development. We cross-linked EPIC with gene ontology and anatomy ontology terms, employing Fisher's Discriminant Analysis methods to identify previously unknown positive and negative genetic contributions to developmental processes and cell phenotypes. This meta-analysis demonstrates new evidence for direct linear and/or sinusoidal mechanisms regulating the timing of development. We uncovered a number of previously unknown positive and negative correlations between developmental genes and developmental processes or cell phenotypes. Our results highlight both the continued relevance of the EPIC technique, and the value of meta-analysis of previously published results. The presented analysis and visualization techniques are broadly applicable across developmental and systems biology. AVAILABILITY AND IMPLEMENTATION: Analysis software available upon request. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Proteínas de Caenorhabditis elegans , Caenorhabditis elegans , Animales , Caenorhabditis elegans/genética , Caenorhabditis elegans/metabolismo , Proteínas de Caenorhabditis elegans/genética , Proteínas de Caenorhabditis elegans/metabolismo , Linaje de la Célula , Regulación del Desarrollo de la Expresión Génica , Activación Transcripcional
11.
Genet Med ; 23(6): 1075-1085, 2021 06.
Artículo en Inglés | MEDLINE | ID: mdl-33580225

RESUMEN

PURPOSE: Genomic sequencing has become an increasingly powerful and relevant tool to be leveraged for the discovery of genetic aberrations underlying rare, Mendelian conditions. Although the computational tools incorporated into diagnostic workflows for this task are continually evolving and improving, we nevertheless sought to investigate commonalities across sequencing processing workflows to reveal consensus and standard practice tools and highlight exploratory analyses where technical and theoretical method improvements would be most impactful. METHODS: We collected details regarding the computational approaches used by a genetic testing laboratory and 11 clinical research sites in the United States participating in the Undiagnosed Diseases Network via meetings with bioinformaticians, online survey forms, and analyses of internal protocols. RESULTS: We found that tools for processing genomic sequencing data can be grouped into four distinct categories. Whereas well-established practices exist for initial variant calling and quality control steps, there is substantial divergence across sites in later stages for variant prioritization and multimodal data integration, demonstrating a diversity of approaches for solving the most mysterious undiagnosed cases. CONCLUSION: The largest differences across diagnostic workflows suggest that advances in structural variant detection, noncoding variant interpretation, and integration of additional biomedical data may be especially promising for solving chronically undiagnosed cases.


Asunto(s)
Genómica , Enfermedades no Diagnosticadas , Biología Computacional , Pruebas Genéticas , Genoma , Humanos , Programas Informáticos , Flujo de Trabajo
12.
BMC Neurol ; 21(1): 201, 2021 May 18.
Artículo en Inglés | MEDLINE | ID: mdl-34006233

RESUMEN

BACKGROUND: Characterization of prediagnostic Parkinson's Disease (PD) and early prediction of subsequent development are critical for preventive interventions, risk stratification and understanding of disease pathology. This study aims to characterize the role of the prediagnostic period in PD and, using selected features from this period as novel interception points, construct a prediction model to accelerate the diagnosis in a real-world setting. METHODS: We constructed two sets of machine learning models: a retrospective approach highlighting exposures up to 5 years prior to PD diagnosis, and an alternative model that prospectively predicted future PD diagnosis from all individuals at their first diagnosis of a gait or tremor disorder, these being features that appeared to represent the initiation of a differential diagnostic window. RESULTS: We found many novel features captured by the retrospective models; however, the high accuracy was primarily driven from surrogate diagnoses for PD, such as gait and tremor disorders, suggesting the presence of a distinctive differential diagnostic period when the clinician already suspected PD. The model utilizing a gait/tremor diagnosis as the interception point, achieved a validation AUC of 0.874 with potential time compression to a future PD diagnosis of more than 300 days. Comparisons of predictive diagnoses between the prospective and prediagnostic cohorts suggest the presence of distinctive trajectories of PD progression based on comorbidity profiles. CONCLUSIONS: Overall, our machine learning approach allows for both guiding clinical decisions such as the initiation of neuroprotective interventions and importantly, the possibility of earlier diagnosis for clinical trials for disease modifying therapies.


Asunto(s)
Enfermedad de Parkinson/diagnóstico , Marcha/fisiología , Análisis de la Marcha , Humanos , Aprendizaje Automático , Estudios Retrospectivos , Medición de Riesgo , Temblor
13.
J Med Internet Res ; 23(3): e22219, 2021 03 02.
Artículo en Inglés | MEDLINE | ID: mdl-33600347

RESUMEN

Coincident with the tsunami of COVID-19-related publications, there has been a surge of studies using real-world data, including those obtained from the electronic health record (EHR). Unfortunately, several of these high-profile publications were retracted because of concerns regarding the soundness and quality of the studies and the EHR data they purported to analyze. These retractions highlight that although a small community of EHR informatics experts can readily identify strengths and flaws in EHR-derived studies, many medical editorial teams and otherwise sophisticated medical readers lack the framework to fully critically appraise these studies. In addition, conventional statistical analyses cannot overcome the need for an understanding of the opportunities and limitations of EHR-derived studies. We distill here from the broader informatics literature six key considerations that are crucial for appraising studies utilizing EHR data: data completeness, data collection and handling (eg, transformation), data type (ie, codified, textual), robustness of methods against EHR variability (within and across institutions, countries, and time), transparency of data and analytic code, and the multidisciplinary approach. These considerations will inform researchers, clinicians, and other stakeholders as to the recommended best practices in reviewing manuscripts, grants, and other outputs from EHR-data derived studies, and thereby promote and foster rigor, quality, and reliability of this rapidly growing field.


Asunto(s)
COVID-19/epidemiología , Recolección de Datos/métodos , Registros Electrónicos de Salud , Recolección de Datos/normas , Humanos , Revisión de la Investigación por Pares/normas , Edición/normas , Reproducibilidad de los Resultados , SARS-CoV-2/aislamiento & purificación
14.
Hum Mol Genet ; 27(R1): R29-R34, 2018 05 01.
Artículo en Inglés | MEDLINE | ID: mdl-29566172

RESUMEN

While tens of thousands of pathogenic variants are used to inform the many clinical applications of genomics, there remains limited information on quantitative disease risk for the majority of variants used in clinical practice. At the same time, rising demand for genetic counselling has prompted a growing need for computational approaches that can help interpret genetic variation. Such tasks include predicting variant pathogenicity and identifying variants that are too common to be penetrant. To address these challenges, researchers are increasingly turning to integrative informatics approaches. These approaches often leverage vast sources of data, including electronic health records and population-level allele frequency databases (e.g. gnomAD), as well as machine learning techniques such as support vector machines and deep learning. In this review, we highlight recent informatics and machine learning approaches that are improving our understanding of pathogenic variation and discuss obstacles that may limit their emerging role in clinical genomics.


Asunto(s)
Biología Computacional/tendencias , Genoma Humano/genética , Genómica/tendencias , Aprendizaje Automático/tendencias , Bases de Datos Genéticas , Humanos
16.
Clin Gastroenterol Hepatol ; 18(8): 1890-1892, 2020 07.
Artículo en Inglés | MEDLINE | ID: mdl-31404664

RESUMEN

Crohn's disease (CD) and ulcerative colitis (UC) are heterogeneous. With availability of therapeutic classes with distinct immunologic mechanisms of action, it has become imperative to identify markers that predict likelihood of response to each drug class. However, robust development of such tools has been challenging because of need for large prospective cohorts with systematic and careful assessment of treatment response using validated indices. Most hospitals in the United States use electronic health records (EHRs) that warehouse a large amount of narrative (free-text) and codified (administrative) data generated during routine clinical care. These data have been used to construct virtual disease cohorts for epidemiologic research as well as for defining genetic basis of disease states or discrete laboratory values.1-3 Whether EHR-based data can be used to validate genetic associations for more nuanced outcomes such as treatment response has not been examined previously.


Asunto(s)
Colitis Ulcerosa , Enfermedad de Crohn , Enfermedades Inflamatorias del Intestino , Registros Electrónicos de Salud , Humanos , Enfermedades Inflamatorias del Intestino/tratamiento farmacológico , Estudios Prospectivos , Estados Unidos
17.
BMC Med ; 18(1): 236, 2020 08 18.
Artículo en Inglés | MEDLINE | ID: mdl-32807164

RESUMEN

BACKGROUND: Ovarian cancer causes 151,900 deaths per year worldwide. Treatment and prognosis are primarily determined by the histopathologic interpretation in combination with molecular diagnosis. However, the relationship between histopathology patterns and molecular alterations is not fully understood, and it is difficult to predict patients' chemotherapy response using the known clinical and histological variables. METHODS: We analyzed the whole-slide histopathology images, RNA-Seq, and proteomics data from 587 primary serous ovarian adenocarcinoma patients and developed a systematic algorithm to integrate histopathology and functional omics findings and to predict patients' response to platinum-based chemotherapy. RESULTS: Our convolutional neural networks identified the cancerous regions with areas under the receiver operating characteristic curve (AUCs) > 0.95 and classified tumor grade with AUCs > 0.80. Functional omics analysis revealed that expression levels of proteins participated in innate immune responses and catabolic pathways are associated with tumor grade. Quantitative histopathology analysis successfully stratified patients with different response to platinum-based chemotherapy (P = 0.003). CONCLUSIONS: These results indicated the potential clinical utility of quantitative histopathology evaluation in tumor cell detection and chemotherapy response prediction. The developed algorithm is easily extensible to other tumor types and treatment modalities.


Asunto(s)
Neoplasias Ováricas/tratamiento farmacológico , Neoplasias Ováricas/patología , Platino (Metal)/uso terapéutico , Femenino , Humanos , Persona de Mediana Edad , Pronóstico
18.
J Med Internet Res ; 22(8): e16709, 2020 08 05.
Artículo en Inglés | MEDLINE | ID: mdl-32755895

RESUMEN

BACKGROUND: Chest computed tomography (CT) is crucial for the detection of lung cancer, and many automated CT evaluation methods have been proposed. Due to the divergent software dependencies of the reported approaches, the developed methods are rarely compared or reproduced. OBJECTIVE: The goal of the research was to generate reproducible machine learning modules for lung cancer detection and compare the approaches and performances of the award-winning algorithms developed in the Kaggle Data Science Bowl. METHODS: We obtained the source codes of all award-winning solutions of the Kaggle Data Science Bowl Challenge, where participants developed automated CT evaluation methods to detect lung cancer (training set n=1397, public test set n=198, final test set n=506). The performance of the algorithms was evaluated by the log-loss function, and the Spearman correlation coefficient of the performance in the public and final test sets was computed. RESULTS: Most solutions implemented distinct image preprocessing, segmentation, and classification modules. Variants of U-Net, VGGNet, and residual net were commonly used in nodule segmentation, and transfer learning was used in most of the classification algorithms. Substantial performance variations in the public and final test sets were observed (Spearman correlation coefficient = .39 among the top 10 teams). To ensure the reproducibility of results, we generated a Docker container for each of the top solutions. CONCLUSIONS: We compared the award-winning algorithms for lung cancer detection and generated reproducible Docker images for the top solutions. Although convolutional neural networks achieved decent accuracy, there is plenty of room for improvement regarding model generalizability.


Asunto(s)
Neoplasias Pulmonares/diagnóstico por imagen , Neoplasias Pulmonares/diagnóstico , Aprendizaje Automático/normas , Tomografía Computarizada por Rayos X/métodos , Algoritmos , Humanos , Reproducibilidad de los Resultados
19.
BMC Bioinformatics ; 20(1): 268, 2019 May 28.
Artículo en Inglés | MEDLINE | ID: mdl-31138121

RESUMEN

BACKGROUND: Correcting a heterogeneous dataset that presents artefacts from several confounders is often an essential bioinformatics task. Attempting to remove these batch effects will result in some biologically meaningful signals being lost. Thus, a central challenge is assessing if the removal of unwanted technical variation harms the biological signal that is of interest to the researcher. RESULTS: We describe a novel framework, B-CeF, to evaluate the effectiveness of batch correction methods and their tendency toward over or under correction. The approach is based on comparing co-expression of adjusted gene-gene pairs to a-priori knowledge of highly confident gene-gene associations based on thousands of unrelated experiments derived from an external reference. Our framework includes three steps: (1) data adjustment with the desired methods (2) calculating gene-gene co-expression measurements for adjusted datasets (3) evaluating the performance of the co-expression measurements against a gold standard. Using the framework, we evaluated five batch correction methods applied to RNA-seq data of six representative tissue datasets derived from the GTEx project. CONCLUSIONS: Our framework enables the evaluation of batch correction methods to better preserve the original biological signal. We show that using a multiple linear regression model to correct for known confounders outperforms factor analysis-based methods that estimate hidden confounders. The code is publicly available as an R package.


Asunto(s)
Algoritmos , Biología Computacional/métodos , Bases de Datos Genéticas , Epistasis Genética , Genes , Área Bajo la Curva , Regulación de la Expresión Génica , Humanos , Curva ROC , Grasa Subcutánea/metabolismo
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA