RESUMEN
OBJECTIVES: To externally validate risk models for the detection of kidney cancer, as early detection of kidney cancer improves survival and stratifying the population using risk models could enable an individually tailored screening programme. METHODS: We validated the performance of 30 existing phenotypic models predicting the risk of kidney cancer in the UK Biobank cohort (n = 450 687). We compared the discrimination and calibration of models for men, women, and a mixed-sex cohort. Population level data were used to estimate model performance in a screening scenario for a range of risk thresholds (6-year risk: 0.1-1.0%). RESULTS: In all, 10 models had reasonable discrimination (area under the receiver-operating characteristic curve >0.60), although some had poor calibration. Modelling demonstrated similar performance of the best models over a range of thresholds. The models showed an improvement in ability to identify cases compared to age- and sex-based screening. All the models performed less well in women than men. CONCLUSIONS: The present study is the first comprehensive external validation of risk models for kidney cancer. The best-performing models are better at identifying individuals at high risk of kidney cancer than age and sex alone; however, the benefits are relatively small. Feasibility studies are required to determine applicability to a screening programme.
Asunto(s)
Bancos de Muestras Biológicas , Neoplasias Renales , Femenino , Humanos , Neoplasias Renales/diagnóstico , Neoplasias Renales/epidemiología , Masculino , Tamizaje Masivo , Salud Pública , Medición de Riesgo , Factores de Riesgo , Reino Unido/epidemiologíaRESUMEN
BACKGROUND: Polygenic risk scores (PRSs) can stratify populations into cardiovascular disease (CVD) risk groups. We aimed to quantify the potential advantage of adding information on PRSs to conventional risk factors in the primary prevention of CVD. METHODS AND FINDINGS: Using data from UK Biobank on 306,654 individuals without a history of CVD and not on lipid-lowering treatments (mean age [SD]: 56.0 [8.0] years; females: 57%; median follow-up: 8.1 years), we calculated measures of risk discrimination and reclassification upon addition of PRSs to risk factors in a conventional risk prediction model (i.e., age, sex, systolic blood pressure, smoking status, history of diabetes, and total and high-density lipoprotein cholesterol). We then modelled the implications of initiating guideline-recommended statin therapy in a primary care setting using incidence rates from 2.1 million individuals from the Clinical Practice Research Datalink. The C-index, a measure of risk discrimination, was 0.710 (95% CI 0.703-0.717) for a CVD prediction model containing conventional risk predictors alone. Addition of information on PRSs increased the C-index by 0.012 (95% CI 0.009-0.015), and resulted in continuous net reclassification improvements of about 10% and 12% in cases and non-cases, respectively. If a PRS were assessed in the entire UK primary care population aged 40-75 years, assuming that statin therapy would be initiated in accordance with the UK National Institute for Health and Care Excellence guidelines (i.e., for persons with a predicted risk of ≥10% and for those with certain other risk factors, such as diabetes, irrespective of their 10-year predicted risk), then it could help prevent 1 additional CVD event for approximately every 5,750 individuals screened. By contrast, targeted assessment only among people at intermediate (i.e., 5% to <10%) 10-year CVD risk could help prevent 1 additional CVD event for approximately every 340 individuals screened. Such a targeted strategy could help prevent 7% more CVD events than conventional risk prediction alone. Potential gains afforded by assessment of PRSs on top of conventional risk factors would be about 1.5-fold greater than those provided by assessment of C-reactive protein, a plasma biomarker included in some risk prediction guidelines. Potential limitations of this study include its restriction to European ancestry participants and a lack of health economic evaluation. CONCLUSIONS: Our results suggest that addition of PRSs to conventional risk factors can modestly enhance prediction of first-onset CVD and could translate into population health benefits if used at scale.
Asunto(s)
Enfermedades Cardiovasculares/epidemiología , Enfermedades Cardiovasculares/prevención & control , Factores de Riesgo de Enfermedad Cardiaca , Adulto , Anciano , Biomarcadores/sangre , Estudios de Cohortes , Femenino , Humanos , Incidencia , Masculino , Persona de Mediana Edad , Medición de Riesgo , Reino Unido/epidemiologíaRESUMEN
Cardiovascular disease (CVD) risk-prediction models are used to identify high-risk individuals and guide statin initiation. However, these models are usually derived from individuals who might initiate statins during follow-up. We present a simple approach to address statin initiation to predict "statin-naive" CVD risk. We analyzed primary care data (2004-2017) from the UK Clinical Practice Research Datalink for 1,678,727 individuals (aged 40-85 years) without CVD or statin treatment history at study entry. We derived age- and sex-specific prediction models including conventional risk factors and a time-dependent effect of statin initiation constrained to 25% risk reduction (from trial results). We compared predictive performance and measures of public-health impact (e.g., number needed to screen to prevent 1 event) against models ignoring statin initiation. During a median follow-up of 8.9 years, 103,163 individuals developed CVD. In models accounting for (versus ignoring) statin initiation, 10-year CVD risk predictions were slightly higher; predictive performance was moderately improved. However, few individuals were reclassified to a high-risk threshold, resulting in negligible improvements in number needed to screen to prevent 1 event. In conclusion, incorporating statin effects from trial results into risk-prediction models enables statin-naive CVD risk estimation and provides moderate gains in predictive ability but had a limited impact on treatment decision-making under current guidelines in this population.
Asunto(s)
Enfermedades Cardiovasculares/diagnóstico , Enfermedades Cardiovasculares/tratamiento farmacológico , Toma de Decisiones Clínicas/métodos , Técnicas de Apoyo para la Decisión , Inhibidores de Hidroximetilglutaril-CoA Reductasas/administración & dosificación , Adulto , Anciano , Anciano de 80 o más Años , Femenino , Predicción , Factores de Riesgo de Enfermedad Cardiaca , Humanos , Masculino , Persona de Mediana Edad , Valor Predictivo de las Pruebas , Atención Primaria de Salud/métodos , Medición de Riesgo/métodos , Reino UnidoRESUMEN
AIMS: There is debate about the optimum algorithm for cardiovascular disease (CVD) risk estimation. We conducted head-to-head comparisons of four algorithms recommended by primary prevention guidelines, before and after 'recalibration', a method that adapts risk algorithms to take account of differences in the risk characteristics of the populations being studied. METHODS AND RESULTS: Using individual-participant data on 360 737 participants without CVD at baseline in 86 prospective studies from 22 countries, we compared the Framingham risk score (FRS), Systematic COronary Risk Evaluation (SCORE), pooled cohort equations (PCE), and Reynolds risk score (RRS). We calculated measures of risk discrimination and calibration, and modelled clinical implications of initiating statin therapy in people judged to be at 'high' 10 year CVD risk. Original risk algorithms were recalibrated using the risk factor profile and CVD incidence of target populations. The four algorithms had similar risk discrimination. Before recalibration, FRS, SCORE, and PCE over-predicted CVD risk on average by 10%, 52%, and 41%, respectively, whereas RRS under-predicted by 10%. Original versions of algorithms classified 29-39% of individuals aged ≥40 years as high risk. By contrast, recalibration reduced this proportion to 22-24% for every algorithm. We estimated that to prevent one CVD event, it would be necessary to initiate statin therapy in 44-51 such individuals using original algorithms, in contrast to 37-39 individuals with recalibrated algorithms. CONCLUSION: Before recalibration, the clinical performance of four widely used CVD risk algorithms varied substantially. By contrast, simple recalibration nearly equalized their performance and improved modelled targeting of preventive action to clinical need.
Asunto(s)
Algoritmos , Enfermedades Cardiovasculares/etiología , Anciano , Calibración , Femenino , Humanos , Masculino , Persona de Mediana Edad , Estudios Prospectivos , Medición de RiesgoRESUMEN
Importance: It is uncertain whether depressive symptoms are independently associated with subsequent risk of cardiovascular diseases (CVDs). Objective: To characterize the association between depressive symptoms and CVD incidence across the spectrum of lower mood. Design, Setting, and Participants: A pooled analysis of individual-participant data from the Emerging Risk Factors Collaboration (ERFC; 162â¯036 participants; 21 cohorts; baseline surveys, 1960-2008; latest follow-up, March 2020) and the UK Biobank (401â¯219 participants; baseline surveys, 2006-2010; latest follow-up, March 2020). Eligible participants had information about self-reported depressive symptoms and no CVD history at baseline. Exposures: Depressive symptoms were recorded using validated instruments. ERFC scores were harmonized across studies to a scale representative of the Center for Epidemiological Studies Depression (CES-D) scale (range, 0-60; ≥16 indicates possible depressive disorder). The UK Biobank recorded the 2-item Patient Health Questionnaire 2 (PHQ-2; range, 0-6; ≥3 indicates possible depressive disorder). Main Outcomes and Measures: Primary outcomes were incident fatal or nonfatal coronary heart disease (CHD), stroke, and CVD (composite of the 2). Hazard ratios (HRs) per 1-SD higher log CES-D or PHQ-2 adjusted for age, sex, smoking, and diabetes were reported. Results: Among 162â¯036 participants from the ERFC (73%, women; mean age at baseline, 63 years [SD, 9 years]), 5078 CHD and 3932 stroke events were recorded (median follow-up, 9.5 years). Associations with CHD, stroke, and CVD were log linear. The HR per 1-SD higher depression score for CHD was 1.07 (95% CI, 1.03-1.11); stroke, 1.05 (95% CI, 1.01-1.10); and CVD, 1.06 (95% CI, 1.04-1.08). The corresponding incidence rates per 10â¯000 person-years of follow-up in the highest vs the lowest quintile of CES-D score (geometric mean CES-D score, 19 vs 1) were 36.3 vs 29.0 for CHD events, 28.0 vs 24.7 for stroke events, and 62.8 vs 53.5 for CVD events. Among 401â¯219 participants from the UK Biobank (55% were women, mean age at baseline, 56 years [SD, 8 years]), 4607 CHD and 3253 stroke events were recorded (median follow-up, 8.1 years). The HR per 1-SD higher depression score for CHD was 1.11 (95% CI, 1.08-1.14); stroke, 1.10 (95% CI, 1.06-1.14); and CVD, 1.10 (95% CI, 1.08-1.13). The corresponding incidence rates per 10â¯000 person-years of follow-up among individuals with PHQ-2 scores of 4 or higher vs 0 were 20.9 vs 14.2 for CHD events, 15.3 vs 10.2 for stroke events, and 36.2 vs 24.5 for CVD events. The magnitude and statistical significance of the HRs were not materially changed after adjustment for additional risk factors. Conclusions and Relevance: In a pooled analysis of 563â¯255 participants in 22 cohorts, baseline depressive symptoms were associated with CVD incidence, including at symptom levels lower than the threshold indicative of a depressive disorder. However, the magnitude of associations was modest.
Asunto(s)
Enfermedades Cardiovasculares/psicología , Depresión/complicaciones , Anciano , Enfermedades Cardiovasculares/epidemiología , Estudios de Cohortes , Enfermedad Coronaria/epidemiología , Enfermedad Coronaria/psicología , Femenino , Humanos , Incidencia , Masculino , Persona de Mediana Edad , Factores de Riesgo , Accidente Cerebrovascular/epidemiología , Accidente Cerebrovascular/psicologíaRESUMEN
The added value of incorporating information from repeated blood pressure and cholesterol measurements to predict cardiovascular disease (CVD) risk has not been rigorously assessed. We used data on 191,445 adults from the Emerging Risk Factors Collaboration (38 cohorts from 17 countries with data encompassing 1962-2014) with more than 1 million measurements of systolic blood pressure, total cholesterol, and high-density lipoprotein cholesterol. Over a median 12 years of follow-up, 21,170 CVD events occurred. Risk prediction models using cumulative mean values of repeated measurements and summary measures from longitudinal modeling of the repeated measurements were compared with models using measurements from a single time point. Risk discrimination (C-index) and net reclassification were calculated, and changes in C-indices were meta-analyzed across studies. Compared with the single-time-point model, the cumulative means and longitudinal models increased the C-index by 0.0040 (95% confidence interval (CI): 0.0023, 0.0057) and 0.0023 (95% CI: 0.0005, 0.0042), respectively. Reclassification was also improved in both models; compared with the single-time-point model, overall net reclassification improvements were 0.0369 (95% CI: 0.0303, 0.0436) for the cumulative-means model and 0.0177 (95% CI: 0.0110, 0.0243) for the longitudinal model. In conclusion, incorporating repeated measurements of blood pressure and cholesterol into CVD risk prediction models slightly improves risk prediction.
Asunto(s)
Determinación de la Presión Sanguínea , Enfermedades Cardiovasculares/epidemiología , Colesterol/sangre , Medición de Riesgo/métodos , Adulto , Anciano , Presión Sanguínea , Femenino , Humanos , Persona de Mediana Edad , Factores de RiesgoRESUMEN
BACKGROUND: There is debate about the value of assessing levels of C-reactive protein (CRP) and other biomarkers of inflammation for the prediction of first cardiovascular events. METHODS: We analyzed data from 52 prospective studies that included 246,669 participants without a history of cardiovascular disease to investigate the value of adding CRP or fibrinogen levels to conventional risk factors for the prediction of cardiovascular risk. We calculated measures of discrimination and reclassification during follow-up and modeled the clinical implications of initiation of statin therapy after the assessment of CRP or fibrinogen. RESULTS: The addition of information on high-density lipoprotein cholesterol to a prognostic model for cardiovascular disease that included age, sex, smoking status, blood pressure, history of diabetes, and total cholesterol level increased the C-index, a measure of risk discrimination, by 0.0050. The further addition to this model of information on CRP or fibrinogen increased the C-index by 0.0039 and 0.0027, respectively (P<0.001), and yielded a net reclassification improvement of 1.52% and 0.83%, respectively, for the predicted 10-year risk categories of "low" (<10%), "intermediate" (10% to <20%), and "high" (≥20%) (P<0.02 for both comparisons). We estimated that among 100,000 adults 40 years of age or older, 15,025 persons would initially be classified as being at intermediate risk for a cardiovascular event if conventional risk factors alone were used to calculate risk. Assuming that statin therapy would be initiated in accordance with Adult Treatment Panel III guidelines (i.e., for persons with a predicted risk of ≥20% and for those with certain other risk factors, such as diabetes, irrespective of their 10-year predicted risk), additional targeted assessment of CRP or fibrinogen levels in the 13,199 remaining participants at intermediate risk could help prevent approximately 30 additional cardiovascular events over the course of 10 years. CONCLUSIONS: In a study of people without known cardiovascular disease, we estimated that under current treatment guidelines, assessment of the CRP or fibrinogen level in people at intermediate risk for a cardiovascular event could help prevent one additional event over a period of 10 years for every 400 to 500 people screened. (Funded by the British Heart Foundation and others.).
Asunto(s)
Proteína C-Reactiva/metabolismo , Enfermedades Cardiovasculares/prevención & control , Fibrinógeno/metabolismo , Adulto , Biomarcadores/sangre , Enfermedades Cardiovasculares/sangre , Estudios de Cohortes , Femenino , Humanos , Lípidos/sangre , Masculino , Tamizaje Masivo , Persona de Mediana Edad , Guías de Práctica Clínica como Asunto , Pronóstico , Modelos de Riesgos Proporcionales , Factores de RiesgoRESUMEN
During recent decades, Bangladesh has experienced a rapid epidemiological transition from communicable to non-communicable diseases. Coronary heart disease (CHD), with myocardial infarction (MI) as its main manifestation, is a major cause of death in the country. However, there is limited reliable evidence about its determinants in this population. The Bangladesh Risk of Acute Vascular Events (BRAVE) study is an epidemiological bioresource established to examine environmental, genetic, lifestyle and biochemical determinants of CHD among the Bangladeshi population. By early 2015, the ongoing BRAVE study had recruited over 5000 confirmed first-ever MI cases, and over 5000 controls "frequency-matched" by age and sex. For each participant, information has been recorded on demographic factors, lifestyle, socioeconomic, clinical, and anthropometric characteristics. A 12-lead electrocardiogram has been recorded. Biological samples have been collected and stored, including extracted DNA, plasma, serum and whole blood. Additionally, for the 3000 cases and 3000 controls initially recruited, genotyping has been done using the CardioMetabochip+ and the Exome+ arrays. The mean age (standard deviation) of MI cases is 53 (10) years, with 88 % of cases being male and 46 % aged 50 years or younger. The median interval between reported onset of symptoms and hospital admission is 5 h. Initial analyses indicate that Bangladeshis are genetically distinct from major non-South Asian ethnicities, as well as distinct from other South Asian ethnicities. The BRAVE study is well-placed to serve as a powerful resource to investigate current and future hypotheses relating to environmental, biochemical and genetic causes of CHD in an important but under-studied South Asian population.
Asunto(s)
Pueblo Asiatico/genética , Enfermedad Coronaria/genética , Adulto , Anciano , Bangladesh , Estudios de Casos y Controles , Enfermedad de la Arteria Coronaria/etnología , Enfermedad de la Arteria Coronaria/genética , Enfermedad Coronaria/etnología , Femenino , Genotipo , Humanos , Masculino , Persona de Mediana Edad , Infarto del Miocardio/epidemiología , Riesgo , Factores de RiesgoRESUMEN
IMPORTANCE: The prevalence of cardiometabolic multimorbidity is increasing. OBJECTIVE: To estimate reductions in life expectancy associated with cardiometabolic multimorbidity. DESIGN, SETTING, AND PARTICIPANTS: Age- and sex-adjusted mortality rates and hazard ratios (HRs) were calculated using individual participant data from the Emerging Risk Factors Collaboration (689,300 participants; 91 cohorts; years of baseline surveys: 1960-2007; latest mortality follow-up: April 2013; 128,843 deaths). The HRs from the Emerging Risk Factors Collaboration were compared with those from the UK Biobank (499,808 participants; years of baseline surveys: 2006-2010; latest mortality follow-up: November 2013; 7995 deaths). Cumulative survival was estimated by applying calculated age-specific HRs for mortality to contemporary US age-specific death rates. EXPOSURES: A history of 2 or more of the following: diabetes mellitus, stroke, myocardial infarction (MI). MAIN OUTCOMES AND MEASURES: All-cause mortality and estimated reductions in life expectancy. RESULTS: In participants in the Emerging Risk Factors Collaboration without a history of diabetes, stroke, or MI at baseline (reference group), the all-cause mortality rate adjusted to the age of 60 years was 6.8 per 1000 person-years. Mortality rates per 1000 person-years were 15.6 in participants with a history of diabetes, 16.1 in those with stroke, 16.8 in those with MI, 32.0 in those with both diabetes and MI, 32.5 in those with both diabetes and stroke, 32.8 in those with both stroke and MI, and 59.5 in those with diabetes, stroke, and MI. Compared with the reference group, the HRs for all-cause mortality were 1.9 (95% CI, 1.8-2.0) in participants with a history of diabetes, 2.1 (95% CI, 2.0-2.2) in those with stroke, 2.0 (95% CI, 1.9-2.2) in those with MI, 3.7 (95% CI, 3.3-4.1) in those with both diabetes and MI, 3.8 (95% CI, 3.5-4.2) in those with both diabetes and stroke, 3.5 (95% CI, 3.1-4.0) in those with both stroke and MI, and 6.9 (95% CI, 5.7-8.3) in those with diabetes, stroke, and MI. The HRs from the Emerging Risk Factors Collaboration were similar to those from the more recently recruited UK Biobank. The HRs were little changed after further adjustment for markers of established intermediate pathways (eg, levels of lipids and blood pressure) and lifestyle factors (eg, smoking, diet). At the age of 60 years, a history of any 2 of these conditions was associated with 12 years of reduced life expectancy and a history of all 3 of these conditions was associated with 15 years of reduced life expectancy. CONCLUSIONS AND RELEVANCE: Mortality associated with a history of diabetes, stroke, or MI was similar for each condition. Because any combination of these conditions was associated with multiplicative mortality risk, life expectancy was substantially lower in people with multimorbidity.
Asunto(s)
Diabetes Mellitus , Esperanza de Vida , Mortalidad , Infarto del Miocardio , Accidente Cerebrovascular , Adulto , Anciano , Comorbilidad , Diabetes Mellitus/epidemiología , Femenino , Humanos , Masculino , Persona de Mediana Edad , Infarto del Miocardio/epidemiología , Factores de Riesgo , Accidente Cerebrovascular/epidemiologíaRESUMEN
Individual participant time-to-event data from multiple prospective epidemiologic studies enable detailed investigation into the predictive ability of risk models. Here we address the challenges in appropriately combining such information across studies. Methods are exemplified by analyses of log C-reactive protein and conventional risk factors for coronary heart disease in the Emerging Risk Factors Collaboration, a collation of individual data from multiple prospective studies with an average follow-up duration of 9.8 years (dates varied). We derive risk prediction models using Cox proportional hazards regression analysis stratified by study and obtain estimates of risk discrimination, Harrell's concordance index, and Royston's discrimination measure within each study; we then combine the estimates across studies using a weighted meta-analysis. Various weighting approaches are compared and lead us to recommend using the number of events in each study. We also discuss the calculation of measures of reclassification for multiple studies. We further show that comparison of differences in predictive ability across subgroups should be based only on within-study information and that combining measures of risk discrimination from case-control studies and prospective studies is problematic. The concordance index and discrimination measure gave qualitatively similar results throughout. While the concordance index was very heterogeneous between studies, principally because of differing age ranges, the increments in the concordance index from adding log C-reactive protein to conventional risk factors were more homogeneous.
Asunto(s)
Modelos Estadísticos , Medición de Riesgo , Proteína C-Reactiva/análisis , Enfermedad Coronaria/sangre , Enfermedad Coronaria/epidemiología , Interpretación Estadística de Datos , Femenino , Humanos , Masculino , Metaanálisis como Asunto , Persona de Mediana Edad , Modelos de Riesgos Proporcionales , Estudios Prospectivos , Factores de RiesgoRESUMEN
Cardiovascular diseases remain the number one cause of death globally. Cardiovascular disease risk scores are an integral tool in primary prevention, being used to identify individuals at the highest risk and guide the assignment of preventive interventions. Available risk scores differ substantially in terms of the population sample data sources used for their derivation and, consequently, in the absolute risks they assign to individuals. Differences in cardiovascular disease epidemiology between the populations contributing to the development of risk scores, and the target populations in which they are applied, can result in overestimation or underestimation of cardiovascular disease risks for individuals, and poorly informed clinical decisions. Given the wide plethora of cardiovascular disease risk scores available, identification of an appropriate risk score for a target population can be challenging. This Review provides an up-to-date overview of guideline-recommended cardiovascular disease risk scores from global, regional, and national contexts, evaluates their comparative characteristics and qualities, and provides guidance on selection of an appropriate risk score.
Asunto(s)
Enfermedades Cardiovasculares , Prevención Primaria , Humanos , Enfermedades Cardiovasculares/prevención & control , Enfermedades Cardiovasculares/epidemiología , Medición de Riesgo/métodos , Factores de Riesgo de Enfermedad CardiacaRESUMEN
Objective: To quantify the potential advantages of using 10 year risk prediction models for cardiovascular disease, in combination with risk thresholds specific to both age and sex, to identify individuals at high risk of cardiovascular disease for allocation of statin treatment. Design: Prospective open cohort study. Setting: Primary care data from the UK Clinical Practice Research Datalink GOLD, linked with hospital admissions from Hospital Episode Statistics and national mortality records from the Office for National Statistics in England, 1 January 2006 to 31 May 2019. Participants: 1 046 736 individuals (aged 40-85 years) with no cardiovascular disease, diabetes, or a history of statin treatment at baseline using data from electronic health records. Main outcome measures: 10 year risk of cardiovascular disease, calculated with version 2 of the QRISK cardiovascular disease risk algorithm (QRISK2), with two main strategies to identify individuals at high risk: in strategy A, estimated risk was a fixed cut-off value of ≥10% (ie, as per the UK National Institute for Health and Care Excellence guidelines); in strategy B, estimated risk was ≥10% or ≥90th centile of age and sex specific risk distributions. Results: Compared with strategy A, strategy B stratified 20 241 (149.8%) more women aged ≤53 years and 9832 (150.2%) more men aged ≤47 years as having a high risk of cardiovascular disease; for all other ages the strategies were the same. Assuming that treatment with statins would be initiated in those identified as high risk, differences in the estimated gain in cardiovascular disease-free life years from statin treatment for strategy B versus strategy A were 0.14 and 0.16 years for women and men aged 40 years, respectively; among individuals aged 40-49 years, the numbers needed to treat to prevent one cardiovascular disease event for strategy B versus strategy A were 39 versus 21 in women and 19 versus 15 in men, respectively. Conclusions: This study quantified the potential gains in cardiovascular disease-free life years when implementing prevention strategies based on age and sex specific risk thresholds instead of a fixed risk threshold for allocation of statin treatment. Such gains should be weighed against the costs of treating more younger people with statins for longer.
RESUMEN
Combining information from multiple GWASs for a disease and its risk factors has proven a powerful approach for development of polygenic risk scores (PRSs). This may be particularly useful for type 2 diabetes (T2D), a highly polygenic and heterogeneous disease where the additional predictive value of a PRS is unclear. Here, we use a meta-scoring approach to develop a metaPRS for T2D that incorporated genome-wide associations from both European and non-European genetic ancestries and T2D risk factors. We evaluated the performance of this metaPRS and benchmarked it against existing genome-wide PRS in 620,059 participants and 50,572 T2D cases amongst six diverse genetic ancestries from UK Biobank, INTERVAL, the All of Us Research Program, and the Singapore Multi-Ethnic Cohort. We show that our metaPRS was the most powerful PRS for predicting T2D in European population-based cohorts and had comparable performance to the top ancestry-specific PRS, highlighting its transferability. In UK Biobank, we show the metaPRS had stronger predictive power for 10-year risk than all individual risk factors apart from BMI and biomarkers of dysglycemia. The metaPRS modestly improved T2D risk stratification of QDiabetes risk scores for 10-year risk prediction, particularly when prioritising individuals for blood tests of dysglycemia. Overall, we present a highly predictive and transferrable PRS for T2D and demonstrate that the potential for PRS to incrementally improve T2D risk prediction when incorporated into UK guideline-recommended screening and risk prediction with a clinical risk score.
RESUMEN
BACKGROUND: Case-cohort studies are increasingly used to quantify the association of novel factors with disease risk. Conventional measures of predictive ability need modification for this design. We show how Harrell's C-index, Royston's D, and the category-based and continuous versions of the net reclassification index (NRI) can be adapted. METHODS: We simulated full cohort and case-cohort data, with sampling fractions ranging from 1% to 90%, using covariates from a cohort study of coronary heart disease, and two incidence rates. We then compared the accuracy and precision of the proposed risk prediction metrics. RESULTS: The C-index and D must be weighted in order to obtain unbiased results. The NRI does not need modification, provided that the relevant non-subcohort cases are excluded from the calculation. The empirical standard errors across simulations were consistent with analytical standard errors for the C-index and D but not for the NRI. Good relative efficiency of the prediction metrics was observed in our examples, provided the sampling fraction was above 40% for the C-index, 60% for D, or 30% for the NRI. Stata code is made available. CONCLUSIONS: Case-cohort designs can be used to provide unbiased estimates of the C-index, D measure and NRI.
Asunto(s)
Enfermedad Coronaria/epidemiología , Algoritmos , Estudios de Casos y Controles , Simulación por Computador , Enfermedad Coronaria/etiología , Interpretación Estadística de Datos , Humanos , Incidencia , Modelos Estadísticos , Modelos de Riesgos Proporcionales , Medición de Riesgo , Factores de RiesgoRESUMEN
Metabolic biomarker data quantified by nuclear magnetic resonance (NMR) spectroscopy in approximately 121,000 UK Biobank participants has recently been released as a community resource, comprising absolute concentrations and ratios of 249 circulating metabolites, lipids, and lipoprotein sub-fractions. Here we identify and characterise additional sources of unwanted technical variation influencing individual biomarkers in the data available to download from UK Biobank. These included sample preparation time, shipping plate well, spectrometer batch effects, drift over time within spectrometer, and outlier shipping plates. We developed a procedure for removing this unwanted technical variation, and demonstrate that it increases signal for genetic and epidemiological studies of the NMR metabolic biomarker data in UK Biobank. We subsequently developed an R package, ukbnmr, which we make available to the wider research community to enhance the utility of the UK Biobank NMR metabolic biomarker data and to facilitate rapid analysis.
Asunto(s)
Bancos de Muestras Biológicas , Imagen por Resonancia Magnética , Humanos , Imagen por Resonancia Magnética/métodos , Imagen por Resonancia Magnética/normas , Espectroscopía de Resonancia Magnética , Control de Calidad , Reino UnidoRESUMEN
In the absence of population-based screening, addition of screening for kidney cancer to lung cancer screening could provide an efficient and low-resource means to improve early detection. In this study, we used the UK Biobank cohort (n = 442 865) to determine the performance of the Yorkshire Lung Cancer Screening Trial (YLST) eligibility criteria for selecting individuals for kidney cancer screening. We measured the performance of two models widely used to determine eligibility for lung cancer screening (PLCO[m2012] and the Liverpool-Lung-Project-v2) and the performance of the combined YLST criteria. We found that the lung cancer models have discrimination (area under the receiver operating curve) between 0.60 and 0.68 for kidney cancer. In the UK, one in four cases (25%) of kidney cancer cases is expected to occur in those eligible for lung cancer screening, and one case of kidney cancer detected for every 200 people invited to lung cancer screening. These results suggest that adding kidney cancer screening to lung cancer screening would be an effective strategy to improve early detection rates of kidney cancer. However, most kidney cancers would not be picked up by this approach. This analysis does not address other important considerations about kidney cancer screening, such as overdiagnosis. PATIENT SUMMARY: It has been proposed that adding-on kidney cancer screening to lung cancer screening (both carried out by a computed tomography scan of the chest/abdomen) would be an easy and low-cost way of detecting cases of kidney cancer earlier, when these can be treated more easily. Lung cancer screening is usually targeted at people who are at a high risk (eg, older smokers); therefore, here we look at whether the same group of people are also at a high risk of kidney cancer. Our analysis shows that one in four people later diagnosed with kidney cancer are also at a high risk of lung cancer; hence, a combined screening programme could detect up to a quarter of kidney cancers.
Asunto(s)
Carcinoma de Células Renales , Neoplasias Renales , Neoplasias Pulmonares , Humanos , Bancos de Muestras Biológicas , Detección Precoz del Cáncer/métodos , Riñón , Neoplasias Renales/diagnóstico , Neoplasias Pulmonares/diagnóstico , Neoplasias Pulmonares/epidemiología , Tamizaje Masivo/métodos , Reino Unido/epidemiologíaRESUMEN
BACKGROUND: Many models developed for predicting the risk of cardiovascular disease (CVD) are adjusted for the competing risk of non-CVD mortality, which has been suggested to reduce potential overestimation of cumulative incidence in populations where the risk of competing events is high. The objective was to evaluate and illustrate the clinical impact of competing risk adjustment when deriving a CVD prediction model in a high-risk population. METHODS AND RESULTS: Individuals with established atherosclerotic CVD were included from the Utrecht Cardiovascular Cohort-Secondary Manifestations of ARTerial disease (UCC-SMART). In 8355 individuals, followed for a median of 8.2 years (IQR 4.2-12.5), two similar prediction models for the estimation of 10-year residual CVD risk were derived: with competing risk adjustment using a Fine and Gray model and without competing risk adjustment using a Cox proportional hazards model. On average, predictions were higher from the Cox model. The Cox model predictions overestimated the cumulative incidence [predicted-observed ratio 1.14 (95% CI 1.09-1.20)], which was most apparent in the highest risk quartiles and in older persons. Discrimination of both models was similar. When determining treatment eligibility on thresholds of predicted risks, more individuals would be treated based on the Cox model predictions. If, for example, individuals with a predicted risk > 20% were considered eligible for treatment, 34% of the population would be treated according to the Fine and Gray model predictions and 44% according to the Cox model predictions. INTERPRETATION: Individual predictions from the model unadjusted for competing risks were higher, reflecting the different interpretations of both models. For models aiming to accurately predict absolute risks, especially in high-risk populations, competing risk adjustment must be considered.