Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 5 de 5
Filtrar
Más filtros










Base de datos
Intervalo de año de publicación
1.
J Comput Graph Stat ; 32(3): 950-960, 2023.
Artículo en Inglés | MEDLINE | ID: mdl-38013849

RESUMEN

Elastic net penalization is widely used in high-dimensional prediction and variable selection settings. Auxiliary information on the variables, for example, groups of variables, is often available. Group-adaptive elastic net penalization exploits this information to potentially improve performance by estimating group penalties, thereby penalizing important groups of variables less than other groups. Estimating these group penalties is, however, hard due to the high dimension of the data. Existing methods are computationally expensive or not generic in the type of response. Here we present a fast method for estimation of group-adaptive elastic net penalties for generalized linear models. We first derive a low-dimensional representation of the Taylor approximation of the marginal likelihood for group-adaptive ridge penalties, to efficiently estimate these penalties. Then we show by using asymptotic normality of the linear predictors that this marginal likelihood approximates that of elastic net models. The ridge group penalties are then transformed to elastic net group penalties by matching the ridge prior variance to the elastic net prior variance as function of the group penalties. The method allows for overlapping groups and unpenalized variables, and is easily extended to other penalties. For a model-based simulation study and two cancer genomics applications we demonstrate a substantially decreased computation time and improved or matching performance compared to other methods. Supplementary materials for this article are available online.

2.
Nat Commun ; 14(1): 5635, 2023 09 13.
Artículo en Inglés | MEDLINE | ID: mdl-37704597

RESUMEN

Diagnosis of dementia with Lewy bodies (DLB) is challenging and specific biofluid biomarkers are highly needed. We employed proximity extension-based assays to measure 665 proteins in the cerebrospinal fluid (CSF) from patients with DLB (n = 109), Alzheimer´s disease (AD, n = 235) and cognitively unimpaired controls (n = 190). We identified over 50 CSF proteins dysregulated in DLB, enriched in myelination processes among others. The dopamine biosynthesis enzyme DDC was the strongest dysregulated protein, and could efficiently discriminate DLB from controls and AD (AUC:0.91 and 0.81 respectively). Classification modeling unveiled a 7-CSF biomarker panel that better discriminate DLB from AD (AUC:0.93). A custom multiplex panel for six of these markers (DDC, CRH, MMP-3, ABL1, MMP-10, THOP1) was developed and validated in independent cohorts, including an AD and DLB autopsy cohort. This DLB CSF proteome study identifies DLB-specific protein changes and translates these findings to a practicable biomarker panel that accurately identifies DLB patients, providing promising diagnostic and clinical trial testing opportunities.


Asunto(s)
Enfermedad de Alzheimer , Enfermedad por Cuerpos de Lewy , Humanos , Enfermedad de Alzheimer/diagnóstico , Enfermedad por Cuerpos de Lewy/diagnóstico , Proteoma , Autopsia , Biomarcadores
3.
BMC Bioinformatics ; 24(1): 172, 2023 Apr 26.
Artículo en Inglés | MEDLINE | ID: mdl-37101151

RESUMEN

BACKGROUND: High-dimensional prediction considers data with more variables than samples. Generic research goals are to find the best predictor or to select variables. Results may be improved by exploiting prior information in the form of co-data, providing complementary data not on the samples, but on the variables. We consider adaptive ridge penalised generalised linear and Cox models, in which the variable-specific ridge penalties are adapted to the co-data to give a priori more weight to more important variables. The R-package ecpc originally accommodated various and possibly multiple co-data sources, including categorical co-data, i.e. groups of variables, and continuous co-data. Continuous co-data, however, were handled by adaptive discretisation, potentially inefficiently modelling and losing information. As continuous co-data such as external p values or correlations often arise in practice, more generic co-data models are needed. RESULTS: Here, we present an extension to the method and software for generic co-data models, particularly for continuous co-data. At the basis lies a classical linear regression model, regressing prior variance weights on the co-data. Co-data variables are then estimated with empirical Bayes moment estimation. After placing the estimation procedure in the classical regression framework, extension to generalised additive and shape constrained co-data models is straightforward. Besides, we show how ridge penalties may be transformed to elastic net penalties. In simulation studies we first compare various co-data models for continuous co-data from the extension to the original method. Secondly, we compare variable selection performance to other variable selection methods. The extension is faster than the original method and shows improved prediction and variable selection performance for non-linear co-data relations. Moreover, we demonstrate use of the package in several genomics examples throughout the paper. CONCLUSIONS: The R-package ecpc accommodates linear, generalised additive and shape constrained additive co-data models for the purpose of improved high-dimensional prediction and variable selection. The extended version of the package as presented here (version number 3.1.1 and higher) is available on ( https://cran.r-project.org/web/packages/ecpc/ ).


Asunto(s)
Genómica , Programas Informáticos , Teorema de Bayes , Simulación por Computador , Modelos Lineales
4.
Nat Aging ; 2(11): 1040-1053, 2022 11.
Artículo en Inglés | MEDLINE | ID: mdl-37118088

RESUMEN

Development of disease-modifying therapies against Alzheimer's disease (AD) requires biomarkers reflecting the diverse pathological pathways specific for AD. We measured 665 proteins in 797 cerebrospinal fluid (CSF) samples from patients with mild cognitive impairment with abnormal amyloid (MCI(Aß+): n = 50), AD-dementia (n = 230), non-AD dementias (n = 322) and cognitively unimpaired controls (n = 195) using proximity ligation-based immunoassays. Here we identified >100 CSF proteins dysregulated in MCI(Aß+) or AD compared to controls or non-AD dementias. Proteins dysregulated in MCI(Aß+) were primarily related to protein catabolism, energy metabolism and oxidative stress, whereas those specifically dysregulated in AD dementia were related to cell remodeling, vascular function and immune system. Classification modeling unveiled biomarker panels discriminating clinical groups with high accuracies (area under the curve (AUC): 0.85-0.99), which were translated into custom multiplex assays and validated in external and independent cohorts (AUC: 0.8-0.99). Overall, this study provides novel pathophysiological leads delineating the multifactorial nature of AD and potential biomarker tools for diagnostic settings or clinical trials.


Asunto(s)
Enfermedad de Alzheimer , Disfunción Cognitiva , Humanos , Enfermedad de Alzheimer/diagnóstico , Proteoma , Péptidos beta-Amiloides/líquido cefalorraquídeo , Disfunción Cognitiva/diagnóstico , Biomarcadores/líquido cefalorraquídeo
5.
Stat Med ; 40(26): 5910-5925, 2021 11 20.
Artículo en Inglés | MEDLINE | ID: mdl-34438466

RESUMEN

Clinical research often focuses on complex traits in which many variables play a role in mechanisms driving, or curing, diseases. Clinical prediction is hard when data is high-dimensional, but additional information, like domain knowledge and previously published studies, may be helpful to improve predictions. Such complementary data, or co-data, provide information on the covariates, such as genomic location or P-values from external studies. We use multiple and various co-data to define possibly overlapping or hierarchically structured groups of covariates. These are then used to estimate adaptive multi-group ridge penalties for generalized linear and Cox models. Available group adaptive methods primarily target for settings with few groups, and therefore likely overfit for non-informative, correlated or many groups, and do not account for known structure on group level. To handle these issues, our method combines empirical Bayes estimation of the hyperparameters with an extra level of flexible shrinkage. This renders a uniquely flexible framework as any type of shrinkage can be used on the group level. We describe various types of co-data and propose suitable forms of hypershrinkage. The method is very versatile, as it allows for integration and weighting of multiple co-data sets, inclusion of unpenalized covariates and posterior variable selection. For three cancer genomics applications we demonstrate improvements compared to other models in terms of performance, variable selection stability and validation.


Asunto(s)
Genómica , Teorema de Bayes , Humanos , Modelos de Riesgos Proporcionales
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...