RESUMO
Personalized medicine, a paradigm of medicine tailored to a patient's characteristics, is an increasingly attractive field in health care. An important goal of personalized medicine is to identify a subgroup of patients, based on baseline covariates, that benefits more from the targeted treatment than other comparative treatments. Most of the current subgroup identification methods only focus on obtaining a subgroup with an enhanced treatment effect without paying attention to subgroup size. Yet, a clinically meaningful subgroup learning approach should identify the maximum number of patients who can benefit from the better treatment. In this article, we present an optimal subgroup selection rule (SSR) that maximizes the number of selected patients, and in the meantime, achieves the pre-specified clinically meaningful mean outcome, such as the average treatment effect. We derive two equivalent theoretical forms of the optimal SSR based on the contrast function that describes the treatment-covariates interaction in the outcome. We further propose a constrained policy tree search algorithm (CAPITAL) to find the optimal SSR within the interpretable decision tree class. The proposed method is flexible to handle multiple constraints that penalize the inclusion of patients with negative treatment effects, and to address time to event data using the restricted mean survival time as the clinically interesting mean outcome. Extensive simulations, comparison studies, and real data applications are conducted to demonstrate the validity and utility of our method.
Assuntos
Algoritmos , Medicina de Precisão , Humanos , Políticas , Medicina de Precisão/métodos , Projetos de PesquisaRESUMO
Prediction performance of a risk scoring system needs to be carefully assessed before its adoption in clinical practice. Clinical preventive care often uses risk scores to screen asymptomatic population. The primary clinical interest is to predict the risk of having an event by a prespecified future time t0 . Accuracy measures such as positive predictive values have been recommended for evaluating the predictive performance. However, for commonly used continuous or ordinal risk score systems, these measures require a subjective cutoff threshold value that dichotomizes the risk scores. The need for a cutoff value created barriers for practitioners and researchers. In this paper, we propose a threshold-free summary index of positive predictive values that accommodates time-dependent event status and competing risks. We develop a nonparametric estimator and provide an inference procedure for comparing this summary measure between 2 risk scores for censored time to event data. We conduct a simulation study to examine the finite-sample performance of the proposed estimation and inference procedures. Lastly, we illustrate the use of this measure on a real data example, comparing 2 risk score systems for predicting heart failure in childhood cancer survivors.
Assuntos
Medição de Risco/métodos , Estatísticas não Paramétricas , Biometria , Sobreviventes de Câncer , Simulação por Computador , Insuficiência Cardíaca/complicações , Humanos , Valor Preditivo dos Testes , Fatores de Risco , Fatores de TempoRESUMO
The geographic ranges in which species live is a function of many factors underlying ecological and evolutionary contingencies. Observing the geographic range of an individual species provides valuable information about these historical contingencies for a lineage, determining the distribution of many distantly related species in tandem provides information about large-scale constraints on evolutionary and ecological processes generally. We present a linear regression method that allows for the discrimination of various hypothetical biogeographical models for determining which landscape distributional pattern best matches data from the fossil record. The linear regression models used in the discrimination rely on geodesic distances between sampling sites (typically geologic formations) as the independent variable and three possible dependent variables: Dice/Sorensen similarity; Euclidean distance; and phylogenetic community dissimilarity. Both the similarity and distance measures are useful for full-community analyses without evolutionary information, whereas the phylogenetic community dissimilarity requires phylogenetic data. Importantly, the discrimination method uses linear regression residual error to provide relative measures of support for each biogeographical model tested, not absolute answers or p-values. When applied to a recently published dataset of Campanian pollen, we find evidence that supports two plant communities separated by a transitional zone of unknown size. A similar case study of ceratopsid dinosaurs using phylogenetic community dissimilarity provided no evidence of a biogeographical pattern, but this case study suffers from a lack of data to accurately discriminate and/or too much temporal mixing. Future research aiming to reconstruct the distribution of organisms across a landscape has a statistical-based method for determining what biogeographic distributional model best matches the available data.
Assuntos
Evolução Biológica , Dinossauros , Animais , Filogenia , Fósseis , Modelos EstatísticosRESUMO
The sustainable, undirected, and selective catalytic hydroxylation of arenes remains an ongoing research challenge because of the relative inertness of aryl carbon-hydrogen bonds, the higher reactivity of the phenolic products leading to over-oxidized by-products, and the frequently insufficient regioselectivity. We report that iron coordinated by a bioinspired l-cystinederived ligand can catalyze undirected arene carbon-hydrogen hydroxylation with hydrogen peroxide as the terminal oxidant. The reaction is distinguished by its broad substrate scope, excellent selectivity, and good yields, and it showcases compatibility with oxidation-sensitive functional groups, such as alcohols, polyphenols, aldehydes, and even a boronic acid. This method is well suited for the synthesis of polyphenols through multiple carbon-hydrogen hydroxylations, as well as the late-stage functionalization of natural products and drug molecules.