Your browser doesn't support javascript.
loading
Generalized t-statistic for two-group classification.
Komori, Osamu; Eguchi, Shinto; Copas, John B.
Afiliación
  • Komori O; Institute of Statistical Mathematics, Midori-cho, Tachikawa, Tokyo 190-8562, Japan.
  • Eguchi S; Institute of Statistical Mathematics, Midori-cho, Tachikawa, Tokyo 190-8562, Japan.
  • Copas JB; Department of Statistics, University of Warwick, Coventry CV4 7AL, UK.
Biometrics ; 71(2): 404-16, 2015 Jun.
Article en En | MEDLINE | ID: mdl-25359078
ABSTRACT
In the classic discriminant model of two multivariate normal distributions with equal variance matrices, the linear discriminant function is optimal both in terms of the log likelihood ratio and in terms of maximizing the standardized difference (the t-statistic) between the means of the two distributions. In a typical case-control study, normality may be sensible for the control sample but heterogeneity and uncertainty in diagnosis may suggest that a more flexible model is needed for the cases. We generalize the t-statistic approach by finding the linear function which maximizes a standardized difference but with data from one of the groups (the cases) filtered by a possibly nonlinear function U. We study conditions for consistency of the method and find the function U which is optimal in the sense of asymptotic efficiency. Optimality may also extend to other measures of discriminatory efficiency such as the area under the receiver operating characteristic curve. The optimal function U depends on a scalar probability density function which can be estimated non-parametrically using a standard numerical algorithm. A lasso-like version for variable selection is implemented by adding L1-regularization to the generalized t-statistic. Two microarray data sets in the study of asthma and various cancers are used as motivating examples.
Asunto(s)
Palabras clave

Texto completo: 1 Base de datos: MEDLINE Asunto principal: Análisis Discriminante Tipo de estudio: Observational_studies / Prognostic_studies / Risk_factors_studies Idioma: En Revista: Biometrics Año: 2015 Tipo del documento: Article

Texto completo: 1 Base de datos: MEDLINE Asunto principal: Análisis Discriminante Tipo de estudio: Observational_studies / Prognostic_studies / Risk_factors_studies Idioma: En Revista: Biometrics Año: 2015 Tipo del documento: Article