Variable Selection in Bayesian Multiple Instance Regression using Shotgun Stochastic Search.
Park, Seongoh; Kim, Joungyoun; Wang, Xinlei; Lim, Johan.
Affiliations
  • Park S; School of Mathematics, Statistics and Data Science, Sungshin Women's University, Seoul, Korea.
  • Kim J; Data Science Center, Sungshin Women's University, Seoul, Korea.
  • Wang X; Department of Artificial Intelligence, University of Seoul, Seoul, Korea.
  • Lim J; Center for Data Science Research and Education, College of Science, University of Texas at Arlington, Arlington, TX, USA.
Article in English | MEDLINE | ID: mdl-38646418
ABSTRACT
In multiple instance learning (MIL), a bag represents a sample that has a set of instances, each of which is described by a vector of explanatory variables, but the entire bag has only one label/response. Though many methods for MIL have been developed to date, few have paid attention to the interpretability of models and results. The proposed Bayesian regression model has a two-level hierarchy that transparently shows how explanatory variables explain, and how instances contribute to, bag responses. Moreover, two selection problems are addressed simultaneously: instance selection, which identifies the instances in each bag responsible for the bag response, and variable selection, which searches for the important covariates. To explore the joint discrete space of indicator variables created for the selection of both explanatory variables and instances, the shotgun stochastic search algorithm is modified to fit the MIL context. The proposed model also offers a natural and rigorous way to quantify uncertainty in coefficient estimation and outcome prediction, which many modern MIL applications call for. The simulation study shows that the proposed regression model can select variables and instances with high performance (AUC greater than 0.86), and thus predicts responses well. The proposed method is applied to the musk data to predict binding strengths (labels) between molecules (bags) with different conformations (instances) and target receptors. It outperforms all existing methods and can identify variables relevant to modeling responses.
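For readers unfamiliar with shotgun stochastic search, the general idea can be sketched as follows. This is a simplified, generic version for variable selection in ordinary linear regression, not the modified MIL algorithm of the paper: it has no instance indicators and no bag structure, and it uses a BIC-based score as a stand-in for a log posterior model probability. The function names (`log_score`, `neighbors`, `shotgun_search`) and the scoring choice are illustrative assumptions.

```python
import numpy as np

def log_score(X, y, gamma):
    """BIC-based stand-in for a log posterior model score (an assumption)."""
    n = len(y)
    cols = np.flatnonzero(gamma)
    design = np.column_stack([np.ones(n), X[:, cols]])  # intercept + selected columns
    coef, *_ = np.linalg.lstsq(design, y, rcond=None)
    rss = max(float(np.sum((y - design @ coef) ** 2)), 1e-12)
    return -0.5 * (n * np.log(rss / n) + design.shape[1] * np.log(n))

def neighbors(gamma):
    """All models reachable by adding, deleting, or swapping one variable."""
    out = []
    for j in range(len(gamma)):              # add or delete variable j
        g = gamma.copy()
        g[j] = not g[j]
        out.append(g)
    for i in np.flatnonzero(gamma):          # swap included i for excluded j
        for j in np.flatnonzero(~gamma):
            g = gamma.copy()
            g[i], g[j] = False, True
            out.append(g)
    return out

def shotgun_search(X, y, n_iter=30, seed=0):
    """Score every neighbor at each step, record it, then move at random
    with probability proportional to exp(score)."""
    rng = np.random.default_rng(seed)
    gamma = np.zeros(X.shape[1], dtype=bool)  # start from the empty model
    visited = {tuple(gamma.tolist()): log_score(X, y, gamma)}
    for _ in range(n_iter):
        cands = neighbors(gamma)
        scores = np.array([log_score(X, y, g) for g in cands])
        for g, s in zip(cands, scores):
            visited[tuple(g.tolist())] = float(s)
        w = np.exp(scores - scores.max())     # stabilized weights
        gamma = cands[rng.choice(len(cands), p=w / w.sum())]
    best = max(visited, key=visited.get)      # highest-scoring model seen
    return np.array(best), visited
```

Calling `shotgun_search(X, y)` on an `(n, p)` design matrix returns the highest-scoring inclusion vector found together with the scores of all visited models. In the setting described in the abstract, the indicator vector would also need to cover instance selection within each bag, which this sketch does not attempt.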

Full text: 1 Collections: 01-international Database: MEDLINE Language: En Year of publication: 2024 Document type: Article
