Your browser doesn't support javascript.
loading
knnAUC: an open-source R package for detecting nonlinear dependence between one continuous variable and one binary variable.
Li, Yi; Liu, Xiaoyu; Ma, Yanyun; Wang, Yi; Zhou, Weichen; Hao, Meng; Yuan, Zhenghong; Liu, Jie; Xiong, Momiao; Shugart, Yin Yao; Wang, Jiucun; Jin, Li.
Afiliação
  • Li Y; Ministry of Education Key Laboratory of Contemporary Anthropology, Department of Anthropology and Human Genetics, School of Life Sciences, Fudan University, Shanghai, China.
  • Liu X; Six Industrial Research Institute, Fudan University, Shanghai, China.
  • Ma Y; Human Phenome Institute, Fudan University, Shanghai, China.
  • Wang Y; Ministry of Education Key Laboratory of Contemporary Anthropology, Department of Anthropology and Human Genetics, School of Life Sciences, Fudan University, Shanghai, China.
  • Zhou W; Human Phenome Institute, Fudan University, Shanghai, China.
  • Hao M; Ministry of Education Key Laboratory of Contemporary Anthropology, Department of Anthropology and Human Genetics, School of Life Sciences, Fudan University, Shanghai, China.
  • Yuan Z; Six Industrial Research Institute, Fudan University, Shanghai, China.
  • Liu J; Human Phenome Institute, Fudan University, Shanghai, China.
  • Xiong M; Ministry of Education Key Laboratory of Contemporary Anthropology, Department of Anthropology and Human Genetics, School of Life Sciences, Fudan University, Shanghai, China.
  • Shugart YY; Human Phenome Institute, Fudan University, Shanghai, China.
  • Wang J; State Key Laboratory of Genetic Engineering, Collaborative Innovation Center for Genetics and Development, School of Life Sciences, Fudan University, Shanghai, China.
  • Jin L; Department of Computational Medicine & Bioinformatics, University of Michigan, Ann Arbor, MI, USA.
BMC Bioinformatics ; 19(1): 448, 2018 Nov 22.
Article em En | MEDLINE | ID: mdl-30466390
BACKGROUND: Testing the dependence of two variables is one of the fundamental tasks in statistics. In this work, we developed an open-source R package (knnAUC) for detecting nonlinear dependence between one continuous variable X and one binary dependent variables Y (0 or 1). RESULTS: We addressed this problem by using knnAUC (k-nearest neighbors AUC test, the R package is available at https://sourceforge.net/projects/knnauc/ ). In the knnAUC software framework, we first resampled a dataset to get the training and testing dataset according to the sample ratio (from 0 to 1), and then constructed a k-nearest neighbors algorithm classifier to get the yhat estimator (the probability of y = 1) of testy (the true label of testing dataset). Finally, we calculated the AUC (area under the curve of receiver operating characteristic) estimator and tested whether the AUC estimator is greater than 0.5. To evaluate the advantages of knnAUC compared to seven other popular methods, we performed extensive simulations to explore the relationships between eight different methods and compared the false positive rates and statistical power using both simulated and real datasets (Chronic hepatitis B datasets and kidney cancer RNA-seq datasets). CONCLUSIONS: We concluded that knnAUC is an efficient R package to test non-linear dependence between one continuous variable and one binary dependent variable especially in computational biology area.
Assuntos
Palavras-chave

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Análise de Sequência de RNA Tipo de estudo: Prognostic_studies Limite: Humans Idioma: En Ano de publicação: 2018 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Análise de Sequência de RNA Tipo de estudo: Prognostic_studies Limite: Humans Idioma: En Ano de publicação: 2018 Tipo de documento: Article