Your browser doesn't support javascript.
loading
Efficient test for nonlinear dependence of two continuous variables.
Wang, Yi; Li, Yi; Cao, Hongbao; Xiong, Momiao; Shugart, Yin Yao; Jin, Li.
Afiliação
  • Wang Y; Ministry of Education Key Laboratory of Contemporary Anthropology, Collaborative Innovation Center for Genetics and Development, School of Life Sciences, Fudan University, Shanghai, 200433, China. godspeed.wang@gmail.com.
  • Li Y; Ministry of Education Key Laboratory of Contemporary Anthropology, Collaborative Innovation Center for Genetics and Development, School of Life Sciences, Fudan University, Shanghai, 200433, China. liyistat@gmail.com.
  • Cao H; Unit on Statistical Genomics, Division of Intramural Division Programs, National Institute of Mental Health, National Institutes of Health, Bethesda, MD, USA. hongbao.cao@nih.gov.
  • Xiong M; Human Genetics Center, School of Public Health, University of Texas Houston Health Sciences Center, Houston, TX, USA. momiao.xiong@gmail.com.
  • Shugart YY; Unit on Statistical Genomics, Division of Intramural Division Programs, National Institute of Mental Health, National Institutes of Health, Bethesda, MD, USA. yin.yao@nih.gov.
  • Jin L; Division of Intramural Research Program, National Institute of Mental Health, National Institute of Health, Porter Bldg, Room 3A100, Bethesda, MD, 20892, USA. yin.yao@nih.gov.
BMC Bioinformatics ; 16: 260, 2015 Aug 19.
Article em En | MEDLINE | ID: mdl-26283601
ABSTRACT

BACKGROUND:

Testing dependence/correlation of two variables is one of the fundamental tasks in statistics. In this work, we proposed a new way of testing nonlinear dependence between two continuous variables (X and Y).

RESULTS:

We addressed this research question by using CANOVA (continuous analysis of variance, software available at https//sourceforge.net/projects/canova/). In the CANOVA framework, we first defined a neighborhood for each data point related to its X value, and then calculated the variance of the Y value within the neighborhood. Finally, we performed permutations to evaluate the significance of the observed values within the neighborhood variance. To evaluate the strength of CANOVA compared to six other methods, we performed extensive simulations to explore the relationship between methods and compared the false positive rates and statistical power using both simulated and real datasets (kidney cancer RNA-seq dataset).

CONCLUSIONS:

We concluded that CANOVA is an efficient method for testing nonlinear correlation with several advantages in real data applications.
Assuntos

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Software Limite: Humans Idioma: En Ano de publicação: 2015 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Software Limite: Humans Idioma: En Ano de publicação: 2015 Tipo de documento: Article