Your browser doesn't support javascript.
loading
A nonlinear correlation measure with applications to gene expression data.
Tripathi, Yogesh M; Chatla, Suneel Babu; Chang, Yuan-Chin I; Huang, Li-Shan; Shieh, Grace S.
Afiliação
  • Tripathi YM; Institute of Statistical Science, Academia Sinica, Taipei, Taiwan.
  • Chatla SB; Indian Institute of Technology Patna, Bihta, India.
  • Chang YI; Institute of Statistics, National Tsing Hua University, Hsinchu, Taiwan.
  • Huang LS; Department of Mathematical Sciences, University of Texas at El Paso, El Paso, Texas, United States of America.
  • Shieh GS; Institute of Statistical Science, Academia Sinica, Taipei, Taiwan.
PLoS One ; 17(6): e0270270, 2022.
Article em En | MEDLINE | ID: mdl-35727808
ABSTRACT
Nonlinear correlation exists in many types of biomedical data. Several types of pairwise gene expression in humans and other organisms show nonlinear correlation across time, e.g., genes involved in human T helper (Th17) cells differentiation, which motivated this study. The proposed procedure, called Kernelized correlation (Kc), first transforms nonlinear data on the plane via a function (kernel, usually nonlinear) to a high-dimensional (Hilbert) space. Next, we plug the transformed data into a classical correlation coefficient, e.g., Pearson's correlation coefficient (r), to yield a nonlinear correlation measure. The algorithm to compute Kc is developed and the R code is provided online. In three simulated nonlinear cases, when noise in data is moderate, Kc with the RBF kernel (Kc-RBF) outperforms Pearson's r and the well-known distance correlation (dCor). However, when noise in data is low, Pearson's r and dCor perform slightly better than (equivalently to) Kc-RBF in Case 1 and 3 (in Case 2); Kendall's tau performs worse than the aforementioned measures in all cases. In Application 1 to discover genes involved in the early Th17 cell differentiation, Kc is shown to detect the nonlinear correlations of four genes with IL17A (a known marker gene), while dCor detects nonlinear correlations of two pairs, and DESeq fails in all these pairs. Next, Kc outperforms Pearson's and dCor, in estimating the nonlinear correlation of negatively correlated gene pairs in yeast cell cycle regulation. In conclusion, Kc is a simple and competent procedure to measure pairwise nonlinear correlations.
Assuntos

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Saccharomyces cerevisiae / Algoritmos Limite: Humans Idioma: En Ano de publicação: 2022 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Saccharomyces cerevisiae / Algoritmos Limite: Humans Idioma: En Ano de publicação: 2022 Tipo de documento: Article