Leveraging heterogeneity across multiple datasets increases cell-mixture deconvolution accuracy and reduces biological and technical biases.

Vallania, Francesco; Tam, Andrew; Lofgren, Shane; Schaffert, Steven; Azad, Tej D; Bongen, Erika; Haynes, Winston; Alsup, Meia; Alonso, Michael; Davis, Mark; Engleman, Edgar; Khatri, Purvesh

Vallania, Francesco; Tam, Andrew; Lofgren, Shane; Schaffert, Steven; Azad, Tej D; Bongen, Erika; Haynes, Winston; Alsup, Meia; Alonso, Michael; Davis, Mark; Engleman, Edgar; Khatri, Purvesh.

Afiliação

Vallania F; Institute for Immunity, Transplantation and Infection, Stanford University, Stanford, 94305, CA, USA.
Tam A; Stanford Center for Biomedical Informatics Research, Department of Medicine, Stanford University, Stanford, 94305, CA, USA.
Lofgren S; Institute for Immunity, Transplantation and Infection, Stanford University, Stanford, 94305, CA, USA.
Schaffert S; Stanford Institutes of Medicine Summer Research Program, Stanford University, Stanford, 94305, CA, USA.
Azad TD; Institute for Immunity, Transplantation and Infection, Stanford University, Stanford, 94305, CA, USA.
Bongen E; Stanford Center for Biomedical Informatics Research, Department of Medicine, Stanford University, Stanford, 94305, CA, USA.
Haynes W; Institute for Immunity, Transplantation and Infection, Stanford University, Stanford, 94305, CA, USA.
Alsup M; Stanford Center for Biomedical Informatics Research, Department of Medicine, Stanford University, Stanford, 94305, CA, USA.
Alonso M; Institute for Immunity, Transplantation and Infection, Stanford University, Stanford, 94305, CA, USA.
Davis M; Institute for Immunity, Transplantation and Infection, Stanford University, Stanford, 94305, CA, USA.
Engleman E; Stanford Center for Biomedical Informatics Research, Department of Medicine, Stanford University, Stanford, 94305, CA, USA.
Khatri P; Institute for Immunity, Transplantation and Infection, Stanford University, Stanford, 94305, CA, USA.

Nat Commun ; 9(1): 4735, 2018 11 09.

Article em En | MEDLINE | ID: mdl-30413720

RESUMO

In silico quantification of cell proportions from mixed-cell transcriptomics data (deconvolution) requires a reference expression matrix, called basis matrix. We hypothesize that matrices created using only healthy samples from a single microarray platform would introduce biological and technical biases in deconvolution. We show presence of such biases in two existing matrices, IRIS and LM22, irrespective of deconvolution method. Here, we present immunoStates, a basis matrix built using 6160 samples with different disease states across 42 microarray platforms. We find that immunoStates significantly reduces biological and technical biases. Importantly, we find that different methods have virtually no or minimal effect once the basis matrix is chosen. We further show that cellular proportion estimates using immunoStates are consistently more correlated with measured proportions than IRIS and LM22, across all methods. Our results demonstrate the need and importance of incorporating biological and technical heterogeneity in a basis matrix for achieving consistently high accuracy.

Assuntos

Bases de Dados como Assunto; Leucócitos Mononucleares/metabolismo; Doença; Humanos; Análise em Microsséries; Curva ROC

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Leucócitos Mononucleares / Bases de Dados como Assunto Tipo de estudo: Prognostic_studies Limite: Humans Idioma: En Ano de publicação: 2018 Tipo de documento: Article

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google