Your browser doesn't support javascript.
loading
MIMIC: an optimization method to identify cell type-specific marker panel for cell sorting.
Zou, Meng; Duren, Zhana; Yuan, Qiuyue; Li, Henry; Hutchins, Andrew Paul; Wong, Wing Hung; Wang, Yong.
Affiliation
  • Zou M; Department of Mathematics, Huazhong University of Science and Technology, Beijing 100190, China.
  • Duren Z; Department of Genetics and Biochemistry, Clemson University, Beijing 100190, China.
  • Yuan Q; Academy of Mathematics and Systems Science, CAS, Beijing 100190, China.
  • Li H; Department of Health Research & Policy, Bio-X Program Stanford University, Beijing 100190, China.
  • Hutchins AP; Southern University of Science and Technology, Beijing 100190, China.
  • Wong WH; Department of Statistics, Department of Biomedical Data Science, Bio-X Program Stanford University, Beijing 100190, China.
  • Wang Y; CEMS, NCMIS, MDIS, Academy of Mathematics and Systems Science, Center for Excellence in Animal Evolution and Genetics, University of Chinese Academy of Sciences, CAS, Beijing 100190, China.
Brief Bioinform ; 22(6)2021 11 05.
Article in En | MEDLINE | ID: mdl-34180954
ABSTRACT
Multi-omics data allow us to select a small set of informative markers for the discrimination of specific cell types and study of cellular heterogeneity. However, it is often challenging to choose an optimal marker panel from the high-dimensional molecular profiles for a large amount of cell types. Here, we propose a method called Mixed Integer programming Model to Identify Cell type-specific marker panel (MIMIC). MIMIC maintains the hierarchical topology among different cell types and simultaneously maximizes the specificity of a fixed number of selected markers. MIMIC was benchmarked on the mouse ENCODE RNA-seq dataset, with 29 diverse tissues, for 43 surface markers (SMs) and 1345 transcription factors (TFs). MIMIC could select biologically meaningful markers and is robust for different accuracy criteria. It shows advantages over the standard single gene-based approaches and widely used dimensional reduction methods, such as multidimensional scaling and t-SNE, both in accuracy and in biological interpretation. Furthermore, the combination of SMs and TFs achieves better specificity than SMs or TFs alone. Applying MIMIC to a large collection of 641 RNA-seq samples covering 231 cell types identifies a panel of TFs and SMs that reveal the modularity of cell type association networks. Finally, the scalability of MIMIC is demonstrated by selecting enhancer markers from mouse ENCODE data. MIMIC is freely available at https//github.com/MengZou1/MIMIC.
Subject(s)
Key words

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Organ Specificity / Software / Biomarkers / Computational Biology / Gene Expression Profiling / Flow Cytometry Type of study: Prognostic_studies Limits: Humans Language: En Journal: Brief Bioinform Journal subject: BIOLOGIA / INFORMATICA MEDICA Year: 2021 Type: Article Affiliation country: China

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Organ Specificity / Software / Biomarkers / Computational Biology / Gene Expression Profiling / Flow Cytometry Type of study: Prognostic_studies Limits: Humans Language: En Journal: Brief Bioinform Journal subject: BIOLOGIA / INFORMATICA MEDICA Year: 2021 Type: Article Affiliation country: China