Hybrid tensor decomposition in neural network compression.
Wu, Bijiao; Wang, Dingheng; Zhao, Guangshe; Deng, Lei; Li, Guoqi.
Affiliation
  • Wu B; School of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an 710049, China. Electronic address: wbj123@stu.xjtu.edu.cn.
  • Wang D; School of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an 710049, China. Electronic address: wangdai11@stu.xjtu.edu.cn.
  • Zhao G; School of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an 710049, China. Electronic address: zhaogs@mail.xjtu.edu.cn.
  • Deng L; University of California, Santa Barbara, CA 93106, USA. Electronic address: leideng@ucsb.edu.
  • Li G; Department of Precision Instrumentation, Center for Brain Inspired Computing Research and Beijing Innovation Center for Future Chip, Tsinghua University, Beijing 100084, China. Electronic address: liguoqi@mail.tsinghua.edu.cn.
Neural Netw; 132: 309-320, 2020 Dec.
Article in English | MEDLINE | ID: mdl-32977276
ABSTRACT
Deep neural networks (DNNs) have recently enabled impressive breakthroughs in various artificial intelligence (AI) applications thanks to their capability of learning high-level features from big data. However, the computational resources that DNNs demand, especially storage, keep growing as increasingly large models are required for ever more complex applications. To address this problem, several tensor decomposition methods, including tensor-train (TT) and tensor-ring (TR), have been applied to compress DNNs and have shown considerable compression effectiveness. In this work, we introduce the hierarchical Tucker (HT) format, a classical but rarely used tensor decomposition method, and investigate its capability for neural network compression. We convert weight matrices and convolutional kernels to both the HT and TT formats for a comparative study, since TT is the most widely used decomposition method and is a variant of HT. We further show, both theoretically and experimentally, that the HT format performs better at compressing weight matrices, whereas the TT format is better suited for compressing convolutional kernels. Based on this observation, we propose a hybrid tensor decomposition strategy that combines TT and HT to compress the convolutional and fully connected parts separately, attaining better accuracy on convolutional neural networks (CNNs) than using the TT or HT format alone. Our work illuminates the prospects of hybrid tensor decomposition for neural network compression.
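The abstract's core idea, reshaping a layer's weights into a higher-order tensor and storing only small factor tensors, can be illustrated with the TT format. Below is a minimal sketch (not the authors' code) of the standard TT-SVD procedure applied to a fully connected weight matrix; the reshape dimensions, the rank cap, and the helper name tt_svd are illustrative assumptions. The HT format, which the paper favors for the fully connected parts, factorizes along a binary dimension tree in a similar spirit but is omitted here for brevity.

```python
import numpy as np

def tt_svd(tensor, max_rank):
    """Approximate `tensor` by TT cores with all TT ranks capped at `max_rank`."""
    dims = tensor.shape
    d = len(dims)
    cores = []
    rank_prev = 1
    mat = tensor.reshape(rank_prev * dims[0], -1)   # unfold the first mode
    for k in range(d - 1):
        U, S, Vt = np.linalg.svd(mat, full_matrices=False)
        rank = min(max_rank, len(S))                # truncate to the rank cap
        cores.append(U[:, :rank].reshape(rank_prev, dims[k], rank))
        mat = S[:rank, None] * Vt[:rank]            # carry the remainder forward
        mat = mat.reshape(rank * dims[k + 1], -1)   # fold the next mode in
        rank_prev = rank
    cores.append(mat.reshape(rank_prev, dims[-1], 1))
    return cores

# Hypothetical example: compress a 1024x1024 fully connected weight matrix
# by viewing it as a 32x32x32x32 tensor with TT rank 8.
W = np.random.randn(1024, 1024)
cores = tt_svd(W.reshape(32, 32, 32, 32), max_rank=8)
print(sum(c.size for c in cores), "TT parameters vs", W.size, "dense parameters")
```

Under these assumed shapes and ranks, the TT cores hold roughly 4.6 thousand parameters versus about one million in the dense matrix, which is the kind of storage saving the paper targets; the accuracy impact of such a truncation is what the comparative study in the paper quantifies.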
Subject(s)
Keywords

Full text: 1 Collection: 01-international Database: MEDLINE Main subject: Data Compression / Deep Learning Language: English Journal: Neural Netw Journal subject: NEUROLOGY Year: 2020 Document type: Article