Search | BVS - MINISTÉRIO DA SAÚDE

1.

MCMC: Multi-Constrained Model Compression via One-Stage Envelope Reinforcement Learning.

Li, Siqi; Chen, Jun; Liu, Shanqi; Zhu, Chengrui; Tian, Guanzhong; Liu, Yong.

IEEE Trans Neural Netw Learn Syst; PP, 2024 Jan 30.

Article in English | MEDLINE | ID: mdl-38289836

ABSTRACT

Model compression methods are being developed to bridge the gap between the massive scale of neural networks and the limited hardware resources on edge devices. Since most real-world applications deployed on resource-limited hardware platforms typically have multiple hardware constraints simultaneously, most existing model compression approaches that only consider optimizing one single hardware objective are ineffective. In this article, we propose an automated pruning method called multi-constrained model compression (MCMC) that allows for the optimization of multiple hardware targets, such as latency, floating point operations (FLOPs), and memory usage, while minimizing the impact on accuracy. Specifically, we propose an improved multi-objective reinforcement learning (MORL) algorithm, the one-stage envelope deep deterministic policy gradient (DDPG) algorithm, to determine the pruning strategy for neural networks. Our improved one-stage envelope DDPG algorithm reduces exploration time and offers greater flexibility in adjusting target priorities, enhancing its suitability for pruning tasks. For instance, on the visual geometry group (VGG)-16 network, our method achieved an 80% reduction in FLOPs, a 2.31× reduction in memory usage, and a 1.92× acceleration, with an accuracy improvement of 0.09% compared with the baseline. For larger datasets, such as ImageNet, we reduced FLOPs by 50% for MobileNet-V1, resulting in a 4.7× faster speed and 1.48× memory compression, while maintaining the same accuracy. When applied to edge devices, such as JETSON XAVIER NX, our method resulted in a 71% reduction in FLOPs for MobileNet-V1, leading to a 1.63× faster speed, 1.64× memory compression, and an accuracy improvement.
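The trade-off between several hardware targets under adjustable priorities can be illustrated with a plain linear scalarization of per-objective rewards. This is only a minimal sketch, not the one-stage envelope DDPG algorithm from the abstract: the function names, the "1 - ratio" reward shaping, and the preference dictionary are all assumptions made for illustration.

```python
from typing import Dict


def objective_rewards(accuracy: float, flops_ratio: float,
                      latency_ratio: float, memory_ratio: float) -> Dict[str, float]:
    """Turn raw measurements into rewards where higher is better.

    Each *_ratio is (pruned cost / baseline cost), so a lower ratio is
    better and its reward is 1 - ratio. This shaping is an assumption.
    """
    return {
        "accuracy": accuracy,
        "flops": 1.0 - flops_ratio,
        "latency": 1.0 - latency_ratio,
        "memory": 1.0 - memory_ratio,
    }


def scalarized_reward(rewards: Dict[str, float],
                      preference: Dict[str, float]) -> float:
    """Linear scalarization: preference-weighted average of the rewards.

    Only objectives named in `preference` contribute; weights are
    normalized so that they sum to 1.
    """
    total = sum(preference.values())
    if total <= 0:
        raise ValueError("preference weights must sum to a positive value")
    return sum(w / total * rewards[name] for name, w in preference.items())


if __name__ == "__main__":
    # Hypothetical measurements for one candidate pruning strategy.
    r = objective_rewards(accuracy=0.9, flops_ratio=0.2,
                          latency_ratio=0.5, memory_ratio=0.5)
    balanced = {"accuracy": 1, "flops": 1, "latency": 1, "memory": 1}
    latency_first = {"accuracy": 1, "flops": 0.5, "latency": 3, "memory": 0.5}
    print(scalarized_reward(r, balanced))
    print(scalarized_reward(r, latency_first))
```

Changing the preference vector re-ranks candidate pruning strategies without re-measuring them, which is the kind of priority adjustment the abstract refers to.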

2.

Learning Multi-Agent Cooperation via Considering Actions of Teammates.

Liu, Shanqi; Liu, Weiwei; Chen, Wenzhou; Tian, Guanzhong; Chen, Jun; Tong, Yao; Cao, Junjie; Liu, Yong.

IEEE Trans Neural Netw Learn Syst; PP, 2023 Apr 18.

Article in English | MEDLINE | ID: mdl-37071511

ABSTRACT

Recently, value-based centralized training with decentralized execution (CTDE) multi-agent reinforcement learning (MARL) methods have achieved excellent performance in cooperative tasks. However, the most representative of these methods, Q-network MIXing (QMIX), restricts the joint action Q values to be a monotonic mixing of each agent's utilities. Furthermore, current methods cannot generalize to unseen environments or different agent configurations, which is known as the ad hoc team play situation. In this work, we propose a novel Q value decomposition that considers both the return of an agent acting on its own and the return of cooperating with other observable agents to address the nonmonotonic problem. Based on the decomposition, we propose a greedy action searching method that can improve exploration and is not affected by changes in observable agents or changes in the order of agents' actions. In this way, our method can adapt to the ad hoc team play situation. Furthermore, we utilize an auxiliary loss related to environmental cognition consistency and a modified prioritized experience replay (PER) buffer to assist training. Our extensive experimental results show that our method achieves significant performance improvements in both challenging monotonic and nonmonotonic domains, and can handle the ad hoc team play situation perfectly.
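The monotonic-mixing restriction mentioned above can be made concrete with a small sketch. It assumes a purely linear mixer with non-negative weights, which is a simplification of QMIX (the real mixer is a state-conditioned network) and is not the decomposition proposed in this article. All function names are hypothetical.

```python
from itertools import product
from typing import List, Sequence, Tuple


def mix(utilities: Sequence[float], raw_weights: Sequence[float],
        bias: float = 0.0) -> float:
    """Monotonic mixing: abs() keeps every weight non-negative, so the
    joint value never decreases when any single agent's utility rises."""
    return sum(abs(w) * q for w, q in zip(raw_weights, utilities)) + bias


def decentralized_greedy(agent_qs: List[List[float]]) -> Tuple[int, ...]:
    """Each agent independently picks the action with its highest utility."""
    return tuple(max(range(len(q)), key=q.__getitem__) for q in agent_qs)


def centralized_greedy(agent_qs: List[List[float]],
                       raw_weights: Sequence[float]) -> Tuple[int, ...]:
    """Brute-force search over joint actions for the highest mixed value."""
    joint_actions = product(*(range(len(q)) for q in agent_qs))
    return max(
        joint_actions,
        key=lambda a: mix([q[i] for q, i in zip(agent_qs, a)], raw_weights),
    )


if __name__ == "__main__":
    # Hypothetical per-agent utilities: agent 0 has 3 actions, agent 1 has 2.
    qs = [[0.1, 0.9, 0.3], [0.5, 0.2]]
    weights = [-2.0, 0.5]  # sign is discarded by abs() inside mix()
    print(decentralized_greedy(qs), centralized_greedy(qs, weights))
```

Because the mixing is monotonic, the per-agent greedy choice and the joint greedy choice coincide; the price is that payoffs where an agent's best action depends on its teammates' actions (the nonmonotonic case) cannot be represented this way.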

3.

Delving Deeper Into Mask Utilization in Video Object Segmentation.

Wang, Mengmeng; Mei, Jianbiao; Liu, Lina; Tian, Guanzhong; Liu, Yong; Pan, Zaisheng.

IEEE Trans Image Process ; 31: 6255-6266, 2022.

Article in English | MEDLINE | ID: mdl-36166565

ABSTRACT

This paper focuses on the mask utilization of video object segmentation (VOS). The mask here means the reference masks in the memory bank, i.e., several chosen high-quality predicted masks, which are usually used together with the reference frames. The reference masks depict the edge and contour features of the target object and indicate the boundary of the target against the background, while the reference frames contain the raw RGB information of the whole image. It is obvious that the reference masks could play a significant role in VOS, but this has not been well explored yet. To tackle this, we propose to investigate the mask advantages for both the encoder and the matcher. For the encoder, we provide a unified codebase to integrate and compare eight different mask-fused encoders. Half of them are inherited or summarized from existing methods, and the other half are devised by ourselves. We find the best configuration from our design and give valuable observations from the comparison. Then, we propose a new mask-enhanced matcher to reduce the background distraction and enhance the locality of the matching process. Combining the mask-fused encoder, the mask-enhanced matcher, and a standard decoder, we formulate a new architecture named MaskVOS, which sufficiently exploits the mask benefits for VOS. Qualitative and quantitative results demonstrate the effectiveness of our method. We hope our exploration will draw attention to mask utilization in VOS.

4.

A Hybrid Prognostic Method for Proton-Exchange-Membrane Fuel Cell with Decomposition Forecasting Framework Based on AEKF and LSTM.

Xia, Zetao; Wang, Yining; Ma, Longhua; Zhu, Yang; Li, Yongjie; Tao, Jili; Tian, Guanzhong.

Sensors (Basel); 23(1), 2022 Dec 24.

Article in English | MEDLINE | ID: mdl-36616764

ABSTRACT

Durability and reliability are the major bottlenecks of the proton-exchange-membrane fuel cell (PEMFC) for large-scale commercial deployment. With the help of prognostic approaches, we can reduce its maintenance cost and maximize its lifetime. This paper proposes a hybrid prognostic method for PEMFCs based on a decomposition forecasting framework. Firstly, the original voltage data is decomposed into the calendar aging part and the reversible aging part based on locally weighted regression (LOESS). Then, we apply an adaptive extended Kalman filter (AEKF) and long short-term memory (LSTM) neural network to predict those two components, respectively. Three-dimensional aging factors are introduced in the physical aging model to capture the overall aging trend better. We utilize the automatic machine-learning method based on the genetic algorithm to train the LSTM model more efficiently and improve prediction accuracy. The aging voltage is derived from the sum of the two predicted voltage components, and we can further realize the remaining useful life estimation. Experimental results show that the proposed hybrid prognostic method can realize an accurate long-term voltage-degradation prediction and outperform the single model-based method or data-based method.
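The decompose-then-recombine structure described above can be sketched as follows. This is an illustrative, stdlib-only approximation: `loess_trend` is a simple locally weighted linear fit with a tricube kernel, not a specific library's LOESS, and the AEKF and LSTM forecasters are not implemented here. The `frac` parameter and all function names are assumptions.

```python
from typing import List, Tuple


def loess_trend(y: List[float], frac: float = 0.3) -> List[float]:
    """Smooth y with a locally weighted linear regression at each index."""
    n = len(y)
    k = min(n, max(2, int(round(frac * n))))  # neighbours used per fit
    trend = []
    for i in range(n):
        idx = sorted(range(n), key=lambda j: abs(j - i))[:k]
        h = max(abs(j - i) for j in idx) + 1  # +1 keeps all weights positive
        w = [(1 - (abs(j - i) / h) ** 3) ** 3 for j in idx]  # tricube kernel
        sw = sum(w)
        mx = sum(wi * j for wi, j in zip(w, idx)) / sw
        my = sum(wi * y[j] for wi, j in zip(w, idx)) / sw
        sxx = sum(wi * (j - mx) ** 2 for wi, j in zip(w, idx))
        sxy = sum(wi * (j - mx) * (y[j] - my) for wi, j in zip(w, idx))
        slope = sxy / sxx if sxx > 0 else 0.0
        trend.append(my + slope * (i - mx))
    return trend


def decompose(y: List[float], frac: float = 0.3) -> Tuple[List[float], List[float]]:
    """Split a series into a smooth trend and the remaining fluctuation."""
    trend = loess_trend(y, frac)
    residual = [v - t for v, t in zip(y, trend)]
    return trend, residual


def recombine(trend_pred: List[float], residual_pred: List[float]) -> List[float]:
    """The predicted series is the sum of the two predicted components."""
    return [t + r for t, r in zip(trend_pred, residual_pred)]


if __name__ == "__main__":
    # Hypothetical voltage-like series: slow decay plus a periodic ripple.
    series = [3.3 - 0.01 * t + (0.02 if t % 5 == 0 else 0.0) for t in range(40)]
    trend, residual = decompose(series)
    print(trend[:3], residual[:3])
```

In the article's framework, the smooth component would be forecast by the physics-based filter and the fluctuating component by the data-driven network before summation; here `recombine` only shows that final summing step.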

5.

Adding Before Pruning: Sparse Filter Fusion for Deep Convolutional Neural Networks via Auxiliary Attention.

Tian, Guanzhong; Sun, Yiran; Liu, Yuang; Zeng, Xianfang; Wang, Mengmeng; Liu, Yong; Zhang, Jiangning; Chen, Jun.

IEEE Trans Neural Netw Learn Syst; PP, 2021 Sep 06.

Article in English | MEDLINE | ID: mdl-34487502

ABSTRACT

Filter pruning is a significant feature selection technique to shrink the existing feature fusion schemes (especially on convolution calculation and model size), which helps to develop more efficient feature fusion models while maintaining state-of-the-art performance. In addition, it reduces the storage and computation requirements of deep neural networks (DNNs) and accelerates the inference process dramatically. Existing methods mainly rely on manual constraints such as normalization to select the filters. A typical pipeline comprises two stages: first pruning the original neural network and then fine-tuning the pruned model. However, choosing a manual criterion can be somewhat tricky and stochastic. Moreover, directly regularizing and modifying filters in the pipeline suffers from being sensitive to the choice of hyperparameters, thus making the pruning procedure less robust. To address these challenges, we propose to handle the filter pruning issue through one stage: using an attention-based architecture that adaptively fuses the filter selection with filter learning in a unified network. Specifically, we present a pruning method named adding before pruning (ABP) to make the model focus on the filters of higher significance by training instead of man-made criteria such as norm, rank, etc. First, we add an auxiliary attention layer into the original model and set the significance scores in this layer to be binary. Furthermore, to propagate the gradients in the auxiliary attention layer, we design a specific gradient estimator and prove its effectiveness for convergence in the graph flow through mathematical derivation. In the end, to relieve the dependence on the complicated prior knowledge for designing the thresholding criterion, we simultaneously prune and train the filters to automatically eliminate network redundancy with recoverability. Extensive experimental results on the two typical image classification benchmarks, CIFAR-10 and ILSVRC-2012, illustrate that the proposed approach performs favorably against previous state-of-the-art filter pruning algorithms.
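Binary significance scores are not differentiable, which is why a gradient estimator is needed in the auxiliary attention layer. The sketch below uses the well-known straight-through estimator as a stand-in; it is not the specific estimator designed in the article, and the threshold, function names, and per-filter scalar outputs are assumptions chosen to keep the example small.

```python
from typing import List


def binarize(scores: List[float], threshold: float = 0.5) -> List[float]:
    """Hard gate: 1.0 keeps a filter, 0.0 prunes it."""
    return [1.0 if s >= threshold else 0.0 for s in scores]


def gated_forward(filter_outputs: List[float], scores: List[float],
                  threshold: float = 0.5) -> List[float]:
    """Multiply each filter's output by its binary gate."""
    gates = binarize(scores, threshold)
    return [g * o for g, o in zip(gates, filter_outputs)]


def ste_score_grads(filter_outputs: List[float],
                    upstream_grads: List[float]) -> List[float]:
    """Straight-through estimator for the gradient w.r.t. the scores.

    Since out_i = gate_i * filter_output_i, d(out_i)/d(gate_i) is
    filter_output_i. The step function's true derivative is zero almost
    everywhere, so the estimator treats d(gate_i)/d(score_i) as 1.
    """
    return [o * g for o, g in zip(filter_outputs, upstream_grads)]


def sgd_step(scores: List[float], grads: List[float], lr: float = 0.1) -> List[float]:
    """Plain gradient-descent update of the real-valued scores."""
    return [s - lr * g for s, g in zip(scores, grads)]


if __name__ == "__main__":
    # Hypothetical values for three filters.
    outputs = [2.0, -1.0, 3.0]
    scores = [0.9, 0.1, 0.5]
    upstream = [1.0, 0.5, 2.0]
    print(gated_forward(outputs, scores))
    print(sgd_step(scores, ste_score_grads(outputs, upstream)))
```

Because the real-valued scores keep receiving gradient even while their gates are zero, a pruned filter can cross the threshold again later, which is one way to read the "recoverability" mentioned in the abstract.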
