Pesquisa | Portal de Pesquisa da BVS

Multi-Level Content-Aware Boundary Detection for Temporal Action Proposal Generation.

Su, Taiyi; Wang, Hanli; Wang, Lei.

IEEE Trans Image Process ; 32: 6090-6101, 2023.

Artigo em Inglês | MEDLINE | ID: mdl-37922166

RESUMO

It is challenging to generate temporal action proposals from untrimmed videos. In general, boundary-based temporal action proposal generators are based on detecting temporal action boundaries, where a classifier is usually applied to evaluate the probability of each temporal action location. However, most existing approaches treat boundaries and contents separately, which neglect that the context of actions and the temporal locations complement each other, resulting in incomplete modeling of boundaries and contents. In addition, temporal boundaries are often located by exploiting either local clues or global information, without mining local temporal information and temporal-to-temporal relations sufficiently at different levels. Facing these challenges, a novel approach named multi-level content-aware boundary detection (MCBD) is proposed to generate temporal action proposals from videos, which jointly models the boundaries and contents of actions and captures multi-level (i.e., frame level and proposal level) temporal and context information. Specifically, the proposed MCBD preliminarily mines rich frame-level features to generate one-dimensional probability sequences, and further exploits temporal-to-temporal proposal-level relations to produce two-dimensional probability maps. The final temporal action proposals are obtained by a fusion of the multi-level boundary and content probabilities, achieving precise boundaries and reliable confidence of proposals. The extensive experiments on the three benchmark datasets of THUMOS14, ActivityNet v1.3 and HACS demonstrate the effectiveness of the proposed MCBD compared to state-of-the-art methods. The source code of this work can be found in https://mic.tongji.edu.cn.

Advancing Image Understanding in Poor Visibility Environments: A Collective Benchmark Study.

Yang, Wenhan; Yuan, Ye; Ren, Wenqi; Liu, Jiaying; Scheirer, Walter J; Wang, Zhangyang; Zhang, Taiheng; Zhong, Qiaoyong; Xie, Di; Pu, Shiliang; Zheng, Yuqiang; Qu, Yanyun; Xie, Yuhong; Chen, Liang; Li, Zhonghao; Hong, Chen; Jiang, Hao; Yang, Siyuan; Liu, Yan; Qu, Xiaochao; Wan, Pengfei; Zheng, Shuai; Zhong, Minhui; Su, Taiyi; He, Lingzhi; Guo, Yandong; Zhao, Yao; Zhu, Zhenfeng; Liang, Jinxiu; Wang, Jingwen; Chen, Tianyi; Quan, Yuhui; Xu, Yong; Liu, Bo; Liu, Xin; Sun, Qi; Lin, Tingyu; Li, Xiaochuan; Lu, Feng; Gu, Lin; Zhou, Shengdi; Cao, Cong; Zhang, Shifeng; Chi, Cheng; Zhuang, Chubin; Lei, Zhen; Li, Stan Z; Wang, Shizheng; Liu, Ruizhe; Yi, Dong.

IEEE Trans Image Process ; 2020 Mar 27.

Artigo em Inglês | MEDLINE | ID: mdl-32224457

RESUMO

Existing enhancement methods are empirically expected to help the high-level end computer vision task: however, that is observed to not always be the case in practice. We focus on object or face detection in poor visibility enhancements caused by bad weathers (haze, rain) and low light conditions. To provide a more thorough examination and fair comparison, we introduce three benchmark sets collected in real-world hazy, rainy, and low-light conditions, respectively, with annotated objects/faces. We launched the UG2+ challenge Track 2 competition in IEEE CVPR 2019, aiming to evoke a comprehensive discussion and exploration about whether and how low-level vision techniques can benefit the high-level automatic visual recognition in various scenarios. To our best knowledge, this is the first and currently largest effort of its kind. Baseline results by cascading existing enhancement and detection models are reported, indicating the highly challenging nature of our new data as well as the large room for further technical innovations. Thanks to a large participation from the research community, we are able to analyze representative team solutions, striving to better identify the strengths and limitations of existing mindsets as well as the future directions.

RESUMO

RESUMO

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA