RecapNet: Action Proposal Generation Mimicking Human Cognitive Process

Tian Wang; Yang Chen; Zhiwei Lin; Aichun Zhu; Yong Li; Hichem Snoussi; Hui Wang

doi:10.1109/TCYB.2020.2965196

Article Dans Une Revue IEEE Transactions on Cybernetics Année : 2020

RecapNet: Action Proposal Generation Mimicking Human Cognitive Process

(1) , (1) , (2) , (3) , (1) , (4) , (2)

1
2
3
4

Tian Wang

Fonction : Auteur

School of Automation and Electrical Engineering [Beijing] (University of Science and Technology Beijing)

Yang Chen

Fonction : Auteur

School of Automation and Electrical Engineering [Beijing] (University of Science and Technology Beijing)

Zhiwei Lin

Fonction : Auteur

Faculty of Computing and Engineering [University of Ulster]

Aichun Zhu

Fonction : Auteur

Nanjing University of Science and Technology

Yong Li

Fonction : Auteur

School of Automation and Electrical Engineering [Beijing] (University of Science and Technology Beijing)

Hichem Snoussi

Fonction : Auteur
PersonId : 753580
IdHAL : hichem-snoussi
ORCID : 0000-0002-6563-2135
IdRef : 080165826

Laboratoire Modélisation et Sûreté des Systèmes

Hui Wang

Fonction : Auteur

Faculty of Computing and Engineering [University of Ulster]

Résumé

Generating action proposals in untrimmed videos is a challenging task, since video sequences usually contain lots of irrelevant contents and the duration of an action instance is arbitrary. The quality of action proposals is key to action detection performance. The previous methods mainly rely on sliding windows or anchor boxes to cover all ground-truth actions, but this is infeasible and computationally inefficient. To this end, this article proposes a RecapNet--a novel framework for generating action proposal, by mimicking the human cognitive process of understanding video content. Specifically, this RecapNet includes a residual causal convolution module to build a short memory of the past events, based on which the joint probability actionness density ranking mechanism is designed to retrieve the action proposals. The RecapNet can handle videos with arbitrary length and more important, a video sequence will need to be processed only in one single pass in order to generate all action proposals. The experiments show that the proposed RecapNet outperforms the state of the art under all metrics on the benchmark THUMOS14 and ActivityNet-1.3 datasets. The code is available publicly at https://github.com/tianwangbuaa/RecapNet.

Mots clés

Action detection action proposal residual causal convolution

Domaines

Apprentissage [cs.LG]

Jean-Baptiste VU VAN : Connectez-vous pour contacter le contributeur

https://utt.hal.science/hal-02461499

Soumis le : jeudi 30 janvier 2020-16:51:06

Dernière modification le : mercredi 24 avril 2024-17:44:54

Dates et versions

hal-02461499 , version 1 (30-01-2020)

Identifiants

HAL Id : hal-02461499 , version 1
DOI : 10.1109/TCYB.2020.2965196

Citer

Tian Wang, Yang Chen, Zhiwei Lin, Aichun Zhu, Yong Li, et al.. RecapNet: Action Proposal Generation Mimicking Human Cognitive Process. IEEE Transactions on Cybernetics, 2020, 51 (12), pp.6017 - 6028. ⟨10.1109/TCYB.2020.2965196⟩. ⟨hal-02461499⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS UTT UTT-LIST3N LM2S-UTT

41 Consultations

0 Téléchargements

RecapNet: Action Proposal Generation Mimicking Human Cognitive Process

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager