An enhanced 3DCNN‐ConvLSTM for spatiotemporal multimedia data analysis

Tian Wang; Jiakun Li; Mengyi Zhang; Aichun Zhu; Hichem Snoussi; Chang Hyuck Choi

doi:10.1002/cpe.5302

Journal article, Concurrency and Computation: Practice and Experience, Year: 2019


Tian Wang

Role: Author

Beihang University

Jiakun Li

Role: Author

Beihang University

Mengyi Zhang

Role: Author

Nanjing University of Science and Technology

Aichun Zhu

Role: Author

Nanjing University of Science and Technology

Hichem Snoussi

Role: Author
PersonId : 753580
IdHAL : hichem-snoussi
ORCID : 0000-0002-6563-2135
IdRef : 080165826

Laboratoire Modélisation et Sûreté des Systèmes

Chang Hyuck Choi

Role: Author

Chosun University

Abstract

Human action recognition remains a challenging and complex task in computer vision. Combining a CNN with an RNN is a common and effective network structure for this task. Specifically, we use a 3DCNN for the CNN part and a ConvLSTM for the RNN part. We divide each video evenly into multiple temporal segments and compress each segment into a single feature map with a pooling layer. Our novel contribution is the addition of pooling, dropout, and batch normalization layers to the ConvLSTM. We test our model on the KTH, UCF‐11, and HMDB51 datasets and achieve high action recognition accuracy.
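
The segment-and-pool step described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function name `segment_and_pool`, the `(T, H, W, C)` feature layout, and the choice between max and mean pooling are assumptions made for illustration, since the abstract does not state which pooling operation or tensor layout the paper uses.

```python
import numpy as np


def segment_and_pool(features: np.ndarray, num_segments: int, mode: str = "max") -> np.ndarray:
    """Split a (T, H, W, C) feature sequence into near-equal temporal segments
    and pool each segment over time into a single (H, W, C) feature map.

    Hypothetical helper: the layout and pooling type are assumptions.
    """
    if features.ndim != 4:
        raise ValueError("expected features of shape (T, H, W, C)")
    num_frames = features.shape[0]
    if not 1 <= num_segments <= num_frames:
        raise ValueError("num_segments must be between 1 and T")
    if mode not in ("max", "mean"):
        raise ValueError("mode must be 'max' or 'mean'")

    # np.array_split tolerates T not divisible by num_segments:
    # the first T % num_segments segments receive one extra frame.
    segments = np.array_split(features, num_segments, axis=0)

    reduce = np.max if mode == "max" else np.mean
    # Pool over the temporal axis only, keeping the spatial layout intact.
    return np.stack([reduce(seg, axis=0) for seg in segments], axis=0)


# Example: 16 time steps of 8x8 feature maps with 4 channels, 4 segments.
video_features = np.random.default_rng(0).random((16, 8, 8, 4))
pooled = segment_and_pool(video_features, num_segments=4)
print(pooled.shape)  # (4, 8, 8, 4)
```

In a pipeline of the kind the abstract outlines, the resulting sequence of one feature map per segment would be what a recurrent stage such as a ConvLSTM consumes; how the paper wires these stages together is not specified on this page.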

Domains

Operations research [math.OC]

Contributor: Jean-Baptiste VU VAN

https://utt.hal.science/hal-02297518

Submitted on: Thursday, September 26, 2019, 11:18:38

Last modified on: Thursday, April 25, 2024, 03:17:59

Dates and versions

hal-02297518 , version 1 (26-09-2019)

Identifiers

HAL Id : hal-02297518 , version 1
DOI : 10.1002/cpe.5302

Cite

Tian Wang, Jiakun Li, Mengyi Zhang, Aichun Zhu, Hichem Snoussi, et al. An enhanced 3DCNN‐ConvLSTM for spatiotemporal multimedia data analysis. Concurrency and Computation: Practice and Experience, 2019, pp. e5302. ⟨10.1002/cpe.5302⟩. ⟨hal-02297518⟩


Collections

CNRS TDS-MACS UTT UTT-LIST3N LM2S-UTT

