Spatio-Temporal Analysis for Human Action Detection and Recognition in Uncontrolled Environments

Liu, Dianting; Yan, Yilin; Shyu, Mei-Ling; Zhao, Guiru; Chen, Min

Spatio-Temporal Analysis for Human Action Detection and Recognition in Uncontrolled Environments

Dianting Liu, Yilin Yan, Mei-Ling Shyu, Guiru Zhao and Min Chen
Additional contact information
Dianting Liu: Department of Electrical and Computer Engineering, University of Miami, Coral Gables, FL, USA
Yilin Yan: Department of Electrical and Computer Engineering, University of Miami, Coral Gables, FL, USA
Mei-Ling Shyu: Department of Electrical and Computer Engineering, University of Miami, Coral Gables, FL, USA
Guiru Zhao: China Earthquake Networks Center, Beijing, China
Min Chen: Computing and Software Systems, University of Washington Bothell, Bothell, WA, USA

International Journal of Multimedia Data Engineering and Management (IJMDEM), 2015, vol. 6, issue 1, 1-18

Abstract: Understanding semantic meaning of human actions captured in unconstrained environments has broad applications in fields ranging from patient monitoring, human-computer interaction, to surveillance systems. However, while great progresses have been achieved on automatic human action detection and recognition in videos that are captured in controlled/constrained environments, most existing approaches perform unsatisfactorily on videos with uncontrolled/unconstrained conditions (e.g., significant camera motion, background clutter, scaling, and light conditions). To address this issue, the authors propose a robust human action detection and recognition framework that works effectively on videos taken in controlled or uncontrolled environments. Specifically, the authors integrate the optical flow field and Harris3D corner detector to generate a new spatial-temporal information representation for each video sequence, from which the general Gaussian mixture model (GMM) is learned. All the mean vectors of the Gaussian components in the generated GMM model are concatenated to create the GMM supervector for video action recognition. They build a boosting classifier based on a set of sparse representation classifiers and hamming distance classifiers to improve the accuracy of action recognition. The experimental results on two broadly used public data sets, KTH and UCF YouTube Action, show that the proposed framework outperforms the other state-of-the-art approaches on both action detection and recognition.

Date: 2015
References: Add references at CitEc
Citations:

Downloads: (external link)
http://services.igi-global.com/resolvedoi/resolve. ... 18/ijmdem.2015010101 (application/pdf)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:igg:jmdem0:v:6:y:2015:i:1:p:1-18

Access Statistics for this article

International Journal of Multimedia Data Engineering and Management (IJMDEM) is currently edited by Chengcui Zhang

More articles in International Journal of Multimedia Data Engineering and Management (IJMDEM) from IGI Global Scientific Publishing
Bibliographic data for series maintained by Journal Editor ().