University of Leicester
Browse

File(s) under permanent embargo

Reason: This item is currently closed access.

BoMW: Bag of Manifold Words for One-shot Learning Gesture Recognition from Kinect

journal contribution
posted on 2018-02-16, 15:17 authored by Lei Zhang, Shengping Zhang, Feng Jiang, Yuankai Qi, Jun Zhang, Yuliang Guo, Huiyu Zhou
In this paper, we study one-shot learning gesture recognition on RGB-D data recorded from Microsoft’s Kinect. To this end, we propose a novel bag of manifold words (BoMW) based feature representation on sysmetric positive definite (SPD) manifolds. In particular, we use covariance matrices to extract local features from RGB-D data due to its compact representation ability as well as the convenience of fusing both RGB and depth information. Since covariance matrices are SPD matrices and the space spanned by them is the SPD manifold, traditional learning methods in the Euclidean space such as sparse coding can not be directly applied to them. To overcome this problem, we propose a unified framework to transfer the sparse coding on SPD manifolds to the one on the Euclidean space, which enables any existing learning method can be used. After building BoMW representation on a video from each gesture class, a nearest neighbour classifier is adopted to perform the one-shot learning gesture recognition. Experimental results on the ChaLearn gesture dataset demonstrate the outstanding performance of the proposed one-shot learning gesture recognition method compared against state-of-the-art methods. The effectiveness of the proposed feature extraction method is also validated on a new RGB-D action recognition dataset.

Funding

S. Zhang is supported by the National Natural Science Foundation of China under Grant 61672188, the Key Research and Development Program of Shandong Province under Grant 2016GGX101021 and HIT Outstanding Young Talents Program. F. Jiang is supported by the Major State Basic Research Development Program of China (973 Program 2015CB351804) and the National Natural Science Foundation of China under Grant No. 61572155. J. Zhang is supported by the Natural Science Foundation of China (61403116) and the China Postdoctoral Science Foundation (2014M560507). H. Zhou is supported by UK EPSRC under Grants EP/N508664/1, EP/R007187/1 and EP/N011074/1, and Royal Society-Newton Advanced Fellowship under Grant NA160342.

History

Citation

IEEE Transactions on Circuits and Systems for Video Technology, 2017, PP(99)

Author affiliation

/Organisation

Version

  • AM (Accepted Manuscript)

Published in

IEEE Transactions on Circuits and Systems for Video Technology

Publisher

Institute of Electrical and Electronics Engineers (IEEE)

issn

1051-8215

eissn

1558-2205

Copyright date

2017

Publisher version

http://ieeexplore.ieee.org/document/7961242/

Language

en

Usage metrics

    University of Leicester Publications

    Categories

    No categories selected

    Keywords

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC