Egocentric Gesture Recognition Using Recurrent 3D Convolutional Neural Networks with Spatiotemporal Transformer Modules

Editor: Nweon | Category: CV / XR | Published: October 19, 2020

Note: We are not able to review papers.

PubDate: December 2017

Teams: University of Chinese Academy of Sciences; Indiana University

Authors: Congqi Cao; Yifan Zhang; Yi Wu; Hanqing Lu; Jian Cheng

PDF: Egocentric Gesture Recognition Using Recurrent 3D Convolutional Neural Networks with Spatiotemporal Transformer Modules

Abstract

Gesture is a natural interface for interacting with wearable devices such as VR/AR helmets and glasses. The main challenge of gesture recognition in egocentric vision arises from the global camera motion caused by the spontaneous head movement of the device wearer. In this paper, we address the problem with a novel recurrent 3D convolutional neural network for end-to-end learning. We specially design a spatiotemporal transformer module with recurrent connections between neighboring time slices, which can actively transform a 3D feature map into a canonical view in both spatial and temporal dimensions. To validate our method, we introduce a new dataset with sufficient size, variation and realism, which contains 83 gestures designed for interaction with wearable devices and more than 24,000 RGB-D gesture samples from 50 subjects captured in 6 scenes. On this dataset, we show that the proposed network outperforms competing state-of-the-art algorithms. Moreover, our method achieves state-of-the-art performance on the challenging GTEA egocentric action dataset.

Article link: https://paper.nweon.com/7070
