ACTION-Net: Multipath Excitation for Action Recognition

编辑：映维 | 分类：CV / XR | 2021年7月1日

Note: We don't have the ability to review paper

PubDate: June 2021

Teams: Trinity College Dublin;ByteDance AI Lab

Writers: Zhengwei Wang1 Qi She2 Aljosa Smolic

PDF: ACTION-Net: Multipath Excitation for Action Recognition

ACTION-Net: Multipath Excitation for Action Recognition

Abstract

Spatial-temporal, channel-wise, and motion patterns are three complementary and crucial types of information for video action recognition. Conventional 2D CNNs are computationally cheap but cannot catch temporal relationships; 3D CNNs can achieve good performance but are computationally intensive. In this work, we tackle this dilemma by designing a generic and effective module that can be embedded into 2D CNNs. To this end, we propose a spAtiotemporal, Channel and moTion excitatION (ACTION) module consisting of three paths: Spatio-Temporal Excitation (STE) path, Channel Excitation (CE) path, and Motion Excitation (ME) path. The STE path employs one channel 3D convolution to characterize spatio-temporal representation. The CE path adaptively recalibrates channelwise feature responses by explicitly modeling interdependencies between channels in terms of the temporal aspect. The ME path calculates feature-level temporal differences, which is then utilized to excite motion-sensitive channels. We equip 2D CNNs with the proposed ACTION module to form a simple yet effective ACTION-Net with very limited extra computational cost. ACTION-Net is demonstrated by consistently outperforming 2D CNN counterparts on three backbones (i.e., ResNet-50, MobileNet V2 and BNInception) employing three datasets (i.e., Something-Something V2, Jester, and EgoGesture). Code is provided at https: //github.com/V-Sense/ACTION-Net.

本文链接：https://paper.nweon.com/10463

ACTION-Net: Multipath Excitation for Action Recognition

您可能还喜欢...

最新AR/VR行业分享

最新AR/VR专利

最新AR/VR行业招聘

ACTION-Net: Multipath Excitation for Action Recognition

您可能还喜欢...

FoV-Aware Edge Caching for Adaptive 360° Video Streaming

Dead Fun: Uncomfortable Interactions in a Virtual Reality Game for Coffins

Tele-augmentation for remote AR coaching

最新AR/VR行业分享

最新AR/VR专利

最新AR/VR行业招聘