Back to Search
OtherOpen Access

Gaussian Temporal Awareness Networks for Action Localization

Author Affiliations
University of Science and Technology of China, JDSU (United States), University of Science and Technology Chittagong, Jingdong (China)
Published InarXiv (Cornell University)
Year2019

Abstract

Temporally localizing actions in a video is a fundamental challenge in video understanding. Most existing approaches have often drawn inspiration from image object detection and extended the advances, e.g., SSD and Faster R-CNN, to produce temporal locations of an action in a 1D sequence. Nevertheless, the results can suffer from robustness problem due to the design of predetermined temporal scales, which overlooks the temporal structure of an action and limits the utility on detecting actions with complex variations. In this paper, we propose to address the problem by introducing Gaussian kernels to dynamically optimize temporal scale of each action proposal. Specifically, we present Gaussian Temporal Awareness Networks (GTAN) --- a new architecture that novelly integrates the exploitation of temporal structure…
View at Publisher

BORR does not host full-text PDFs. The button above takes you to the original publisher.