MixedPeds: Pedestrian Detection in Unannotated Videos using Synthetically Generated Human-agents for Training

编辑：映维 | 分类：CV / XR | 2020年7月24日

Note: We don't have the ability to review paper

PubDate: Nov 2017

Teams: The University of North Carolina at Chapel Hill

Writers: Ernest C. Cheung, Tsan Kwong Wong, Aniket Bera, Dinesh Manocha

PDF: MixedPeds: Pedestrian Detection in Unannotated Videos using Synthetically Generated Human-agents for Training

MixedPeds: Pedestrian Detection in Unannotated Videos using Synthetically Generated Human-agents for Training

Abstract

We present a new method for training pedestrian detectors on an unannotated set of images. We produce a mixed reality dataset that is composed of real-world background images and synthetically generated static human-agents. Our approach is general, robust, and makes no other assumptions about the unannotated dataset regarding the number or location of pedestrians. We automatically extract from the dataset: i) the vanishing point to calibrate the virtual camera, and ii) the pedestrians’ scales to generate a Spawn Probability Map, which is a novel concept that guides our algorithm to place the pedestrians at appropriate locations. After putting synthetic human-agents in the unannotated images, we use these augmented images to train a Pedestrian Detector, with the annotations generated along with the synthetic agents. We conducted our experiments using Faster R-CNN by comparing the detection results on the unannotated dataset performed by the detector trained using our approach and detectors trained with other manually labeled datasets. We showed that our approach improves the average precision by 5-13% over these detectors.

本文链接：https://paper.nweon.com/4255

MixedPeds: Pedestrian Detection in Unannotated Videos using Synthetically Generated Human-agents for Training

您可能还喜欢...

最新AR/VR行业分享

最新AR/VR专利

最新AR/VR行业招聘

MixedPeds: Pedestrian Detection in Unannotated Videos using Synthetically Generated Human-agents for Training

您可能还喜欢...

Build-and-Touch: A Low-Cost, DIY, Open-Source Approach Towards Touchable Virtual Reality

A Gaze-Based Virtual Keyboard Using a Mouth Switch for Command Selection

Learning to compose 6-DoF omnidirectional videos using multi-sphere images

最新AR/VR行业分享

最新AR/VR专利

最新AR/VR行业招聘