Search | VHL Regional Portal

Towards Reaching Human Performance in Pedestrian Detection.

Zhang, Shanshan; Benenson, Rodrigo; Omran, Mohamed; Hosang, Jan; Schiele, Bernt.

IEEE Trans Pattern Anal Mach Intell ; 40(4): 973-986, 2018 04.

Article in English | MEDLINE | ID: mdl-28475049

ABSTRACT

Encouraged by the recent progress in pedestrian detection, we investigate the gap between current state-of-the-art methods and the "perfect single frame detector". We enable our analysis by creating a human baseline for pedestrian detection (over the Caltech pedestrian dataset). After manually clustering the frequent errors of a top detector, we characterise both localisation and background-versus-foreground errors. To address localisation errors we study the impact of training annotation noise on the detector performance, and show that we can improve results even with a small portion of sanitised training data. To address background/foreground discrimination, we study convnets for pedestrian detection, and discuss which factors affect their performance. Other than our in-depth analysis, we report top performance on the Caltech pedestrian dataset, and provide a new sanitised set of training and test annotations.

Subject(s)

Image Processing, Computer-Assisted/methods , Pattern Recognition, Automated/methods , Pedestrians/classification , Video Recording/methods , Cluster Analysis , Databases, Factual , Humans

What Makes for Effective Detection Proposals?

Hosang, Jan; Benenson, Rodrigo; Dollár, Piotr; Schiele, Bernt.

IEEE Trans Pattern Anal Mach Intell ; 38(4): 814-30, 2016 Apr.

Article in English | MEDLINE | ID: mdl-26959679

ABSTRACT

Current top performing object detectors employ detection proposals to guide the search for objects, thereby avoiding exhaustive sliding window search across images. Despite the popularity and widespread use of detection proposals, it is unclear which trade-offs are made when using them during object detection. We provide an in-depth analysis of twelve proposal methods along with four baselines regarding proposal repeatability, ground truth annotation recall on PASCAL, ImageNet, and MS COCO, and their impact on DPM, R-CNN, and Fast R-CNN detection performance. Our analysis shows that for object detection improving proposal localisation accuracy is as important as improving recall. We introduce a novel metric, the average recall (AR), which rewards both high recall and good localisation and correlates surprisingly well with detection performance. Our findings show common strengths and weaknesses of existing methods, and provide insights and metrics for selecting and tuning proposal methods.

ABSTRACT

Subject(s)

ABSTRACT

SEND TO:

SELECTION OF CITATIONS

SEARCH DETAIL