This paper proposes a generic procedure for training a scene specific people detector by exploiting simple human interaction. This technique works for any kind of scene im- aged by a static camera and allows to considerably increase the performances of an appearance-based people detector. The user is requested to validate the results of a basic detec- tor relying on background subtraction and proportions con- straints. From this simple supervision it is possible to select new scene specific examples that can be used for retraining the people detector used in the testing phase. These new ex- amples have the benefit of adapting the classifier to the par- ticular scene imaged by the camera, improving the detec- tion for that particular viewpoint, background, and image resolution. At the same time, positions and scales, where people can be found, are learnt, thus allowing to consider- ably reduce the number of windows that have to be scanned in the detection phase. Experimental results are presented on three different scenarios, showing an improved detection accuracy and a reduced number of false positives even when the ground plane assumption does not hold.

Scene specific people detection by simple human interaction

MUNARO, MATTEO;CENEDESE, ANGELO
2011

Abstract

This paper proposes a generic procedure for training a scene specific people detector by exploiting simple human interaction. This technique works for any kind of scene im- aged by a static camera and allows to considerably increase the performances of an appearance-based people detector. The user is requested to validate the results of a basic detec- tor relying on background subtraction and proportions con- straints. From this simple supervision it is possible to select new scene specific examples that can be used for retraining the people detector used in the testing phase. These new ex- amples have the benefit of adapting the classifier to the par- ticular scene imaged by the camera, improving the detec- tion for that particular viewpoint, background, and image resolution. At the same time, positions and scales, where people can be found, are learnt, thus allowing to consider- ably reduce the number of windows that have to be scanned in the detection phase. Experimental results are presented on three different scenarios, showing an improved detection accuracy and a reduced number of false positives even when the ground plane assumption does not hold.
2011
Computer Vision Workshops (ICCV Workshops), 2011 IEEE International Conference on
Workshop on Human Interaction in Computer Vision
978-1-4673-0062-9
File in questo prodotto:
Non ci sono file associati a questo prodotto.
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11577/2479381
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 3
  • ???jsp.display-item.citation.isi??? 1
  • OpenAlex ND
social impact