trackingmultiplepeoplewithilluminationmapsdisi.unitn.it/~zen/data/icpr10_zen_poster.pdf · hjs- map...

1
Tracking Multiple People with Illumination Maps Gloria Zen, Oswald Lanz, Stefano Messelodi, Elisa Ricci { g z e n, l a n z, m e s s e l o d, e l i r i c c i } @ f b k . e u Contribution Motivation: developing a visual tracking system that is robust and adaptive to changing illumination conditions. Our Scenario: (i) non-homogeneous and non-stationary (slowly varying in time) illumination conditions, (ii) occlusions due to other targets or neighbouring objects. Color-based tracking: cues other than color (motion, edges or speech) are not always reliable; color histograms are sensitive to illumination. Idea: learn a global illumination map describing the lighting conditions of the scene . Our approach: (i) joint estimation of the position of the target and its illumination condition within a Bayesian framework, (ii) implement the map as a hierchical Markov Random Field (MRF) with a novel efficient inference approach. Experiments. We tested our method on two different scenarios: our laboratory (LAB) and a public dataset (PETS2007). Modelling the illumination changes The color change under changing illumination can be described by the so-called diagonal (or von Kries) model [3], according to which each color channel c is multiplied by a single factor β c . Creating the Illumination Map The map is obtained with a hierarchical MRF where: hidden nodes correspond to cells in the room. The illumination coefficients β =(β r , β g , β b ) are the hidden variables. observations O i associated to the i-th node are histograms col- lected for different targets during a time interval Δ t . A clustering method is applied to the collected histograms to reduce the effect of noisy observations. the likelihood φ i (β i , O i ) for node i identify the β which better explains the observation O i . The pairwise potentials ψ ij (β i , β j ) enforce the fact that neigh- boring cells should have similar illumination conditions. Tracking with Multiple Templates Multiple people tracking with the Hybrid Joint Separable (HJS) particle filter [1]. Extending the HJS by introducing multiple templates. We consider for each target a discrete set of Γ templates, representing the target under different illumination conditions. The set is obtained by diagonal transforms P β =D β P 0 , with D β =diag(β )= diag(β r , β g , β b ). Joint estimation of target position x t and illumination conditions β t . Tradeoff: adaptivity vs tracking failure. The tracker is able to adapt to target color changes but risks to drift towards background objects with similar appearance. Dynamical model of the HJS. We define p(β q t |β q t-1 , x q t )= p(β q t |β q t-1 )p(β q t |x q t ), where p(β q t |β q t-1 ) is modelled as a gaussian noise and p(β q t |x q t ) models the likelihood of certain illumination conditions given a location in the scene. Building up an illumination map. The prior p(β q t |x q t ) is pro- vided by the map and limits the number of candidate templates. Efficient Inference with High Order φ i () Inference with sum-product belief propagation is computationally costly, O (V Γ 2 )=O (V‘ 6 ), with V the number of nodes, Γ and respectively the label space of β and β c . The adoption of a decomposed model [2] reduces the cost to O (V‘ 3 ). A novel message passing scheme based on Taylor series reduces the cost to O (3V‘ 2 ). Results LAB. Experimental results show that tracking with our method HJS- β allows to follow targets when subject to strong illumination changes, while adopting a β -map reduces the risk of drifting. HJS-β map HJS-β HJS The quantitative results (F-measure) calculated for 6 video sequences are reported in the table. s1 s2 s3 s4 s5 s6 n o targets 1 1 2 2 3 3 HJS 0.35 0.58 0.62 0.68 0.55 0.36 HJS-β -map 0.60 0.67 0.70 0.73 0.74 0.76 PETS2007. The scenario presents strongly non-homogeneous and time-varying illumination conditions. Results show that collected information about lighting from ”easy-to-track” targets can be used to build the map and track more challenging targets. Supplementary materials of this work can be found at: http : //tev.f bk.eu/people/ricci/illumination map.html . References [1] O. Lanz Approximate bayesian multibody tracking In IEEE Trans. PAMI, 28(9):1436-1449, 2006. [2] A. Shekhovtsov, I. Kovtun, and V. Hlavac Efficient MRF deformation model for non-rigid image matching In CVPR’07. [3] G. Finlayson, M. Drew, B. Funt Diagonal transforms suffice for color constancy In ICCV, 1993.

Upload: others

Post on 21-Jul-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: TrackingMultiplePeoplewithIlluminationMapsdisi.unitn.it/~zen/data/icpr10_zen_poster.pdf · HJS- map HJS- HJS The quantitative results (F-measure) calculated for 6 video sequences

TrackingMultiplePeoplewithIlluminationMapsGloria Zen, Oswald Lanz, Stefano Messelodi, Elisa Ricci

{ g z e n, l a n z, m e s s e l o d, e l i r i c c i } @ f b k . e u

ContributionMotivation: developing a visual tracking system that is robust and adaptive to changing illumination conditions.Our Scenario: (i) non-homogeneous and non-stationary (slowly varying in time) illumination conditions, (ii) occlusions due to other targetsor neighbouring objects.Color-based tracking: cues other than color (motion, edges or speech) are not always reliable; color histograms are sensitive to illumination.Idea: learn a global illumination map describing the lighting conditions of the scene .Our approach: (i) joint estimation of the position of the target and its illumination condition within a Bayesian framework, (ii) implementthe map as a hierchical Markov Random Field (MRF) with a novel efficient inference approach.Experiments. We tested our method on two different scenarios: our laboratory (LAB) and a public dataset (PETS2007).

Modelling the illumination changesThe color change under changing illumination can be described by theso-called diagonal (or von Kries) model [3], according to which eachcolor channel c is multiplied by a single factor βc.

Creating the Illumination Map

The map is obtained with a hierarchical MRF where:

• hidden nodes correspond to cells in the room. The illuminationcoefficients β=(βr, βg, βb) are the hidden variables.

• observations Oi associated to the i-th node are histograms col-lected for different targets during a time interval ∆t. A clusteringmethod is applied to the collected histograms to reduce the effectof noisy observations.

• the likelihood φi(βi,Oi) for node i identify the β which betterexplains the observation Oi.

• The pairwise potentials ψij(βi,βj) enforce the fact that neigh-boring cells should have similar illumination conditions.

Tracking with Multiple Templates

• Multiple people tracking with the Hybrid Joint Separable (HJS)particle filter [1].

• Extending the HJS by introducing multiple templates. Weconsider for each target a discrete set of Γ templates, representingthe target under different illumination conditions. The set isobtained by diagonal transforms Pβ=DβP0, with Dβ=diag(β) =diag(βr, βg, βb).

• Joint estimation of target positionxt and illumination conditions βt.

• Tradeoff: adaptivity vs trackingfailure. The tracker is able to adaptto target color changes but risksto drift towards background objectswith similar appearance.

• Dynamical model of the HJS. We define p(βqt |βqt−1,x

qt ) =

p(βqt |βqt−1)p(βqt |x

qt ), where p(βqt |β

qt−1) is modelled as a gaussian

noise and p(βqt |xqt ) models the likelihood of certain illumination

conditions given a location in the scene.

• Building up an illumination map. The prior p(βqt |xqt ) is pro-

vided by the map and limits the number of candidate templates.

Efficient Inference with High Order φi()• Inference with sum-product belief propagation is computationally

costly, O(V Γ2)=O(V `6), with V the number of nodes, Γ and `respectively the label space of β and βc.

• The adoption of a decomposed model [2] reduces the cost toO(V `3).

• A novel message passing scheme based on Taylor series reducesthe cost to O(3V `2).

ResultsLAB. Experimental results show that tracking with our method HJS-β allows to follow targets when subject to strong illumination changes,while adopting a β-map reduces the risk of drifting.

HJS-βmap

HJS-β

HJS

The quantitative results (F-measure) calculated for 6 video sequencesare reported in the table.

s1 s2 s3 s4 s5 s6no targets 1 1 2 2 3 3

HJS 0.35 0.58 0.62 0.68 0.55 0.36HJS-β-map 0.60 0.67 0.70 0.73 0.74 0.76

PETS2007. The scenario presents strongly non-homogeneous andtime-varying illumination conditions. Results show that collectedinformation about lighting from ”easy-to-track” targets can be used tobuild the map and track more challenging targets.

Supplementary materials of this work can be found at:http : //tev.fbk.eu/people/ricci/illumination map.html.

References

[1] O. Lanz Approximate bayesian multibody tracking In IEEE Trans. PAMI,28(9):1436-1449, 2006.

[2] A. Shekhovtsov, I. Kovtun, and V. Hlavac Efficient MRF deformation modelfor non-rigid image matching In CVPR’07.

[3] G. Finlayson, M. Drew, B. Funt Diagonal transforms suffice for color constancyIn ICCV, 1993.

1