Abstract
Maintaining stable tracks on multiple targets in video over extended time periods and wide areas remains a challenging problem. Basic trackers like the Kalman filter or particle filter deteriorate in performance as the complexity of the scene increases. A few methods have recently shown encouraging results in these application domains. They rely on learning context models, on the availability of training data, or on modeling the inter-relationships between the tracks. In this chapter, we provide an overview of research in the area of long-term tracking in video. We review some of the methods in the literature and analyze the common sources of error that cause trackers to fail. We also discuss the limits of tracker performance as multiple objects come together to form groups and crowds. We compare the performance of some of the methods on multiple real-life video sequences obtained from a single camera as well as from a camera network.
Acknowledgements
This work was supported in part by NSF grant IIS-0712253 and by a subcontract from Mayachitra Inc. through a DARPA STTR award (#W31P4Q-08-C-0464).
Copyright information
© 2011 Springer-Verlag London Limited
About this chapter
Cite this chapter
Song, B., Sethi, R.J., Roy-Chowdhury, A.K. (2011). Wide Area Tracking in Single and Multiple Views. In: Moeslund, T., Hilton, A., Krüger, V., Sigal, L. (eds) Visual Analysis of Humans. Springer, London. https://doi.org/10.1007/978-0-85729-997-0_6
DOI: https://doi.org/10.1007/978-0-85729-997-0_6
Publisher Name: Springer, London
Print ISBN: 978-0-85729-996-3
Online ISBN: 978-0-85729-997-0
eBook Packages: Computer Science (R0)