Extended Co-occurrence HOG with Dense Trajectories for Fine-Grained Activity Recognition

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 9007)

Included in the following conference series:

Asian Conference on Computer Vision

Abstract

In this paper we propose a novel feature descriptor, Extended Co-occurrence HOG (ECoHOG), and integrate it with dense point trajectories, demonstrating its usefulness in fine-grained activity recognition. The feature is inspired by the original Co-occurrence HOG (CoHOG), which is based on histograms of occurrences of pairs of image gradients in the image. Instead of relying only on pure histograms, we introduce a sum of gradient magnitudes of co-occurring pairs of image gradients in the image. This gives importance to object boundaries and strengthens the difference between the moving foreground and the static background. We also couple ECoHOG with dense point trajectories extracted using optical flow from video sequences and demonstrate that they are extremely well suited for fine-grained activity recognition. Using our feature, we outperform state-of-the-art methods in this task and provide an extensive quantitative evaluation.
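
The difference between counting co-occurring gradient pairs and summing their magnitudes can be illustrated with a minimal pure-Python sketch. It is not the authors' implementation: the function names (`gradients`, `cooccurrence`), the central-difference gradient, the single pixel offset, the 8 orientation bins, and the choice to add the two magnitudes of each pair are all assumptions made for illustration, based only on the wording of the abstract.

```python
import math


def gradients(img):
    """Central-difference gradients of a 2D list; returns (magnitude, orientation)."""
    h, w = len(img), len(img[0])
    mag = [[0.0] * w for _ in range(h)]
    ori = [[0.0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            # Clamp indices at the border so every pixel gets a gradient.
            gx = img[y][min(x + 1, w - 1)] - img[y][max(x - 1, 0)]
            gy = img[min(y + 1, h - 1)][x] - img[max(y - 1, 0)][x]
            mag[y][x] = math.hypot(gx, gy)
            ori[y][x] = math.atan2(gy, gx) % (2 * math.pi)
    return mag, ori


def cooccurrence(img, offset=(1, 0), bins=8, weighted=True):
    """Build a bins x bins co-occurrence matrix for one pixel offset (dx, dy).

    weighted=False: each orientation pair adds 1 (CoHOG-style count).
    weighted=True:  each pair adds the sum of its two gradient magnitudes
                    (assumed ECoHOG-style weighting, per the abstract).
    """
    mag, ori = gradients(img)
    h, w = len(img), len(img[0])
    dx, dy = offset
    hist = [[0.0] * bins for _ in range(bins)]
    for y in range(h):
        for x in range(w):
            y2, x2 = y + dy, x + dx
            if not (0 <= y2 < h and 0 <= x2 < w):
                continue  # partner pixel falls outside the image
            # Quantize both orientations into one of `bins` sectors.
            i = int(ori[y][x] / (2 * math.pi) * bins) % bins
            j = int(ori[y2][x2] / (2 * math.pi) * bins) % bins
            hist[i][j] += (mag[y][x] + mag[y2][x2]) if weighted else 1.0
    return hist


# Usage: a flat patch (static background) versus a vertical step edge.
flat = [[5] * 4 for _ in range(4)]
edge = [[0, 0, 10, 10] for _ in range(4)]
for name, patch in (("flat", flat), ("edge", edge)):
    counts = sum(map(sum, cooccurrence(patch, weighted=False)))
    weights = sum(map(sum, cooccurrence(patch, weighted=True)))
    print(name, counts, weights)
```

In this toy setting, the count-based matrix gives the flat patch and the edge patch the same total mass, because every pixel pair is counted once regardless of gradient strength. The magnitude-weighted matrix is zero on the flat patch and large on the edge, which is the boundary emphasis the abstract describes.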

Author information

Authors and Affiliations

The University of Tokyo, Tokyo, Japan
Hirokatsu Kataoka
Keio University, Minato, Japan
Hirokatsu Kataoka, Kiyoshi Hashimoto & Yoshimitsu Aoki
National Institute of Advanced Industrial Science and Technology (AIST), Tsukuba, Japan
Kenji Iwata & Yutaka Satoh
Technische Universität München (TUM), Munich, Germany
Nassir Navab & Slobodan Ilic

Corresponding author

Correspondence to Hirokatsu Kataoka.

Editor information

Editors and Affiliations

Technische Universität München, Garching, Germany
Daniel Cremers
University of Adelaide, Adelaide, South Australia, Australia
Ian Reid
Keio University, Yokohama, Kanagawa, Japan
Hideo Saito
University of California at Merced, Merced, California, USA
Ming-Hsuan Yang

About this paper

Cite this paper

Kataoka, H. et al. (2015). Extended Co-occurrence HOG with Dense Trajectories for Fine-Grained Activity Recognition. In: Cremers, D., Reid, I., Saito, H., Yang, MH. (eds) Computer Vision -- ACCV 2014. ACCV 2014. Lecture Notes in Computer Science, vol 9007. Springer, Cham. https://doi.org/10.1007/978-3-319-16814-2_22

DOI: https://doi.org/10.1007/978-3-319-16814-2_22
Published: 17 April 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-16813-5
Online ISBN: 978-3-319-16814-2
eBook Packages: Computer Science, Computer Science (R0)
