Movatterモバイル変換


[0]ホーム

URL:


Skip to main content

Advertisement

Springer Nature Link
Log in

Information-Based Learning of Deep Architectures for Feature Extraction

  • Conference paper

Part of the book series:Lecture Notes in Computer Science ((LNIP,volume 8157))

Included in the following conference series:

Abstract

Feature extraction is a crucial phase in complex computer vision systems. Mainly two different approaches have been proposed so far. A quite common solution is the design of appropriate filters and features based on image processing techniques, such as the SIFT descriptors. On the other hand, machine learning techniques can be applied, relying on their capabilities to automatically develop optimal processing schemes from a significant set of training examples. Recently, deep neural networks and convolutional neural networks have been shown to yield promising results in many computer vision tasks, such as object detection and recognition. This paper introduces a new computer vision deep architecture model for the hierarchical extraction of pixel–based features, that naturally embed scale and rotation invariances. Hence, the proposed feature extraction process combines the two mentioned approaches, by merging design criteria derived from image processing tools with a learning algorithm able to extract structured feature representations from data. In particular, the learning algorithm is based on information-theoretic principles and it is able to develop invariant features from unsupervised examples. Preliminary experimental results on image classification support this new challenging research direction, when compared with other deep architectures models.

Similar content being viewed by others

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

  1. Bengio, Y.: Learning deep architectures for ai. Foundations and Trends in Machine Learning 2(1), 1–127 (2009)

    Article MATH MathSciNet  Google Scholar 

  2. Farabet, C., Couprie, C., Najman, L., LeCun, Y.: Scene parsing with multiscale feature learning, purity trees, and optimal covers. In: Proceedings of the 29th International Conference on Machine Learning, ICML 2012, Edinburgh, Scotland, UK, June 26-July 1. icml.cc / Omni Press (2012)

    Google Scholar 

  3. Fei-Fei, L., Fergus, R., Perona, P.: Learning generative visual models from few training examples: An incremental bayesian approach tested on 101 object categories. Computer Vision and Image Understanding 106(1), 59–70 (2007)

    Article  Google Scholar 

  4. Gori, M., Melacci, S., Lippi, M., Maggini, M.: Information theoretic learning for pixel-based visual agents. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012, Part VI. LNCS, vol. 7577, pp. 864–875. Springer, Heidelberg (2012)

    Chapter  Google Scholar 

  5. Hinton, G.E., Salakhutdinov, R.R.: Reducing the Dimensionality of Data with Neural Networks. Science 313(5786), 504–507 (2006)

    Article MATH MathSciNet  Google Scholar 

  6. LeCun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document recognition. Proceedings of the IEEE 86(11), 2278–2324 (1998)

    Article  Google Scholar 

  7. Melacci, S., Gori, M.: Unsupervised learning by minimal entropy encoding. IEEE Transactions on Neural Networks and Learning Systems 23(12), 1849–1861 (2012)

    Article  Google Scholar 

  8. Riedmiller, M., Braun, H.: A direct algorithm method for faster backpropagation learning: RPROP. In: Int. Conf. on Neural Networks, pp. 586–591 (1993)

    Google Scholar 

  9. Serre, T., Wolf, L., Poggio, T.: Object recognition with features inspired by visual cortex. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2005, vol. 2, pp. 994–1000 (2005)

    Google Scholar 

Download references

Author information

Authors and Affiliations

  1. Dipartimento di Ingegneria dell’Informazione e Scienze Matematiche, Università degli Studi di Siena, Via Roma 56, I-53100, Siena, Italy

    Stefano Melacci, Marco Lippi, Marco Gori & Marco Maggini

Authors
  1. Stefano Melacci

    You can also search for this author inPubMed Google Scholar

  2. Marco Lippi

    You can also search for this author inPubMed Google Scholar

  3. Marco Gori

    You can also search for this author inPubMed Google Scholar

  4. Marco Maggini

    You can also search for this author inPubMed Google Scholar

Editor information

Editors and Affiliations

  1. Department of Applied Science, University of Naples Parthenope, Centro Direzionale Isola C4, 80133, Napoli, Italy

    Alfredo Petrosino

Rights and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Melacci, S., Lippi, M., Gori, M., Maggini, M. (2013). Information-Based Learning of Deep Architectures for Feature Extraction. In: Petrosino, A. (eds) Image Analysis and Processing – ICIAP 2013. ICIAP 2013. Lecture Notes in Computer Science, vol 8157. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-41184-7_11

Download citation

Publish with us

Societies and partnerships


[8]ページ先頭

©2009-2025 Movatter.jp