Model of visual attention for video sequences

Milanova, Mariofanna

doi:10.1186/1471-2105-9-S7-P14

Volume 9 Supplement 7

UT-ORNL-KBRIN Bioinformatics Summit 2008

Poster presentation
Open access
Published: 08 July 2008

BMC Bioinformatics volume 9, Article number: P14 (2008)

Background

In our previous work [1] we generalized Olshausen's algorithm [2] and designed a perceptual learning model using video sequences. In this study we propose to model conjunction search. Conjunction search is the search for a unique combination of two features (e.g., orientation and spatial frequency) among distractors that each share only one of these features; it examines how the visual system combines features into perceptual wholes. We propose to improve the effectiveness of the decomposition algorithm by providing classification awareness. Attentional guidance does not depend solely on local visual features, but must also include the effects of interactions among features. The idea is to group together filters that are responsible for extracting similar features. It is well known that knowledge about which features define the target improves search performance and/or accuracy [3]. The nearest neighbors of the fixations will share a certain feature.
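To make the definition of conjunction search concrete, the following minimal sketch builds a toy display and shows that neither feature alone isolates the target, while the combination does. The feature values, the function names `make_display` and `matches`, and the display size are assumptions chosen for illustration; they are not taken from the poster.

```python
from itertools import product

# Hypothetical feature values, chosen only for illustration.
ORIENTATIONS = (45, 135)        # degrees
FREQUENCIES = ("low", "high")   # spatial frequency

def make_display(target, n_per_distractor=3):
    """Build a display as a list of (orientation, frequency) items.

    Distractors are the items that share exactly one feature with the target.
    """
    distractors = [
        item for item in product(ORIENTATIONS, FREQUENCIES)
        if sum(a == b for a, b in zip(item, target)) == 1
    ]
    return [target] + distractors * n_per_distractor

def matches(display, orientation=None, frequency=None):
    """Return the items that match every feature that is specified."""
    return [
        (o, f) for (o, f) in display
        if (orientation is None or o == orientation)
        and (frequency is None or f == frequency)
    ]

target = (45, "high")
display = make_display(target)

# Filtering on a single feature still leaves distractors in the candidate set.
by_orientation = matches(display, orientation=45)
by_frequency = matches(display, frequency="high")
# Only the conjunction of both features isolates the target.
by_conjunction = matches(display, orientation=45, frequency="high")
```

In this toy display a single-feature filter cannot separate the target from the distractors that share that feature, which is why a model of conjunction search has to account for interactions among features.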

Methods

The main goal of this work is to use sequences of images to design an attention model that includes conjunction search and is based on unsupervised self-learning. First, an Independent Component Analysis (ICA) algorithm is used to determine an initial set of basis functions from the first image (Figure 1). Second, unsupervised self-organizing learning is used to group together similar basis functions (Figure 2).
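The grouping step can be illustrated with a small self-organizing map. This is a sketch under stated assumptions, not the implementation used in this work: the poster does not specify the map variant, so a Kohonen-style 1-D chain is assumed; the ICA step is not shown; and each "basis function" is replaced by a hypothetical two-number descriptor of its orientation. All function names and parameter values are illustrative.

```python
import math

def orientation_descriptor(theta_deg):
    """Encode an orientation (period 180 degrees) as a point on the unit circle."""
    t = math.radians(2 * theta_deg)
    return [math.cos(t), math.sin(t)]

def sq_dist(a, b):
    """Squared Euclidean distance between two vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def best_unit(weights, v):
    """Index of the map node whose weight vector is closest to v."""
    return min(range(len(weights)), key=lambda i: sq_dist(weights[i], v))

def train_som(data, n_nodes=2, epochs=30, lr0=0.5, sigma0=1.0):
    """Train a 1-D self-organizing map (a chain of n_nodes >= 2 units)."""
    # Deterministic initialisation: copy evenly spaced samples from the data.
    step = (len(data) - 1) / (n_nodes - 1)
    weights = [list(data[round(i * step)]) for i in range(n_nodes)]
    for epoch in range(epochs):
        frac = epoch / epochs
        lr = lr0 * (1.0 - frac)                   # learning rate decays linearly
        sigma = max(0.1, sigma0 * (1.0 - frac))   # neighbourhood width shrinks
        for v in data:
            bmu = best_unit(weights, v)
            for i, w in enumerate(weights):
                # Gaussian neighbourhood on the chain: nearby nodes learn too.
                h = math.exp(-((i - bmu) ** 2) / (2.0 * sigma ** 2))
                for k in range(len(w)):
                    w[k] += lr * h * (v[k] - w[k])
    return weights

# Hypothetical stand-ins for learned basis functions, summarised by orientation only.
orientations = [0, 5, 10, 85, 90, 95]   # degrees
descriptors = [orientation_descriptor(t) for t in orientations]

som = train_som(descriptors)
groups = [best_unit(som, d) for d in descriptors]
```

Here `groups` holds one map node index per descriptor, so the near-horizontal and near-vertical items end up on different nodes. With real basis functions the descriptors would be replaced by the flattened basis vectors themselves, and a larger map would let neighbouring nodes hold similar functions.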

Conclusion

It is shown that learning sparse codes from video sequences of natural scenes produces basis functions whose spatio-temporal properties are qualitatively similar to those of the receptive fields of simple-cell neurons. The basis functions are similar to those obtained by sparse learning, but in our model they have a particular order (Figure 3). The proposed framework was tested using neurobiological (event-related potentials, ERPs) and behavioral (eye tracking) data.

References

1. Milanova M, Wachowiak M, Rubin S, Elmaghraby A: A perceptual learning model based on topological representation, neural networks. Proceedings, IJCNN'01 International Conference 2001, 406–411.
2. Olshausen B: Sparse codes and spikes. In Probabilistic Models of Perception and Brain Function. Edited by: Rao RPN, Olshausen BA, Lewicki MS. MIT Press; 2001.
3. Rutishauser U, Koch C: Probabilistic modeling of eye movement data during conjunction search via feature-based attention. Journal of Vision 2007, 7(6):5, 1–20. doi:10.1167/7.6.5

Acknowledgements

The project described was supported by NIH Grant Number P20 RR-16460 from the IDeA Networks of Biomedical Research Excellence (INBRE) Program of the National Center for Research Resources.

Author information

Authors and Affiliations

Department of Computer Science, University of Arkansas at Little Rock, Little Rock, AR, 72204, USA
Mariofanna Milanova

Corresponding author

Correspondence to Mariofanna Milanova.

Rights and permissions

Open Access This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

About this article

Cite this article

Milanova, M. Model of visual attention for video sequences. BMC Bioinformatics 9 (Suppl 7), P14 (2008). https://doi.org/10.1186/1471-2105-9-S7-P14

