Statistics of natural scene structures and scene categorization

Chen, Xin; Wan, Weibing; Yong, Zhiyong

doi:10.1186/1471-2202-13-S1-P7

Volume 13 Supplement 1

Twenty First Annual Computational Neuroscience Meeting: CNS*2012

Poster presentation
Open access
Published: 16 July 2012

Statistics of natural scene structures and scene categorization

Xin Chen¹,
Weibing Wan¹ &
Zhiyong Yong^1,2,3

BMC Neuroscience volume 13, Article number: P7 (2012) Cite this article

1447 Accesses
2 Citations
Metrics details

Humans can grasp the gist of complex natural scenes very quickly and can remember extraordinarily rich details in thousands of scenes viewed for a very brief period [1, 2]. This amazing ability of rapid scene perception challenges both the traditional view of image-based, bottom-up visual processing [3] and recent models of scene categorization based on global visual features and features at low spatial frequency [4]. Low-level visual features such as edges, junctions, and various image gradients are insufficient for revealing the content of complex natural scenes. On the other hand, global visual features and features at low spatial frequency cannot encode the extraordinarily rich spatial concatenations of visual features in natural scenes.

We proposed natural scene structures, i.e., multi-size, multi-scale, spatial concatenations of visual features, as the basic encoding units of natural scenes and scene categories. Natural scene structures convey various amount of information about scene identities and categories since general structures are shared by more scenes while specific structures are shared by only a few scenes. Thus, any natural scene and category can be represented by a probability distribution based on a set of natural scene structures and their spatial concatenations. These structural representations are robust against variations due to noises, occlusions, changes in scales, and other factors and require no isolation of objects or figure-background segmentation, nor computation of global scene features.

To test this model of natural scenes, we compiled a large set of natural scene structures from a database of natural scenes and examined the information conveyed by the natural scene structures about natural scene categories. We then selected a set of natural scene structures with high information content, organized them into clusters, and developed a probabilistic model on the clusters of selected scene structures for each scene category. Finally, we categorized natural scenes by performing Bayesian inference based on these probability distributions. We found that the model achieved a high performance of categorization on a large dataset of natural scene categorizes. We also tested this model of natural scenes using human psychophysics. We constructed experimental stimuli that consisted of only the selected natural scene structures and asked human subjects to perform scene categorization. We either maintained or shuffled the spatial locations of the natural scene structures in the experimental stimuli. We found that the subject performance was significantly above chance even when the selected scene structures covered only a small portion of the scenes. Furthermore, shuffling the spatial locations of the scene structures significantly reduced the subject performance. These results support our statistical model of natural scenes using natural scene structures as encoding units.

References

Li FF, VanRullen R, Koch C, Perona P: Rapid natural scene categorization in the near absence of attention. Proc Natl Acad Sci USA. 2002, 99: 9596-9601. 10.1073/pnas.092277599.
Article PubMed Central CAS PubMed Google Scholar
Brady TF, Konkle T, Alvarez GA, Oliva A: Visual long-term memory has a massive storage capacity for object details. Proc Natl Acad Sci USA. 2008, 105: 14325-14329. 10.1073/pnas.0803390105.
Article PubMed Central CAS PubMed Google Scholar
Serre T, Oliva A, Poggio T: A feedforward architecture accounts for rapid categorization. Proc Natl Acad Sci USA. 2007, 104: 6424-6429. 10.1073/pnas.0700622104.
Article PubMed Central CAS PubMed Google Scholar
Greene MR, Oliva A: Recognition of natural scenes from global properties: Seeing the forest without representing the trees. Cognitive Psychology. 2009, 58: 137-176. 10.1016/j.cogpsych.2008.06.001.
Article PubMed Central PubMed Google Scholar

Download references

Acknowledgements

This material is based upon work supported by, or in part by, the U. S. Army Research Laboratory and the U. S. Army Research Office under contracts/grant numbers W911NF-11-1-0105 (Dr. Chen) and W911NF-10-1-0303 (Dr. Yang) and. This work was supported by a VDI/GHSU pilot award and the Knights Templar Education Foundation.

Author information

Authors and Affiliations

Brain and Behavior Discovery Institute, USA
Xin Chen, Weibing Wan & Zhiyong Yong
Department of Ophthalmology, USA
Zhiyong Yong
Vision Discovery Institute, Georgia Health Sciences University, Augusta, Georgia, 30912, USA
Zhiyong Yong

Authors

Xin Chen
View author publications
You can also search for this author in PubMed Google Scholar
Weibing Wan
View author publications
You can also search for this author in PubMed Google Scholar
Zhiyong Yong
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Chen, X., Wan, W. & Yong, Z. Statistics of natural scene structures and scene categorization. BMC Neurosci 13 (Suppl 1), P7 (2012). https://doi.org/10.1186/1471-2202-13-S1-P7

Download citation

Published: 16 July 2012
DOI: https://doi.org/10.1186/1471-2202-13-S1-P7

Twenty First Annual Computational Neuroscience Meeting: CNS*2012

Statistics of natural scene structures and scene categorization

References

Acknowledgements

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Keywords

BMC Neuroscience

Contact us

Twenty First Annual Computational Neuroscience Meeting: CNS*2012

Statistics of natural scene structures and scene categorization

References

Acknowledgements

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Share this article

Keywords

BMC Neuroscience

Contact us