Cognitive vision for efficient scene processign and object categorization in highly cluttered environments
Abstract
One of the key competencies required in modern
robots is finding objects in complex environments. For the last
decade, significant progress in computer vision and machine
learning literatures has increased the recognition performance
of well localized objects. However, the performance of these
techniques is still far from human performance, especially in
cluttered environments. We believe that the performance gap
between robots and humans is due in part to humans' use of an
attention system. According to cognitive psychology, the human
visual system uses two stages of visual processing to interpret
visual input. The first stage is a pre-attentive process perceiving
scenes fast and coarsely to select potentially interesting regions.
The second stage is a more complex process analyzing the
regions hypothesized in the previous stage. These two stages
play an important role in enabling efficient use of the limited
cognitive resources available. Inspired by this biological fact,
we propose a visual attentional object categorization approach
for robots that enables object recognition in real environments
under a critical time limitation. We quantitatively evaluate
the performance for recognition of objects in highly cluttered
scenes without significant loss of detection rates across several
experimental settings.