Movement Pattern Histogram for Action Recognition and Retrieval
(Georgia Institute of Technology, 2014)
We present a novel action representation based on encoding the global
temporal movement of an action. We represent an action as a set of movement
pattern histograms that encode the global temporal dynamics of an action. ...
Joint Semantic Segmentation and 3D Reconstruction from Monocular Video
(Georgia Institute of Technology, 2014-09)
We present an approach for joint inference of 3D scene structure and semantic labeling for monocular video. Starting with a monocular
image stream, our framework produces a 3D volumetric semantic + occupancy map, which is ...
Categorizing Turn-Taking Interactions
(Georgia Institute of Technology, 2012-10)
We address the problem of categorizing turn-taking interactions between individuals. Social interactions are characterized by turn-taking and arise
frequently in real-world videos. Our approach is based on the use of ...
Weakly Supervised Learning of Object Segmentations from Web-Scale Video
(Georgia Institute of Technology, 2012-10)
We propose to learn pixel-level segmentations of objects from
weakly labeled (tagged) internet videos. Specifically, given a large collection of raw YouTube content, along with potentially noisy tags, our
goal is to ...
CENTRIST: A Visual Descriptor for Scene Categorization
(Georgia Institute of Technology, 2011-08)
CENTRIST (CENsus TRansform hISTogram), a new visual descriptor for recognizing topological places or scene
categories, is introduced in this paper. We show that place and scene recognition, especially for indoor environments, ...
Visual Place Categorization: Problem, Dataset, and Algorithm
(Georgia Institute of Technology, 2009-10)
In this paper we describe the problem of Visual
Place Categorization (VPC) for mobile robotics, which
involves predicting the semantic category of a place from image
measurements acquired from an autonomous platform. ...
Learning to Recognize Daily Actions using Gaze
(Georgia Institute of Technology, 2012-10)
We present a probabilistic generative model for simultaneously recognizing daily actions and predicting gaze locations in videos
recorded from an egocentric camera. We focus on activities requiring
eye-hand coordination ...
Computerized Macular Pathology Diagnosis in Spectral Domain Optical Coherence Tomography Scans Based on Multiscale Texture and Shape Features
(Georgia Institute of Technology, 2011-10)
To develop an automated method to identify the
normal macula and three macular pathologies (macular hole
[MH], macular edema [ME], and age-related macular degeneration
[AMD]) from the fovea-centered cross sections in ...
Haptic Classification and Recognition of Objects Using a Tactile Sensing Forearm
(Georgia Institute of Technology, 2012-10)
In this paper, we demonstrate data-driven inference of mechanical properties of objects using a tactile sensor
array (skin) covering a robot’s forearm. We focus on the
mobility (sliding vs. fixed), compliance (soft vs. ...
Decoupling Behavior, Perception, and Control for Autonomous Learning of Affordances
(Georgia Institute of Technology, 2013-05)
A novel behavior representation is introduced that
permits a robot to systematically explore the best methods by
which to successfully execute an affordance-based behavior for
a particular object. The approach decomposes ...