Now showing items 1-2 of 2
Encoding 3D contextual information for dynamic scene understanding
(Georgia Institute of Technology, 2020-04-27)
This thesis aims to demonstrate how using 3D cues improves semantic labeling and object classification. Specifically, we will consider depth, surface normals, object classification, and pixel-wise semantic labeling in this ...
Towards natural human-AI interactions in vision and language
(Georgia Institute of Technology, 2019-11-07)
Inter-human interaction is a rich form of communication. Human interactions typically leverage a good theory of mind, involve pragmatics, story-telling, humor, sarcasm, empathy, sympathy, etc. Recently, we have seen a ...