Weakly Supervised Learning from Images and Video
Abstract
Recent progress in visual recognition goes hand-in-hand with the supervised learning and large-scale training data. While the amount of existing images and videos is huge, their detailed annotation is expensive and often ambiguous. To address these problems, in this talk we will focus on weakly-supervised learning methods using incomplete and noisy supervision for training. In the first part I will discuss recognition from still images and will describe our work on weakly-supervised convolutional networks for recognizing and localizing objects and human actions. The second part of the talk will focus on the learning of human actions from videos. In particular we will consider understanding specific tasks from YouTube instruction videos and corresponding narrations. We will conclude with future challenges and opportunities of visual recognition.
Collections
- IRIM Seminar Series [121]