Search
Now showing items 1-10 of 21
Explaining model decisions and fixing them via human feedback
(Georgia Institute of Technology, 2020-05-07)
Deep networks have enabled unprecedented breakthroughs in a variety of computer vision tasks. While these models enable superior performance, their increasing complexity and lack of decomposability into individually intuitive ...
Reconciling data privacy and utility in the era of big data
(Georgia Institute of Technology, 2019-04-08)
The widespread use of internet-connected mobile devices, internet of things(IoT) and cloud computing has enabled a large scale collection of personal data, including user profiles, daily activities, locations, photos and ...
Visual question answering and beyond
(Georgia Institute of Technology, 2019-09-03)
In this dissertation, I propose and study a multi-modal Artificial Intelligence (AI) task called Visual Question Answering (VQA) -- given an image and a natural language question about the image (e.g., "What kind of store ...
Real time detection of traffic signs on mobile device
(Georgia Institute of Technology, 2019-12-09)
In this work we propose a new approach to the object detection problem using Deep Neural Network, in the context of traffic sign detection. Our approach simplifies the detection head complexity by making the requirement ...
Domain adaptation via data augmentation
(Georgia Institute of Technology, 2020-04-28)
Deep learning (DL) models require large labeled datasets for training. Practitioners often need to adapt an existing DL model to a different domain. For instance, a practitioner in a company developing autonomous vehicles ...
EvalAI: Evaluating AI systems at scale
(Georgia Institute of Technology, 2018-12-06)
Artificial Intelligence research has progressed tremendously in the last few years. There has been the introduction of several new multi-modal datasets and tasks due to which it is becoming much harder to compare new ...
Manipulating state space distributions for sample-efficient imitation-learning
(Georgia Institute of Technology, 2020-03-16)
Imitation learning has emerged as one of the most effective approaches to train agents to act intelligently in unstructured and unknown domains. On its own or in combination with reinforcement learning, it enables agents ...
Learning neural algorithms with graph structures
(Georgia Institute of Technology, 2020-01-13)
Graph structures, like syntax trees, social networks, and programs, are ubiquitous in many real world applications including knowledge graph inference, chemistry and social network analysis. Over the past several decades, ...
Visually grounded language understanding and generation
(Georgia Institute of Technology, 2020-01-13)
The world around us involves multiple modalities -- we see objects, feel texture, hear sounds, smell odors and so on. In order for Artificial Intelligence (AI) to make progress in understanding the world around us, it needs ...
Towards tighter integration of machine learning and discrete optimization
(Georgia Institute of Technology, 2019-03-28)
Discrete Optimization algorithms underlie intelligent decision-making in a wide variety of domains. From airline fleet scheduling to data center resource management and matching in ride-sharing services, decisions are often ...