Now showing items 1-2 of 2
EvalAI: Evaluating AI systems at scale
(Georgia Institute of Technology, 2018-12-06)
Artificial Intelligence research has progressed tremendously in the last few years. There has been the introduction of several new multi-modal datasets and tasks due to which it is becoming much harder to compare new ...
Evaluating visual conversational agents via cooperative human-AI games
(Georgia Institute of Technology, 2019-04-26)
As AI continues to advance, human-AI teams are inevitable. However, progress in AI is routinely measured in isolation, without a human in the loop. It is crucial to benchmark progress in AI, not just in isolation, but ...