工艺：有关力和互动的因果推理的基准

论文标题

工艺：有关力和互动的因果推理的基准

CRAFT: A Benchmark for Causal Reasoning About Forces and inTeractions

论文作者

Ates, Tayfun, Atesoglu, M. Samil, Yigit, Cagatay, Kesen, Ilker, Kobas, Mert, Erdem, Erkut, Erdem, Aykut, Goksun, Tilbe, Yuret, Deniz

论文摘要

人类能够感知，理解和理由有关因果事件。开发具有相似身体和因果理解能力的模型是人工智能的长期目标。作为朝着这个方向发展的一步，我们介绍了Craft，这是一个新的视频问题，回答数据集，需要有关物理力和对象相互作用的因果推理。它包含58K视频和问题对，这些视频和问题对是从20个不同虚拟环境中的10K视频产生的，其中包含各种运动中的各种对象，这些对象相互交互和场景。工艺中的两个问题类别包括先前研究的描述性和反事实问题。此外，受认知语言学中力动力学理论的启发，我们引入了一个新的因果问题类别，涉及通过原因，启用和预防概念来理解对象之间的因果关系。我们的结果表明，即使人类的工艺问题很容易，但经过测试的基线模型（包括现有的最新方法）尚未应对我们基准中带来的挑战。

Humans are able to perceive, understand and reason about causal events. Developing models with similar physical and causal understanding capabilities is a long-standing goal of artificial intelligence. As a step towards this direction, we introduce CRAFT, a new video question answering dataset that requires causal reasoning about physical forces and object interactions. It contains 58K video and question pairs that are generated from 10K videos from 20 different virtual environments, containing various objects in motion that interact with each other and the scene. Two question categories in CRAFT include previously studied descriptive and counterfactual questions. Additionally, inspired by the Force Dynamics Theory in cognitive linguistics, we introduce a new causal question category that involves understanding the causal interactions between objects through notions like cause, enable, and prevent. Our results show that even though the questions in CRAFT are easy for humans, the tested baseline models, including existing state-of-the-art methods, do not yet deal with the challenges posed in our benchmark.

下载PDF全文

下载文献需遵守相关版权规定

论文标题