IEEE transactions on pattern analysis and machine intelligence

Robust Disentangled Counterfactual Learning for Physical Audiovisual Commonsense Reasoning.

Mengshi Qi, Changsheng Lv, Huadong Ma

Published: 202510.1109/TPAMI.2025.3627224

Abstract

In this paper, we propose a new Robust Disentangled Counterfactual Learning (RDCL) approach for physical audiovisual commonsense reasoning. The task aims to infer objects' physics commonsense based on both video and audio input, with the main challen…

Preview only. Read the full abstract at the source

View at DOI