MindPower: Enabling Theory-of-Mind Reasoning in VLM-based Embodied Agents

MindPower Reasoning tion. To address this, we propose MindPower, a Robot-Centric framework integrating Perception, Mental Reasoning, Decision Making and Action. Given multimodal inputs, MindPower firs

MindPower: Enabling Theory-of-Mind Reasoning in VLM-based Embodied Agents

MindPower Reasoning tion. To address this, we propose MindPower, a Robot-Centric framework integrating Perception, Mental Reasoning, Decision Making and Action. Given multimodal inputs, MindPower first perceives the environment and human states, then performs ToM Reasoning to model both self and others, and finally generates decisions and actions guided by inferred mental states. Furthermore, we introduce Mind-Reward, a novel optimization objective that encourages


📜 Original Paper Content

🚀 Synchronizing high-quality layout from 1TB storage...