2606.11637v1 Jun 10, 2026 cs.AI

TouchThinker: Scaling Tactile Commonsense Reasoning to the Open World with Large-scale Data and Action-aware Representation

Shuicheng Yan
Shuicheng Yan
Citations: 113
h-index: 6
Di Wu
Di Wu
Citations: 75
h-index: 5
Xiaobin Hu
Xiaobin Hu
Citations: 32
h-index: 3
Kailin Lyu
Kailin Lyu
Citations: 18
h-index: 2
Kangyi Wu
Kangyi Wu
Citations: 64
h-index: 4
Pengna Li
Pengna Li
Citations: 81
h-index: 6
Pengwei Zhang
Pengwei Zhang
Citations: 9
h-index: 2
Yuhang Zheng
Yuhang Zheng
Citations: 439
h-index: 9
Ying Lai
Ying Lai
Citations: 3
h-index: 1
Long Xiao
Long Xiao
Citations: 7
h-index: 2
Chen Gao
Chen Gao
Citations: 68
h-index: 3
Lianyu Hu
Lianyu Hu
Citations: 621
h-index: 12
Jie Hao
Jie Hao
Citations: 2
h-index: 1
Ce Hao
Ce Hao
Citations: 85
h-index: 5
Weihao Yuan
Weihao Yuan
Citations: 1,140
h-index: 15

Touch is a key modality for embodied agents to understand the physical world. Although recent work has incorporated tactile signals into language systems for tactile commonsense reasoning, scaling such systems to realistic open-world settings remains challenging due to two key bottlenecks: (1) current tactile reasoning datasets remain limited in format and scale, providing insufficient supervision for reasoning from tactile observations to physical commonsense and hindering the learning of transferable tactile commonsense; (2) Tactile signals are inherently redundant and action-specific, yet existing methods often overlook these properties, resulting in inefficient representations with limited semantic expressiveness. To address these limitations, we propose TouchThinker, a tactile-language framework that scales tactile commonsense reasoning to the open world from both data and representation perspectives. First, we construct TouchThinker-1M, a million-scale, multi-source tactile reasoning dataset covering \textbf{415} objects, \textbf{8} scenarios, and \textbf{7} sensor types, providing a solid data foundation for open-world generalization. We further introduce TouchThinker-Bench, an open-world benchmark with more realistic and diverse tasks. Then, we propose action-aware modeling mechanism to improve tactile representation efficiency and enable efficient reasoning. Experimental results demonstrate that TouchThinker achieves competitive performance against state-of-the-art models across multiple datasets. Our code and dataset will be made available at: https://github.com/lvkailin0118/TouchThinker.

0 Citations
0 Influential
39.92453324894 Altmetric
199.6 Score
Original PDF
11

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

Log in to request an AI analysis.

댓글

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!