← 返回日期列表

论文列表 2026-03-25

共 20 篇论文

1
游戏问答:面向三维虚拟代理决策密集型视点同步多视频理解的基准测试框架 GameplayQA: A Benchmarking Framework for Decision-Dense POV-Synced Multi-Video Understanding of 3D Virtual Agents
Yunzhe Wang, Runhui Xu, Kexin Zheng 等7人
2
亲爱的,我把科学家变小了——评估用于显微镜下样本导航的2D、3D及VR界面 Honey, I shrunk the scientist -- Evaluating 2D, 3D, and VR interfaces for navigating samples under the microscope
Jan Tiemann, Matthew McGinity, Ulrik Günther
3
持续机器人学习中“自我”涌现的证据 Evidence of an Emergent "Self" in Continual Robot Learning
Adidev Jhunjhunwala, Judah Goldfeder, Hod Lipson
4
由电液驱动器驱动的无传感器、固有柔顺仿生肌肉骨骼手 A Sensorless, Inherently Compliant Anthropomorphic Musculoskeletal Hand Driven by Electrohydraulic Actuators
Misato Sonoda, Ronan Hinchet, Amirhossein Kazemipour 等5人
5
LATS:面向交通信号控制的多智能体强化学习中的大语言模型辅助师生框架 LATS: Large Language Model Assisted Teacher-Student Framework for Multi-Agent Reinforcement Learning in Traffic Signal Control
Yifeng Zhang, Peizhuo Li, Tingguang Zhou 等5人
6
CoordLight:学习去中心化协调以实现网络级交通信号控制 CoordLight: Learning Decentralized Coordination for Network-Wide Traffic Signal Control
Yifeng Zhang, Harsh Goel, Peizhuo Li 等6人
7
VLA三维融合模块:将基于VGGT的三维信息集成至视觉-语言-动作模型的即插即用方案 3D-Mix for VLA: A Plug-and-Play Module for Integrating VGGT-based 3D Information into Vision-Language-Action Models
Bin Yu, Shijie Lian, Xiaopeng Lin 等11人
8
提升无人机灯光秀表演效果:集群无人机编队的最优分配与轨迹规划 Enhancing Drone Light Shows Performances: Optimal Allocation and Trajectories for Swarm Drone Formations
Yunes Alqudsi
9
师生扩散模型:文本驱动的三维手部动作生成 Teacher-Student Diffusion Model for Text-Driven 3D Hand Motion Generation
Ching-Lam Cheng, Bin Zhu, Shengfeng He
10
大型智能体匿名多智能体路径规划中的约束松弛 Relaxing Constraints in Anonymous Multi Agent Path Finding for Large Agents
Stepan Dergachev, Dmitry Avdeev
11
用于腔内介入的微型纤维增强软体弯曲执行器的设计、建模与表征 Design, Modelling and Characterisation of a Miniature Fibre-Reinforced Soft Bending Actuator for Endoluminal Interventions
Xiangyi Tan, Aoife McDonald-Bowyer, Danail Stoyanov 等4人
12
揭示缺陷工程4H$_\mathrm{b}$-TaS$_2$中的电荷转移机制 Revealing Charge Transfer in Defect-Engineered 4H$_\mathrm{b}$-TaS$_2$
Siavash Karbasizadeh, Wooin Yang, Wonhee Ko 等7人
13
迈向基于安全学习的非线性模型预测控制:通过递归神经网络建模实现 Towards Safe Learning-Based Non-Linear Model Predictive Control through Recurrent Neural Network Modeling
Mihaela-Larisa Clement, Mónika Farsang, Agnes Poks 等7人
14
迈向无需训练的场景文本编辑 Towards Training-Free Scene Text Editing
Yubo Li, Xugong Qin, Peng Zhang 等6人
15
变色龙:面向长时程机器人操作的片段记忆系统 Chameleon: Episodic Memory for Long-Horizon Robotic Manipulation
Xinying Guo, Chenxi Jiang, Hyun Bin Kim 等7人
16
EndoVGGT:基于图神经网络增强的手术三维重建深度估计 EndoVGGT: GNN-Enhanced Depth Estimation for Surgical 3D Reconstruction
Falong Fan, Yi Xie, Arnis Lektauers 等5人
17
潜在世界行动建模:面向端到端自动驾驶的潜在世界行动建模 Latent-WAM: Latent World Action Modeling for End-to-End Autonomous Driving
Linbo Wang, Yupeng Zheng, Qiang Chen 等16人
18
标签:视觉-语言-动作模型中稳定目标中心推理的无目标引导 TAG: Target-Agnostic Guidance for Stable Object-Centric Inference in Vision-Language-Action Models
Jiaying Zhou, Zhihao Zhan, Ruifeng Zhai 等8人
19
DreamerAD:基于潜在世界模型的高效自动驾驶强化学习 DreamerAD: Efficient Reinforcement Learning via Latent World Model for Autonomous Driving
Pengxuan Yang, Yupeng Zheng, Deheng Qian 等14人
20
YingMusic-Singer:具备灵活歌词操控与无标注旋律引导的可控歌唱语音合成系统 YingMusic-Singer: Controllable Singing Voice Synthesis with Flexible Lyric Manipulation and Annotation-free Melody Guidance
Chunbo Hao, Junjie Zheng, Guobin Ma 等9人