ChartMuseum is a chart question answering benchmark designed to evaluate reasoning capabilities of large vision-language models (LVLMs) over real-world chart images. The benchmark consists of 1162 ...
Abstract: Transparent object manipulation has long posed a significant challenge in robotic grasping tasks. Existing methods for transparent object grasping rely heavily on visual sensors, aiming to ...
Vibe coding allows manufacturing personnel to create software using everyday speech instead of traditional programming, enabling production managers to simply say "build a monitoring dashboard for ...
Abstract: Object-relative mobile robot navigation is essential for a variety of tasks, e.g. autonomous critical infrastructure inspection, but requires the capability to extract semantic information ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results