Quality-Focused Active Adversarial Policy for Safe Grasping in Human-Robot Interaction

Chenghao Li, Razvan Beuran, Nak Young Chong

Japan Advanced Institute of Science and Technology (JAIST)

Abstract

Vision-guided robot grasping methods based on Deep Neural Networks (DNNs) have achieved remarkable success in handling unknown objects, attributable to their powerful generalizability. However, these methods with this generalizability tend to recognize the human hand and its adjacent objects as graspable targets, compromising safety during Human-Robot Interaction (HRI). In this work, we propose the Quality-focused Active Adversarial Policy (QFAAP) to solve this problem. Specifically, the first part is the Adversarial Quality Patch (AQP), wherein we design the adversarial quality patch loss and leverage the grasp dataset to optimize a patch with high quality scores. Next, we construct the Projected Quality Gradient Descent (PQGD) and integrate it with the AQP, which contains only the hand region within each real-time frame, endowing the AQP with fast adaptability to the human hand shape. Through AQP and PQGD, the hand can be actively adversarial with the surrounding objects, lowering their quality scores. Therefore, further setting the quality score of the hand to zero will reduce the grasping priority of both the hand and its adjacent objects, enabling the robot to grasp other objects away from the hand without emergency stops. We conduct extensive experiments on the datasets and a cobot, showing the effectiveness of QFAAP.

Highlights

This work explores adversarial attacks from a safety-oriented perspective, leveraging their benign potential to realize safe grasping in cluttered HRI scenarios. By designing shape-adaptive, quality score–based benign adversarial examples, our method modifies the grasping sequence to prioritize objects away from the human hand and its vicinity, thereby reducing collision risks without triggering emergency stops. This study provides a new paradigm for benign adversarial attacks in real-world robotic grasping and offers practical guidance for the safe deployment of DNN-based grasping systems.
QFAAP Pipeline

The pipeline of QFAAP: AQP + hand segmentation → PQGD adaptation → quality map modification → safe grasp selection.

Results - Detection Comparison with First-Group Baselines

The first-group baselines including Original (the original grasping model) and Original-SZ (a variant of the grasping model where the quality score of the hand region is set to zero). This experiment demonstrates that our method significantly outperforms these methods in suppressing the highest quality scores (second row of the video).

Results - Detection Comparison with Second-Group Baselines

The second-group baselines including Original-DSZ (enhanced version of Original-SZ with the zeroed area dilation) and Original-Decay (enhanced version of Original-DSZ with the distance-based linear decay). Our method also shows better performance than these methods.

Results - Grasping Comparison with and without PQGD

When PQGD is not applied, AQP shows limited effectiveness due to its poor adaptability to variations in hand shape, thus hard to realize safe grasping in human-robot interaction.

Results - Grasping in Static and Dynamic Cluttered Scenarios

Our method enables safe grasping in both static cluttered (top) and dynamic cluttered (bottom) scenarios when human hands interact with the robot. Specifically, the human hand can directly act as a source of quality score interference, suppressing the grasp quality scores of nearby objects. This, in turn, alters the grasping priority and guides the robot to grasp other objects that are farther away from the human hands.

BibTeX

@article{li2025qfaap,
  title={Quality-Focused Active Adversarial Policy for Safe Grasping in Human-Robot Interaction},
  author={Chenghao Li, Razvan Beuran, and Nak Young Chong},
  journal={IEEE Transactions on Automation Science and Engineering},
  year={2025}
}