Structure of RGRL. (IMAGE) Beijing Zhongke Journal Publising Co. Ltd. Caption Deepreinforcement learning has disadvantages such as low sample utilization and slow convergence, and thousandsof trial-and-error iterations are required to perform reinforcement learning in realistic scenarios, which iscostly. To alleviate this problem, RGRL first simulates a robot grasping an object from a user in a simulatedscene in which tens of thousands of learning sessions are performed. Domain randomization is used to narrowthe gap between the simulated and real scenes, and a multi-objective reward function is used to effectivelyaccelerate the convergence of the reinforcementlearning algorithm. Credit Beijing Zhongke Journal Publising Co. Ltd. Usage Restrictions Credit must be given to the creator. License CC BY Disclaimer: AAAS and EurekAlert! are not responsible for the accuracy of news releases posted to EurekAlert! by contributing institutions or for the use of any information through the EurekAlert system.