Redundancy-aware Action Spaces
for Robot Learning

Pietro Mazzaglia*

Nicholas Backshall*

Xiao Ma

Stephen James

* equal contribution

Article Code

Abstract

Joint space and task space control are the two dominant action modes for controlling robot arms within the robot learning literature. Actions in joint space provide precise control over the robot's pose, but tend to suffer from inefficient training; actions in task space boast data-efficient training but sacrifice the ability to perform tasks in confined spaces due to limited control over the full joint configuration. This work analyses the criteria for designing action spaces for robot manipulation and introduces ER (End-effector Redundancy), a novel action space formulation that, by addressing the redundancies present in the manipulator, aims to combine the advantages of both joint and task spaces, offering fine-grained comprehensive control with overactuated robot arms whilst achieving highly efficient robot learning. We present two implementations of ER, ERAngle (ERA) and ERJoint (ERJ), and we show that ERJ in particular demonstrates superior performance across multiple settings, especially when precise control over the robot configuration is required. We validate our results both in simulated and real robotic environments.


Reinforcement Learning in RLBench (full-body tasks)


ERJ policies shown solving the tasks in the RLBench simulation environments. The videos show the agent's inputs cameras (third-view and wrist) in the resolution given to the agent (84x84).



Imitation Learning in Real-world

Retrieve Bear

The agent operating in task space struggles to retrieve the bear from the shelf, as it keeps its elbow straight, failing the task. However, the agent operating in ERJ space is able to bend its elbow and successfully retrieve the bear.

Push Button Elbow

The agent operating in task space is unable to push the button, as it is unable to control the free motion of the elbow. However, the agent operating in ERJ space is able to control the elbow and successfully push the button.

Grasp and Remove Cup

The agent operating in task space is unable to enter the cabinet and grasp the cup, as it is unable to control the free motion of the elbow. However, the agent operating in ERJ space is able to control the elbow and successfully grasp and remove the cup.