We demonstrate that the qubit-routing problem has a natural interpretation as a reinforcement learning problem. The results show state-of-the-art performance when qubit routing is treated as an abstracted problem, and suggest that reinforcement learning may yield further gains in backend optimisation more generally.
