Efficient Transformers in Reinforcement Learning using Actor-Learner Distillation [2021] E. Parisotto and R. Salakhutdinov
Deep Transformer Q-Networks for Partially Observable Reinforcement ...