Language Pre-Training and Auxiliary Tasks for Vision and Language Navigation