Reward misspecification and instrumental convergenceReward misspecification and instrumental convergence