Token-level Adaptive Training for Neural Machine Translation