Modeling Discourse Structure for Document-level Neural Machine Translation