Source: https://research.fb.com/wp-content/uploads/2018/11/Classical-Structured-Prediction-Losses-for-Sequence-to-Sequence-Learning.pdf