1: The neural network(https://www.youtube.com/watch?v=gJ9kaJsE78k) The network before the softmax, add the mask https://www.youtube.com/watch?v=gJ9kaJsE78k