Self-attention By JamesSand Pytorch version self-attention can be found here Tensorflow versino self-attention can be found here