Attention is about adding weights to words in a sequence on the basis of its importance to other words