These use a structured attention mechanism whose output is a two-dimensional matrix, which lets it capture diverse aspects of the input sequence. 27.07.2023 17:54 aior
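As a rough illustration of what a two-dimensional attention matrix can look like, here is a minimal NumPy sketch. It assumes one common formulation, A = softmax(W2 · tanh(W1 · Hᵀ)), which the post itself does not spell out, so the function name `structured_attention`, the weight names `W1` and `W2`, and all dimensions are hypothetical choices made for the example.

```python
import numpy as np

def structured_attention(H, W1, W2):
    """Sketch of a multi-row attention layer (assumed formulation, not from the post).

    H  : (n, d)  hidden states for a sequence of n tokens
    W1 : (da, d) first projection (hypothetical name)
    W2 : (r, da) second projection; r = number of attention rows
    Returns A with shape (r, n) and M with shape (r, d).
    """
    scores = W2 @ np.tanh(W1 @ H.T)                      # (r, n) unnormalized scores
    scores = scores - scores.max(axis=1, keepdims=True)  # stabilize the softmax
    A = np.exp(scores)
    A = A / A.sum(axis=1, keepdims=True)                 # each row is a distribution over tokens
    M = A @ H                                            # (r, d) matrix-valued representation
    return A, M

# Toy usage with random inputs (sizes chosen arbitrarily for illustration).
rng = np.random.default_rng(0)
n, d, da, r = 6, 8, 5, 3
H = rng.normal(size=(n, d))
W1 = rng.normal(size=(da, d))
W2 = rng.normal(size=(r, da))
A, M = structured_attention(H, W1, W2)
print(A.shape, M.shape)
```

Each of the `r` rows of `A` is a separate attention distribution over the tokens, so the rows of `M` can weight different parts of the sequence. With a single row (`r = 1`) this reduces to ordinary single-vector attention pooling.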