An evolution of the Transformer model that applies the same self-attention mechanism repeatedly across several time-steps, making it more effective for certain types of tasks. 27.07.2023 17:54 aior