An activation function that is used in various architectures such as Transformer models and was found to perform better than ReLU in some cases. 27.07.2023 17:54 aior