An implementation of BERT-like models which is significantly faster for serving in real-time applications, thanks to efficient C++ deployments. 27.07.2023 17:54 aior