These networks use product quantization to compress the weights, reducing memory requirements and speeding up computation. 27.07.2023 17:54 aior