This method allows a small student model to learn from a larger teacher model by matching their output Jacobians, not just their output values. 27.07.2023 17:54 aior