1. It's hard to remove more than 50% of the parameters.
2. Compression is achieved via a combination of sparsification, distillation, and (optionally) quantization.