2024-08-07 05:12 AM
I have a large PyTorch model that I pruned using unstructured methods to decrease its size. The model size can only decrease if I filter out the zeros and save the model in a sparse format.
However, my problem is that ONNX doesn't support the sparse format. Does anyone know how I could prune a model, filter out the zeros to reduce the model size, and implement it on an evaluation board?
2024-08-26 03:24 AM
Sparsity is not supported by STM32Cube.AI either.