We have designed a custom packer to pack the sparsity in pruned models for specific target hardwares. After pruning, when the model is exported to .h5 format, the size of model is dropping significantly.
- Why is this the case as the number of weights are still same(though 65% are 0s) and precision is float32 ?
- Is there any compression inherently happening in storing the data in h5 format ?
- If there is some compression/sparsity packing happening while saving the model in h5, how can we stop it so it retains the original size of the model ?
Thanks in advance.