Stochastic Depth [1] is a training-time technique to drop outputs of a layer.
Here (Colab Notebook) I’m using Stochastic Depth which should not cause the number of model parameters to drop but unfortunately, it seems to be dropping it. Otherwise, there’s something wrong with my understanding and I appreciate any help.