TFlite model too heavy for android

I trained a yolov3 model for a specific dataset and converted it to tflite, the tflite model weighs 200 MB, how do other object detection apps create light tflite models? Is it about the network size? here is the model:
keras-yolo3/ at master · roboflow-ai/keras-yolo3 · GitHub,

TF Lite includes a Model Optimization Toolkit for helping you trade off size, latency, accuracy, etc. There’s also this model optimization guide.

Typically you do things like quantization (e.g. converting 32-bit floats into something with less precision), model pruning (e.g. trimming the parts unused for inference) and clustering (e.g. reducing the number of unique weights by sharing when approximately equal). The toolkit and guide can help you with all of these.

1 Like