Memory Resident Vision Model

I have a pytorch model working using computer vision and each time I need to run detect.py. This takes a few seconds to load and produce the output predictions. I was seeking a way to have a CV model always running and ready to receive new images for it to detect objects on. What do you recommend in order to have a memory resident model always on, such that I can continue sending individual images and not have to reload the parameters into memory each and every time?