Real-time model prediction in Docker becomes slower over time


I have deployed my model on a server in a Docker container (with GPU support) as a real-time prediction application. After about 1 or 2 hours of running, the real-time prediction becomes slow. Any suggestions?

Are you using TensorFlow Serving?