Error when using TFLite interpreter in Flask

@Bhack As you suggested, I tried to follow the TF Serving technique you provided, but ran into some problems there. I then implemented TF Serving with Flask following another website. As far as I've learned, we can't use TFLite with TF Serving, so I converted my h5 model (42.0 MB) to pb (and the required format), which worked fine. But it is still slow. Do you think my PC needs to be more powerful?
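(For reference, a minimal sketch of the h5 → SavedModel export that TF Serving loads; the paths here are hypothetical, not the ones I actually used:)

import tensorflow as tf

# Load the trained Keras model (path is hypothetical)
model = tf.keras.models.load_model('model.h5')

# TF Serving expects a SavedModel inside a numbered version directory
model.save('models/model_name/1')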

NB: Current PC config: 8 GB RAM, 1 TB HDD, 4 GB (graphics)!

I modified the frame-generation code for prediction as below. Is there any problem with it?

import cv2
import numpy as np
import requests

def generate_frames(frame):
    # Resize to the model's expected input size and convert BGR -> RGB
    img_face = cv2.resize(frame, (256, 256))
    img_face = cv2.cvtColor(img_face, cv2.COLOR_BGR2RGB)

    # Normalize to [0, 1] and convert to float32
    img = (img_face / 255.0).astype(np.float32)

    payload = {
        "instances": [{'input_1': img.tolist()}]
    }

    # Query the TF Serving REST endpoint
    r = requests.post('http://localhost:8501/v1/models/model_name:predict', json=payload)
    mask = np.array(r.json()['predictions'])[0]

    # Scale the mask back to uint8 and encode as JPEG bytes
    final_result = (mask * 255).astype(np.uint8)
    ret, buffer = cv2.imencode('.jpg', final_result)
    return buffer.tobytes()
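(For context, a hedged sketch of how such a function is typically wired into a Flask MJPEG streaming route; the route name and camera source are assumptions, not from this thread:)

from flask import Flask, Response
import cv2

app = Flask(__name__)

def stream():
    cap = cv2.VideoCapture(0)  # hypothetical camera source
    while True:
        ok, frame = cap.read()
        if not ok:
            break
        jpg = generate_frames(frame)
        # multipart/x-mixed-replace lets the browser render an MJPEG stream
        yield (b'--frame\r\n'
               b'Content-Type: image/jpeg\r\n\r\n' + jpg + b'\r\n')

@app.route('/video')
def video():
    return Response(stream(), mimetype='multipart/x-mixed-replace; boundary=frame')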

Yes, it is correct.

Do you think my PC needs to be more powerful?

What is your GPU?

NVIDIA- GeForce-940MX (4 GB DDR3 dedicated)

Have you followed the TF Serving for GPU steps?

Also, even if it is running correctly on the GPU, this specific model could still be relatively slow if your model is too heavy. See:

Yes, I think so! I tried with the tensorflow/serving:latest-gpu image.

Check with nvidia-smi that your GPU is occupied.

@Bhack I think you got the right point!! Somehow it’s not utilizing my GPU!!!

+-----------------------------------------------------------------------------+
| NVIDIA-SMI 471.41       Driver Version: 471.41       CUDA Version: 11.4     |
|-------------------------------+----------------------+----------------------+
| GPU  Name            TCC/WDDM | Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|                               |                      |               MIG M. |
|===============================+======================+======================|
|   0  NVIDIA GeForce ... WDDM  | 00000000:01:00.0 Off |                  N/A |
| N/A    0C    P8    N/A /  N/A |     40MiB /  4096MiB |      0%      Default |
|                               |                      |                  N/A |
+-------------------------------+----------------------+----------------------+

+-----------------------------------------------------------------------------+
| Processes:                                                                  |
|  GPU   GI   CI        PID   Type   Process name                  GPU Memory |
|        ID   ID                                                   Usage      |
|=============================================================================|
|  No running processes found                                                 |
+-----------------------------------------------------------------------------+

This is after running the program! But what’s wrong with it?!

I think I missed the NVIDIA Docker point!
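(For the record, a hedged sketch of the GPU-enabled launch, assuming the NVIDIA Container Toolkit is installed; the model path and name are placeholders:)

docker run --gpus all -p 8501:8501 \
  --mount type=bind,source=/path/to/models/model_name,target=/models/model_name \
  -e MODEL_NAME=model_name -t tensorflow/serving:latest-gpu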


@Bhack Apart from this, I noticed that when I use TFLite normally on my PC it doesn’t utilize the GPU, but the normal model does. What’s the reason?

Yes, TFLite currently has no NVIDIA/CUDA GPU delegate.
On that GPU you need to use regular TF. See:
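(A quick way to confirm that regular TF actually sees the CUDA GPU; a minimal check, not from the thread:)

import tensorflow as tf

# An empty list here means TF will silently fall back to the CPU
print(tf.config.list_physical_devices('GPU'))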

@Bhack I’m thinking about MediaPipe’s hair segmentation, but it’s available for Android & C++… Is there any way to use it in Python?

With serving you need to use a “regular” TF model:

It is probably experimental, but if you need a specific TFLite model you could try to convert your model with:

Then you could probably write your own service with TF.js Node GPU:

@Bhack Thanks for all the suggestions. Can you please take a look at this issue? Thank you.

Isn’t that the same issue?

No, this time the normal model is working fine, but the first prediction after starting the server takes a long time!!
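(That first-call latency is usually one-time graph/runtime initialization; a common workaround is to fire a dummy warm-up request right after startup. A minimal sketch, reusing the endpoint from the snippet above:)

import numpy as np
import requests

# Dummy warm-up request so the first real prediction doesn't pay
# the initialization cost; the input shape matches the snippet above
dummy = np.zeros((256, 256, 3), dtype=np.float32)
payload = {"instances": [{'input_1': dummy.tolist()}]}
requests.post('http://localhost:8501/v1/models/model_name:predict', json=payload)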

@Bhack I’m going through some confusion. Almost every article recommends TF Serving for deployment, but when should I avoid it?

You have TensorFlow Serving, or you can experiment with TF.js Node.

@Bhack @George_Soloupis Is there any solution that helps with Flask/FastAPI serving (cannot reload/refresh the model)? Please help.

I tried removing all possible references to interpreter-related objects (input_details, output_details), etc.
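(One pattern worth trying: drop every reference to the old interpreter, force garbage collection, then build a fresh one. A minimal sketch, with a hypothetical model path:)

import gc
import tensorflow as tf

def load_interpreter(path):
    # A fresh interpreter must allocate tensors before it can run
    interpreter = tf.lite.Interpreter(model_path=path)
    interpreter.allocate_tensors()
    return interpreter

interpreter = load_interpreter('model.tflite')

# To refresh the model: delete all references (interpreter, input_details,
# output_details, ...) before constructing the replacement
del interpreter
gc.collect()
interpreter = load_interpreter('model.tflite')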