Model accuracy for TFLite pose detection using a custom classifier

Hi, I’ve been trying to train TFLite pose classifiers to work with the MoveNet Pi example.

I’ve used the suggested tutorial and Colab and carefully followed the example yoga pose dataset (with train and test examples), but the classification results seem almost random: 2 out of 4 classifiers successfully classify a pose about 50% of the time.

My dataset is here

Training in the Colab works well: the confusion matrix shows no errors, the model accuracy is high (0.9977), and accuracy increases through the training epochs as expected. Has anyone used this Colab successfully?
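
To put numbers on the gap between the Colab results and what happens on the device, something like the sketch below could tally per-class accuracy and a confusion count from a handful of labelled test images run through the Pi example. It uses only the Python standard library. The function names `per_class_accuracy` and `confusion_counts` are hypothetical helpers, and the label lists are made-up placeholders, not results from my dataset.

```python
from collections import Counter, defaultdict


def per_class_accuracy(true_labels, predicted_labels):
    """Return {class: fraction of that class's samples predicted correctly}."""
    if len(true_labels) != len(predicted_labels):
        raise ValueError("label lists must have the same length")
    totals = Counter(true_labels)
    correct = Counter(
        t for t, p in zip(true_labels, predicted_labels) if t == p
    )
    return {label: correct[label] / totals[label] for label in totals}


def confusion_counts(true_labels, predicted_labels):
    """Return {true class: {predicted class: count}}."""
    counts = defaultdict(Counter)
    for t, p in zip(true_labels, predicted_labels):
        counts[t][p] += 1
    return {t: dict(c) for t, c in counts.items()}


# Placeholder labels for illustration only (not real measurements).
true_labels = ["chair", "chair", "tree", "tree"]
predicted_labels = ["chair", "tree", "tree", "tree"]

print(per_class_accuracy(true_labels, predicted_labels))
print(confusion_counts(true_labels, predicted_labels))
```

If the per-class numbers on the device differ sharply from the Colab confusion matrix for the same test images, that would suggest the difference lies somewhere between training and on-device inference rather than in the training itself.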