Model overfitting with LSTM layers

I am trying to train a model on human skeleton data. I was able to achieve good accuracy on the training set, but the validation loss reaches a point and then starts to increase again, although the validation accuracy does not decrease over time. Clearly it is overfitting, and I understand that. To reduce it I have tried most of the usual techniques, but was not able to decrease the validation loss: dropout, reducing the model capacity, and adding regularization losses to the layers, all with no luck.
The loss/accuracy log graph is shown below.

Any ideas to improve the model??
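
For context, the kind of regularization I tried looks roughly like this (a minimal PyTorch sketch with placeholder layer sizes, not my exact model):

```python
import torch
import torch.nn as nn

class SkeletonLSTM(nn.Module):
    """Toy LSTM classifier over skeleton sequences (placeholder sizes)."""
    def __init__(self, in_dim=75, hidden=128, n_classes=20, p_drop=0.5):
        super().__init__()
        # dropout between the stacked LSTM layers
        self.lstm = nn.LSTM(in_dim, hidden, num_layers=2,
                            batch_first=True, dropout=p_drop)
        # dropout before the classifier head
        self.drop = nn.Dropout(p_drop)
        self.fc = nn.Linear(hidden, n_classes)

    def forward(self, x):                      # x: (batch, time, 75)
        out, _ = self.lstm(x)
        return self.fc(self.drop(out[:, -1]))  # classify from the last time step

model = SkeletonLSTM()
# weight decay plays the role of the per-layer regularization loss
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3, weight_decay=1e-4)
```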

Have you tried to augment the dataset?

Yep, I have tried that too. But I feel that for this number of model parameters it simply needs more data. Maybe I'm wrong.

Have you also tried to overfit on train+validation?

No. When I did data augmentation, the training loss seemed fine, but the validation loss still increased again.

Are you handling a 2D or 3D pose dataset?

It's 3D pose, but it has been reshaped to 75 features (i.e. 25 keypoints × 3 coordinates).
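
To make that concrete, the reshape just flattens the keypoint axis into the feature axis, along these lines (a small NumPy sketch with a made-up frame count):

```python
import numpy as np

# one sample: (frames, 25 keypoints, 3 coordinates)
sample = np.random.randn(300, 25, 3)

# flatten each frame to a 75-dim vector, so the LSTM sees (frames, 75)
sample_flat = sample.reshape(sample.shape[0], 25 * 3)
print(sample_flat.shape)  # (300, 75)
```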

I don't know your dataset, but if you cannot collect more training data to cover your validation distribution, you can try an interesting augmentation approach like this one:

https://arxiv.org/abs/2105.02465
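
If it helps, even a much simpler geometric augmentation (this is not the method from the paper above, just a generic example) can be applied directly to the 3D joints:

```python
import numpy as np

def augment_skeleton(joints, max_angle=0.3, noise_std=0.01):
    """joints: (frames, 25, 3). Random rotation about the vertical axis plus Gaussian jitter."""
    theta = np.random.uniform(-max_angle, max_angle)
    c, s = np.cos(theta), np.sin(theta)
    rot = np.array([[  c, 0.0,   s],
                    [0.0, 1.0, 0.0],
                    [ -s, 0.0,   c]])
    rotated = joints @ rot.T
    return rotated + np.random.normal(0.0, noise_std, joints.shape)
```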

I am using the NTU-RGBD dataset for training. Based on your idea, what should the validation distribution look like? My dataset size is around 18,000 samples, split 80:10:10, and the model has around 210,864 parameters.
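
For reference, a class-stratified 80:10:10 split can be done along these lines (a scikit-learn sketch with placeholder arrays standing in for the real features and labels, not my exact preprocessing):

```python
import numpy as np
from sklearn.model_selection import train_test_split

# placeholders: 18,000 samples with 75 features each and 20 class labels
X = np.random.randn(18000, 75)
y = np.random.randint(0, 20, size=18000)

# 80% train, then split the remaining 20% evenly into 10% val / 10% test
X_train, X_rest, y_train, y_rest = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=0)
X_val, X_test, y_val, y_test = train_test_split(
    X_rest, y_rest, test_size=0.5, stratify=y_rest, random_state=0)
```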

Are you training on NTU-RGBD and evaluating on your own custom dataset?

I am using NTU-RGBD for both training and validation.

In your graph, was the loss/accuracy for the action recognition task or for the keypoints?

It was for action recognition, because I am predicting 20 classes.

Have you correctly sampled/balanced all the classes in the training set?

Yes, I took that into account while preparing the dataset.

Have you tried to build the confusion matrix, or the classification error for each class, to check how it is distributed?

This is the data distribution; the numbers correspond to classes.
Counter({14: 850, 16: 849, 9: 848, 18: 848, 5: 848, 4: 847, 6: 847, 17: 846, 1: 845, 19: 845, 3: 845, 15: 845, 12: 844, 8: 844, 10: 844, 2: 843, 0: 841, 11: 840, 13: 837, 7: 834})

Yes, but I meant how the validation error is distributed over the classes.

Before debugging your custom model, have you tried to reproduce approximate results with any well-known model on this dataset?

How can I get the validation error distributed over classes? I mean, how do I visualise the loss per class?

You can build the confusion matrix from the validation ground-truth labels and the model predictions.
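
For example, something along these lines works (a scikit-learn sketch; `val_labels` and `val_preds` are placeholders for your real validation ground truth and model predictions):

```python
import numpy as np
from sklearn.metrics import confusion_matrix, classification_report

# placeholders for the real validation ground-truth and predicted class indices
val_labels = np.random.randint(0, 20, size=1800)
val_preds  = np.random.randint(0, 20, size=1800)

cm = confusion_matrix(val_labels, val_preds, labels=np.arange(20))
per_class_acc = cm.diagonal() / cm.sum(axis=1)   # accuracy for each class
print(per_class_acc)
print(classification_report(val_labels, val_preds))
```

Each row of the confusion matrix shows how that class's validation samples are spread over the predicted classes, which is exactly the per-class error distribution.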