Transfer learning and Quantization aware training. Subclassed model

aqaaqa · November 28, 2021, 5:53pm

Hello
I am getting the following error when trying to do quantization aware training with tensorflow 2.7:

ValueError: `to_quantize` can only either be a tf.keras Sequential or Functional model.

The error occurs when calling this method:

quantize_model = tfmot.quantization.keras.quantize_model(model)

The model is defined below. I suppose the reason is that subclassed models are not supported? I have already trained(normal training, not QAT) multiple models with the definition below. Post-training quantization works, but i would like to try quantization aware training to see if it improves performance. Is there a way to be able to do quantization aware training with the model below, or alternatively define it in another way and redo normal training.

import tensorflow as tf
from tensorflow import keras
from tensorflow.keras.layers import Dense, Flatten, Activation, Dropout
from tensorflow.keras.models import Model


class Mobilenet_v2_transfer(Model):

    def __init__(self):
        super(Mobilenet_v2_transfer, self).__init__()
        self.base = tf.keras.applications.mobilenet_v2.MobileNetV2(
            input_shape=(224, 224, 3), alpha=1.0, include_top=False, weights='imagenet',
            pooling='avg')
        self.base.trainable = True
        for layer in self.base.layers[:130]:
            layer.trainable =  False
        self.flatten = Flatten()
        self.dense = Dense(1, kernel_regularizer=tf.keras.regularizers.L2(0.01)
        self.sigmoid = Activation('sigmoid')
        
    def call(self, x):
        x = self.base(x)
        x = self.flatten(x)
        x = self.dense(x)
        x = self.sigmoid(x)
        return x

Bhack · November 28, 2021, 7:18pm

Have you checked:

github.com/tensorflow/model-optimization

QAT (quantization aware training) Support quantizing models recursively

opened 04:55PM - 04 May 20 UTC

CRosero

feature request technique:qat

**Describe the bug** I'm doing transfer learning and would like to (at the end)… quantize my model. The problem is that when I try to use the _quantize_model()_ function (which is used successfully in numerous tutorials and videos), I get an error. How am I supposed to do quantization for transfer learning (using an already previously built model as a feature extractor)? **System information** TensorFlow installed from (source or binary): pip TensorFlow version: tf-nightly 2.2.0 TensorFlow Model Optimization version: 0.3.0 Python version: 3.7.7 **Describe the expected behavior** I expect the model to be successfully quantized and for no error messages to appear. **Describe the current behavior** I get the error: "ValueError: Quantizing a tf.keras Model inside another tf.keras Model is not supported." **Code to reproduce the issue** Can be found [here](https://colab.research.google.com/drive/17MES7jHh_BkngRgmGHhPhhujTsYwPMbe)

Jaehong_Kim · December 3, 2021, 1:02am

Basically current QAT API doesn’t supports subclass model with just a quantize_model API.

I’d like to recommend you change the model to be functional model if the model is simple enough. (it may requires redo normal training to make sure the model is exactly same.)

Functional model Example) models/classification_model.py at f8f4845cc85ef674d6285337b55e43638039ff91 · tensorflow/models · GitHub

super().init(inputs=inputs, outputs=x) is a patten that you can create a subclass functional model which is internally same as an example here: The Functional API | TensorFlow Core

But it also not supports to quantize recursively as Bhack@ mentioned.

So you may have to put a flag to init to determine you quantize the self.base model manually (by calling quantize_apply) on init method. (also you have to call quantize_apply for the entire model to quantize some layers outside of self.base.) As similar to the Bhack@ mentioned link.