Modification of layers in hub.KerasLayer

The hub.KerasLayer object does not allow modification of existing layer parameters. This is a serious limitation for users who would like to use TF Hub models in their own architectures with minor changes. Some specific use cases are listed below, followed by a minimal sketch of the limitation.

  1. Residual networks (2D and 3D) can be dilated to increase the spatial size of their feature maps, which is known to benefit performance in tasks such as semantic segmentation and image classification.
  2. Removing temporal pooling in 3D CNNs. In many applications, such as tracking and action detection, it is desirable to remove temporal pooling so that the input and output have the same number of frames. This also requires modifying layer parameters.
  3. Access to intermediate activations is important for explainability. This requires accessing intermediate layers, which is currently not supported.
  4. Access to intermediate layers is also needed to use multiple feature levels of a CNN directly, as in FPN and in some object detectors such as MSCNN.
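To make the limitation concrete, here is a minimal sketch (the handle is just an illustrative example; any TF Hub feature-vector model behaves the same way). The hub layer shows up as a single node in the Keras graph, with no way to reach the blocks inside it:

```python
import tensorflow as tf
import tensorflow_hub as hub

# Illustrative handle; any TF Hub feature-vector model behaves the same way.
backbone = hub.KerasLayer(
    "https://tfhub.dev/google/imagenet/resnet_v2_50/feature_vector/5",
    trainable=True)

model = tf.keras.Sequential([
    tf.keras.layers.InputLayer(input_shape=(224, 224, 3)),
    backbone,
    tf.keras.layers.Dense(10),
])

# `backbone` appears as one opaque layer: there is no `.layers` attribute
# or `get_layer()` exposing the ResNet blocks inside it, so dilation rates,
# pooling ops, and intermediate activations cannot be changed or read out.
model.summary()
```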

I therefore believe that the usability of TF Hub models would increase tremendously if the hub.KerasLayer object could be modified. Is there any plan to consider this? It seems important for improving the reach of TF Hub.


Hi @Ujjwal,

Thanks for your detailed feedback!

Given the structure of SavedModel, I don’t think this request is addressable in general. TF Hub can only expose elements that a publisher decided to provide callable endpoints for.

For the modifications you are looking for, working directly with code like that in the Model Garden (blog post) is the way to go.

Hi @lgusm ,

Maybe my understanding is wrong but here is what I observe about the situation.

  1. The TensorFlow website states that the TF2 SavedModel format is the preferred way to publish models on TF Hub. Now the problem is that one can normally load a TF2 SavedModel and modify its layers, but on TF Hub this is not possible, because the whole model is wrapped in a hub.KerasLayer. This seems like a consistency issue, if I am not wrong (see the sketch after this list).
  2. The current situation prevents researchers like me (and I know many others with similar concerns) from adopting pre-trained TF Hub models and using them in novel settings for experimentation.
  3. Because of points 1) and 2), on many occasions researchers are unfortunately compelled to rewrite their prototype code in other frameworks where the model-sharing ecosystem is more consistent and pre-trained models are easily available.
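As a minimal sketch of the inconsistency in point 1 (the path, handle, and layer name below are hypothetical): a Keras model reloaded from its own SavedModel keeps its layer graph and can be tapped or modified, while the same kind of model consumed through hub.KerasLayer is a single opaque layer.

```python
import tensorflow as tf
import tensorflow_hub as hub

# A Keras model reloaded from its own SavedModel keeps its layer graph,
# so intermediate layers can be reached and reused:
restored = tf.keras.models.load_model("my_resnet_savedmodel/")  # hypothetical path
feature_tap = tf.keras.Model(
    inputs=restored.input,
    outputs=restored.get_layer("conv4_block1_out").output)  # hypothetical layer name

# The same architecture consumed through TF Hub is one opaque layer;
# there is no get_layer() into its internals:
opaque = hub.KerasLayer("https://tfhub.dev/...", trainable=True)  # placeholder handle
```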

Especially because of point 1), I think that revamping the hub.KerasLayer class would be a good idea, as it would allow people not only to use pretrained models in the standard settings of fine-tuning and feature extraction, but also in more research-oriented settings.


Please read the thread on the Model Garden refactoring (starting from this point):


@Ujjwal
I also face the same issue with hub.KerasLayer. It’s really disappointing.

@Bhack
I went to the RFC but it’s too long to read. Could you please summarize?

Nothing specific; if you look at the comment I pointed to, it was about improving TF Hub and Model Garden interaction and coordination when the Model Garden redesign was proposed in that RFC.

I guess, as of today, if you want to make changes to a model, you’ll need its source code and will have to work from the checkpoint to load the weights. Model Garden has both.
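A minimal sketch of that workflow (the checkpoint path is a placeholder, and the tiny two-layer network stands in for a real Model Garden architecture): because the architecture is defined in code, any layer parameter can be changed before the published weights are restored.

```python
import tensorflow as tf

def build_backbone(dilation_rate=1):
    # Stand-in for a Model Garden model built from source; the dilation
    # rate can be changed without altering kernel shapes, so the original
    # checkpoint weights still fit.
    return tf.keras.Sequential([
        tf.keras.layers.Conv2D(64, 3, padding="same",
                               dilation_rate=dilation_rate,
                               input_shape=(None, None, 3)),
        tf.keras.layers.Conv2D(128, 3, padding="same",
                               dilation_rate=dilation_rate),
    ])

model = build_backbone(dilation_rate=2)  # e.g. a dilated variant
ckpt = tf.train.Checkpoint(model=model)
ckpt.restore("path/to/model_garden_ckpt").expect_partial()  # placeholder path
```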


Yes, exactly. My point at that time was to think about whether and how we could break this rigid barrier between inference/fine-tuning and the full model control of Model Garden.

I still think we could take an evolutionary step in this direction in one of the next roadmaps.
