Converting PyTorch to Keras: Internal Blocks Not Showing Up

Jonathan_Bechtel · December 20, 2021, 5:48pm

Hi everyone,

For a personal project I’m trying to recreate the NBeats architecture into Keras, and I don’t think I’m doing it correctly but am not sure why.

The page I’m working off of as a ground truth can be found here: https://github.com/ElementAI/N-BEATS/blob/master/models/nbeats.py

Here’s the starter PyTorch code that I’m trying to convert:

class NBeatsBlock(t.nn.Module):
    def __init__(self,
                 input_size,
                 theta_size: int,
                 basis_function: t.nn.Module,
                 layers: int,
                 layer_size: int):
        super().__init__()
        self.layers = t.nn.ModuleList([t.nn.Linear(in_features=input_size, out_features=layer_size)] +
                                      [t.nn.Linear(in_features=layer_size, out_features=layer_size)
                                       for _ in range(layers - 1)])
        self.basis_parameters = t.nn.Linear(in_features=layer_size, out_features=theta_size)
        self.basis_function = basis_function

    def forward(self, x: t.Tensor) -> Tuple[t.Tensor, t.Tensor]:
        block_input = x
        for layer in self.layers:
            block_input = t.relu(layer(block_input))
        basis_parameters = self.basis_parameters(block_input)
        return self.basis_function(basis_parameters)


class NBeats(t.nn.Module):
    def __init__(self, blocks: t.nn.ModuleList):
        super().__init__()
        self.blocks = blocks

    def forward(self, x: t.Tensor, input_mask: t.Tensor) -> t.Tensor:
        residuals = x.flip(dims=(1,))
        input_mask = input_mask.flip(dims=(1,))
        forecast = x[:, -1:]
        for i, block in enumerate(self.blocks):
            backcast, block_forecast = block(residuals)
            residuals = (residuals - backcast) * input_mask
            forecast = forecast + block_forecast
        return forecast

class GenericBasis(t.nn.Module):
    def __init__(self, backcast_size: int, forecast_size: int):
        super().__init__()
        self.backcast_size = backcast_size
        self.forecast_size = forecast_size

    def forward(self, theta: t.Tensor):
        return theta[:, :self.backcast_size], theta[:, -self.forecast_size:]

Here’s the Keras code I have to translate:

class NBeatsBlock(keras.layers.Layer):
    def __init__(self, 
                 theta_size: int,
                 basis_function: keras.layers.Layer,
                 layer_size: int = 4):
        super(NBeatsBlock, self).__init__()
        self.layers_          = [keras.layers.Dense(layer_size, activation = 'relu') 
                                    for i in range(layer_size)]
        self.basis_parameters = keras.layers.Dense(theta_size)
        self.basis_function   = basis_function
        
    def call(self, inputs):
        x = self.layers_[0](inputs)
        for layer in self.layers_[1:]:
            x = layer(x)
        x = self.basis_parameters(x)
        return self.basis_function(x)
    
class NBeats(keras.layers.Layer):
    def __init__(self, 
                 blocksize: int,
                 theta_size: int,
                 basis_function: keras.layers.Layer):
        super(NBeats, self).__init__()
        self.blocks = [NBeatsBlock(theta_size =  theta_size, basis_function =  basis_function) for i in range(blocksize)]
        
    def call(self, inputs):
        residuals = K.reverse(inputs, axes = 0)
        forecast  = inputs[:, -1:]
        for block in self.blocks:
            backcast, block_forecast = block(residuals)
            residuals                = residuals - backcast
            forecast                 = forecast + block_forecast
        return forecast
    
class GenericBasis(keras.layers.Layer):
    def __init__(self, backcast_size: int, forecast_size: int):
        super().__init__()
        self.backcast_size = backcast_size
        self.forecast_size = forecast_size
        
    def call(self, inputs):
        return inputs[:, :self.backcast_size], inputs[:, -self.forecast_size:]

If I try and make a model from the Keras code it works, but I don’t think it’s constructed correctly.

Here’s a simple model:

inputs = Input(shape = (1, ))

nbeats = NBeats(blocksize = 4, theta_size = 7, basis_function = GenericBasis(7, 7))(inputs)
out = keras.layers.Dense(7)(nbeats)

model = Model(inputs, out)

My concern is that the internal NBeatsBlock layers are not actually being used in the model I just created.

My model summary reads like this:

NBeats_summary ,

And as you can see there’s nothing that indicates the internal Dense layers are there.

And if I plot the model I get the following diagram:

NBeats_graph

So I don’t think I’m doing things correctly but I’m also not sure where I’m going wrong with how I’m constructing it. I’m guessing there are small differences in how PyTorch & Keras work that I’m not picking up on.

Bhack · December 20, 2021, 10:03pm

I’ve not personally verified the correct implementation but there was a parallel Pytorch and Keras impl at:

Bhack · December 20, 2021, 11:41pm

Also please check how to print summaries with nested models:

Or plot_model:

github.com/keras-team/keras

[P] [RELNOTES] Add ability to visualize wrapped models with plot_model function

keras-team:master ← yoks:master

opened 09:19PM - 18 Oct 18 UTC

yoks

+73 -20

### Summary Added two new optional arguments to `plot_model` function in `vis_u…tils.py`, to be able to visualize nested (wrapped) models. Also adds `dpi` param to have ability to produce high resolution graphs. For example giving this model: ```python sentence_input = Input(shape=(2, 3), dtype='float32', name="input2") l_lstm = Bidirectional(LSTM(16))(sentence_input) sent_encoder = Model(sentence_input, l_lstm) review_input = Input(shape=(5, 2, 3), dtype='float32') review_encoder = TimeDistributed(sent_encoder)(review_input) l_lstm_sent = LSTM(16)(review_encoder) preds = Dense(5, activation='softmax')(l_lstm_sent) model = Model(review_input, preds) vis_utils.plot_model(model, to_file='model3.png', show_shapes=True, expand_nested=True, dpi=300) ``` Will produce: ![model3](https://user-images.githubusercontent.com/3962002/47184729-4d86fe80-d2e0-11e8-903d-cbf5c393f3eb.png) While calling it without new arguments will produce plot as before: ```python vis_utils.plot_model(model, to_file='model3.png', show_shapes=True) ``` ![model3](https://user-images.githubusercontent.com/3962002/47184794-7a3b1600-d2e0-11e8-8bf0-bc62b1043969.png) ### Related Issues #5937 ### PR Overview - [x] This PR requires new unit tests [y/n] (make sure tests are included) - [x] This PR requires to update the documentation [y/n] (make sure the docs are up-to-date) - [x] This PR is backwards compatible [y/n] - [x] This PR changes the current API [y/n] (all API changes need to be approved by fchollet)

innat · December 23, 2021, 2:15am

Not an actual solution but some pointer.