How to call predict on a fitted transformer encoder-decoder model

I have fitted a model with a fairly standard transformer encoder-decoder architecture built from keras_nlp layers. When training the model, I provide the data as ([input, target], offset_target). Here target is the sequence that goes into the decoder, and offset_target is the same target sequence offset by one step. This is a standard NLP seq2seq setup.
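To make the data layout concrete, here is a minimal sketch of how one training example is arranged. The helper name make_training_pair and the token ids are hypothetical, and I am assuming the decoder input is the target without its last token while offset_target is the target without its first token, so both have the same length.

```python
def make_training_pair(source_ids, target_ids):
    """Arrange one example as ((input, target), offset_target).

    Assumption: target_ids begins with a start token and ends with an
    end token; the names and ids here are illustrative only.
    """
    decoder_input = target_ids[:-1]   # what the decoder sees at each step
    offset_target = target_ids[1:]    # what the decoder should predict next
    return (source_ids, decoder_input), offset_target


# Hypothetical ids: 1 = start token, 2 = end token.
(encoder_input, decoder_input), offset_target = make_training_pair(
    [5, 6, 7], [1, 8, 9, 2]
)
print(encoder_input, decoder_input, offset_target)
```

At training time every position of the decoder input is known in advance, which is exactly what is missing at prediction time.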

How do I correctly call the predict method on the fitted model, given that I have no target sequences at inference time?