Shape of the integer sequences in TextVectorization layer

Deependra_Parichha · June 1, 2022, 8:42am

How the shape of integer sequences is determined in TextVectorization preprocessing layer? and how to handle the shape?

markdaoust · June 3, 2022, 1:59pm

Can you post the code that got you to this output?

inp_text_preprocessor = ???

inp = ???

I think mostly you’re just looking for the “output_sequence_length” argument.

If none, it’s padded to the longest sequence, otherwise it’s padded or truncated to fit.

Deependra_Parichha · June 4, 2022, 10:04am

input is a list of Portuguese sentences.

Deependra_Parichha · June 4, 2022, 10:07am

Yes, @markdaoust I’m confused with the output_sequence_length. How did it come 64 in the first pic?

Deependra_Parichha · June 4, 2022, 10:12am

So if I specify output_sequence_length=None it will automatically get padded to the longest sequence?

markdaoust · June 4, 2022, 1:44pm

specify output_sequence_length=None it will automatically get padded to the longest sequence?

It’s supposed to.

input is a list of Portuguese sentences.

This screenshot doesn’t help anything, most of the arguments are not shown.

Deependra_Parichha · June 5, 2022, 5:55am

Thank you @markdaoust I think I got the point we can control the output sequence length with the parameter output_sequence_length.