I was doing some data science work when I get stuck with this issue, I'm trying to create a model for a supervised task, where both Input and Output are of variable length.
Here is an example on how the Input and Output look like:
Input[0]: [2.42, 0.43, -5.2, -54.9]
Output[0]: ['class1', 'class3', 'class12']
The problem is that in the dataset that I have, the Inputs and Outputs are of variable length, so in order to be able to use the entire dataset, I need to find a way of encoding this data.
First I encoded the Outputs classes and added a padding to equal all Outputs length in the dataset, (let's say of length=(6,)):
Encoded_output[0]: [1, 3, 12, 0, 0, 0]
But I can't figure out a way of encoding the Input, because, as the original Input data are floats, I cannot create an encoding and add padding. I don't know what other options I have and I would like to hear how would you solve this.