My data looks like this. They are floats and they are in a big numpy array [700000,3]. There are no empty fields.
Label | Values1 | Values2
1. | 0.01 | 0.01
1. | ... | ...
1. |
2. |
2. |
3. |
...
The idea is to feed in the set of values1 and values2 and have it identify the label using classification.
But I don't want to feed the data row by row, but input all values1/2 that belong to label 1 as a set (e.g. inputting the first 3 rows is supposed to return [1,0,...], inputting the next 2 rows as a set [0,1,...])
Is there a non-complex way of feeding the data in this way? (i.e. feed batch where column label equals 1)
I am currently sorting the data and thinking about using pointers to the start and having loops which check if the next row is equal to the current to find a pointer to the end of the set and get the number of rows of that batch. But this more or less prevents randomizing input order.