My initial RDD (Array[Array[String]]) looks something like this:
a b h c ....... d e x q
d r d x ....... e q g m
f e u t ....... q t g y
a b h c ....... d e x q
d r d x ....... e q g m
f e u t ....... q t g y
....
....
I want a new RDD (Array[Array[String]]) that contains a sub-array of each row, for example:
u t ...... q t
h c ...... d e
Any idea how to do this? The arrays are large, so the sub-array could be of any size. Thanks in advance.
Write a function bigToSmallArray that takes one big array and returns the sub-array you want; then you just have to do:
rddOfSmallArrays = rddOfBigArrays.map(bigToSmallArray)
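As a minimal sketch of that mapping approach, one way to build bigToSmallArray is with Array.slice. The helper name subArray, the index bounds 2 and 6, and the shortened sample rows are assumptions made for illustration; a plain Seq stands in for the RDD so the snippet runs without a Spark dependency.

```scala
// Hypothetical helper: builds a function that keeps the elements at
// indices [from, until) of a row. slice clamps out-of-range bounds
// instead of throwing, so short rows just yield shorter results.
def subArray(from: Int, until: Int): Array[String] => Array[String] =
  row => row.slice(from, until)

// The per-row function from the answer, here fixed to indices 2..5 (assumed bounds).
val bigToSmallArray: Array[String] => Array[String] = subArray(2, 6)

// Local stand-in for the RDD's contents (sample rows, shortened for illustration).
val bigArrays: Seq[Array[String]] = Seq(
  Array("a", "b", "h", "c", "d", "e", "x", "q"),
  Array("d", "r", "d", "x", "e", "q", "g", "m"),
  Array("f", "e", "u", "t", "q", "t", "g", "y")
)

// Apply the function to every row.
val smallArrays: Seq[Array[String]] = bigArrays.map(bigToSmallArray)

// With Spark the call has the same shape, assuming rddOfBigArrays: RDD[Array[String]]:
//   val rddOfSmallArrays = rddOfBigArrays.map(bigToSmallArray)
```

Because map is applied row by row, each sub-array can be computed independently on the partition that holds its row.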