I am performing a join, and I have my data across over 100 nodes. So I have a small list of key/value that I am joining with another key/value pair.
My list looks like such:
[[1, 0], [2, 0], [3, 0], [4, 0], [5, 0], [6, 0], [7, 0], [8, 0], [9, 0], [10, 0], [11, 0], [16, 0], [18, 0], [19, 0], [20, 0], [21, 0], [22, 0], [23, 0], [24, 0], [25, 0], [26, 0], [27, 0], [28, 0], [29, 0], [36, 0], [37, 0], [38, 0], [39, 0], [40, 0], [41, 0], [42, 0], [44, 0], [46, 0]]
I have broadcast variable:
numB = sc.broadcast(numValuesKV)
When I do my join:
numRDD = columnRDD.join(numB.value)
I get the following error:
AttributeError: 'list' object has no attribute 'map'