I have a PySpark DataFrame with a column that contains comma-separated values. Each row starts with an ID, followed by up to a fixed number of values (say 4). Example:
+------------------------+
|col1                    |
+------------------------+
|1,val1, val4            |
|2,val1                  |
|3,val1, val2, val3      |
|4,val1, val2, val3, val4|
+------------------------+
I want to split it into two columns, like this:
+----+------------------------+
|col1|col2                    |
+----+------------------------+
|   1|[val1, val4]            |
|   2|[val1]                  |
|   3|[val1, val2, val3]      |
|   4|[val1, val2, val3, val4]|
+----+------------------------+
How can this be done?
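
For reference, here is a minimal sketch of one possible approach, not a definitive solution. It assumes Spark 3.0 or later (for the `limit` argument of `split`), that the first value is always an integer ID, and that the spaces after the commas should be dropped. The names `split_row` and `split_col1` are hypothetical helpers introduced only for illustration: `split_row` pins down the intended per-row result in plain Python, and `split_col1` applies the same idea with built-in column functions.

```python
def split_row(value):
    """Plain-Python reference for the intended result of a single row."""
    head, _, rest = value.partition(",")        # cut at the first comma only
    values = [v.strip() for v in rest.split(",")] if rest.strip() else []
    return int(head), values


def split_col1(df):
    """Sketch: split `col1` into an integer ID and an array of the rest.

    Assumes `df` has a string column named `col1` and Spark >= 3.0.
    """
    # Imported lazily so split_row can be used without Spark installed.
    from pyspark.sql import functions as F

    # limit=2 splits at the first comma only: [id, "val1, val2, ..."]
    head_rest = F.split(F.col("col1"), ",", 2)
    rest = F.trim(head_rest.getItem(1))

    return df.select(
        head_rest.getItem(0).cast("int").alias("col1"),
        # Split the remainder on a comma followed by optional whitespace.
        F.split(rest, r",\s*").alias("col2"),
    )
```

With the example above, `split_col1(df).show(truncate=False)` should give a table shaped like the desired output. Note that a row containing only an ID (no comma) would yield a null `col2` in this sketch rather than an empty array.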