I am using spark with python using the pyspark command.
I came across this new library
https://github.com/databricks/spark-csv
which I want to integrate into my spark program written in python.
I have downloaded the complete codebase shown in the above link but not sure how to proceed from there. Should I just copy its contents somewhere?
Any guidance would help as I am still learning scala and spark and want to use the above library code.