fingltd / 4mc

4mc - splittable lz4 and zstd in hadoop/spark/flink
Other
108 stars 37 forks source link

Added `pyspark` example to readme file #22

Closed surjikal closed 6 years ago

surjikal commented 7 years ago

It took me a long time to figure out how to load data properly in pyspark. Figured it'd be good to have an example in the readme, especially since sc.newAPIHadoopFile is not something you see often in blog posts and examples.