Open Gunju-Ko opened 3 years ago
df.write.option("compression","gzip").csv("path")
출처 : https://stackoverflow.com/questions/40163996/how-to-save-a-dataframe-as-compressed-gzipped-csv
rdd.saveAsTextFile("/user/cloudera/sfpd.2", classOf[org.apache.hadoop.io.compress.GzipCodec])
출처 : https://sites.google.com/a/einext.com/einext_original/apache-spark/compress-output-files-in-spark
gzip으로 압축하여 쓰기
출처 : https://stackoverflow.com/questions/40163996/how-to-save-a-dataframe-as-compressed-gzipped-csv