Closed RobinL closed 4 years ago
Want the ability to do
df = spark.read.parquet("path_to_parquet") pmeta_json = df.schema.json() tab = tablemeta_from_parquet_meta(pmeta_json, name, location)
or
from pyarrow.parquet import ParquetFile md = ParquetFile("test_nest.parquet").metadata pmeta_json = md.metadata[b"org.apache.spark.sql.parquet.row.metadata"] tab = tablemeta_from_parquet_meta(pmeta_json, name, location)
Want the ability to do
or