Open manuzhang opened 5 years ago
you can load data like this, and you need to prepare the schema
sparkSession.read.format("tfrecords").schema(schema).load(inputPattern)
Unless I don't know the schema and want to find out with printSchema
Unless I don't know the schema and want to find out with
printSchema
I guess the process need to scan the whole data to generate the schema, may be you can try use less data
I find all input data are loaded when I just want to print schema with the following code.