Closed kelvinksau closed 7 years ago
It looks like your version doesn't use the schema we have to use to load this data, so it ends up being:
airlines = spark.read.format('com.databricks.spark.csv')\
.options(header='false', nullValue='\\N')\
.schema(schema)
.load('data/airlines.csv')
airlines.show()
See https://github.com/rjurney/Agile_Data_Code_2/commit/014f07608e5beaaa1e8292dc3fb9460eb317d298 and https://github.com/rjurney/Agile_Data_Code_2/commit/a5af14613a0008da49795c9cc176258b3776282d
should change to