GoogleCloudPlatform / data-science-on-gcp

Source code accompanying book: Data Science on the Google Cloud Platform, Valliappa Lakshmanan, O'Reilly 2017
Apache License 2.0

Chap6: quantization.ipynb, Py4JJavaError #88

Closed: mokpolar closed this issue 4 years ago

mokpolar commented 4 years ago

I get this error when running quantization.ipynb:

Flights = spark.read\
    .schema(schema)\
    .csv(inputs)

Py4JJavaError: An error occurred while calling o32.csv.: java.io.IOException: No FileSystem for scheme: gs ...

Why is this error happening?

lakshmanok commented 4 years ago

Looks like the Cloud Storage connector is not installed, so Spark has no filesystem registered for gs:// paths. Are you running on Dataproc?
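
If you are not on Dataproc (where the connector comes preinstalled), here is a minimal sketch of registering the connector yourself when building the session. It assumes you have already downloaded a gcs-connector jar. The jar path, the app name, and the helper names `gcs_connector_conf` and `build_session` are all hypothetical, and credentials setup is not shown. Treat it as a starting point rather than the notebook's own setup.

```python
def gcs_connector_conf(connector_jar):
    """Return Spark settings that register the gs:// scheme with Hadoop.

    connector_jar is a local path to a gcs-connector jar (an assumption:
    you must download a version matching your Hadoop build yourself).
    """
    return {
        # Put the connector jar on the driver and executor classpath.
        "spark.jars": connector_jar,
        # Map the gs:// scheme to the connector's FileSystem classes.
        "spark.hadoop.fs.gs.impl":
            "com.google.cloud.hadoop.fs.gcs.GoogleHadoopFileSystem",
        "spark.hadoop.fs.AbstractFileSystem.gs.impl":
            "com.google.cloud.hadoop.fs.gcs.GoogleHadoopFS",
    }


def build_session(connector_jar, app_name="quantization"):
    """Create a SparkSession with the connector settings applied."""
    # Imported here so the settings helper can be used without pyspark.
    from pyspark.sql import SparkSession

    builder = SparkSession.builder.appName(app_name)
    for key, value in gcs_connector_conf(connector_jar).items():
        builder = builder.config(key, value)
    return builder.getOrCreate()


# Usage (hypothetical jar path):
#   spark = build_session("/path/to/gcs-connector.jar")
#   flights = spark.read.schema(schema).csv(inputs)
```

These settings only take effect if they are applied before the session is created. If a SparkSession already exists in the notebook, stop it first and build a new one.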