PacktPublishing / Distributed-Data-Systems-with-Azure-Databricks

Distributed Data Systems with Azure Databricks, published by Packt
MIT License
12 stars 10 forks source link

Chapter 8: Page 262 #4

Open tanthiamhuat opened 2 years ago

tanthiamhuat commented 2 years ago

I can just said this book is simply a great disappointment. From a look at the code below, it just shows how sloppy the whole book is:

import pandas as pd white_wine = pd.read_csv("/dbfs/databricks-datasets/winequality/winequality-white.csv", sep=";") red_wine = pd.read_csv("/dbfs/databricks-datasets/winequality/winequality-red.csv", sep=";")

Any novice Spark user will know the above code would run with errors.

DataSpacon commented 1 year ago

Actually, it works. The path in your code is misspelled. Correct lines are: (taken from the book) ` import pandas as pd white_wine = pd.read_csv("/dbfs/databricks-datasets/wine-quality/winequality-white.csv", sep=";") red_wine = pd.read_csv("/dbfs/databricks-datasets/wine-quality/winequality-red.csv", sep=";")

`