dask / dask-tutorial

Dask tutorial
https://tutorial.dask.org
BSD 3-Clause "New" or "Revised" License

01_dask.delayed.ipynb dask.delayed parallelization section does not have a filenames variable declared #218

Closed ghost closed 2 years ago

ghost commented 3 years ago

    sums = []
    counts = []
    for fn in filenames:  # filenames variable does not exist
        df = pd.read_csv(fn)
        by_origin = df.groupby('Origin')
        total = by_origin.DepDelay.sum()
        count = by_origin.DepDelay.count()
        sums.append(total)
        counts.append(count)
    total_delays = sum(sums)
    n_flights = sum(counts)
    mean = total_delays / n_flights

jrbourbeau commented 3 years ago

Thanks for raising an issue @gurvinder-yadav. filenames is defined here

[Screenshot, 2021-07-21: the notebook cell that defines filenames]

Are you sure you ran all the notebook cells in order?
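
For anyone hitting the same NameError after skipping cells, here is a minimal sketch of how a filenames list of this kind can be built before the loop runs. The helper name list_csv_files and the data/nycflights directory are assumptions for illustration, not taken from this thread; adjust the path to wherever the CSV files actually live.

```python
import os
from glob import glob


def list_csv_files(data_dir):
    """Return the sorted CSV paths in data_dir (hypothetical helper)."""
    paths = sorted(glob(os.path.join(data_dir, "*.csv")))
    if not paths:
        # Fail loudly here rather than letting the loop run over nothing.
        raise FileNotFoundError(f"no CSV files found in {data_dir!r}")
    return paths


# Assumed location of the flight data; change this to match your setup.
DATA_DIR = os.path.join("data", "nycflights")

if os.path.isdir(DATA_DIR):
    filenames = list_csv_files(DATA_DIR)
    print(f"found {len(filenames)} CSV files")
else:
    print(f"{DATA_DIR} does not exist; download or prepare the data first")
```

Once filenames is defined this way, the loop quoted in the issue can iterate over it. Restarting the kernel and running every cell from the top is usually the quickest way to confirm that nothing was skipped.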

jacobtomlinson commented 2 years ago

Closing due to inactivity.