dask / dask-examples

Easy-to-run example notebooks for Dask
https://examples.dask.org/
Creative Commons Attribution Share Alike 4.0 International
375 stars 228 forks source link

notebook example for transfering code from pure pandas to dask #68

Open sephib opened 5 years ago

sephib commented 5 years ago

Hi, I'm writing a notebook example to highlight some key differences between pandas and dask. Are you interested in such a PR? If so i have currently the following topics - (are there any additional topics that I should include?) :

  1. Dask does not update - thus no "inplace=True": (e.g. rename, reset_index, dropna,)
  2. reading/saving dataframes (with *)
  3. some gotcha's with index
  4. dd.Aggregatoin vs groupby.apply
mrocklin commented 5 years ago

Sure, something like that seems nicely in scope. I look forward to seeing what you come up with!

On Tue, Apr 23, 2019 at 9:36 AM sephib notifications@github.com wrote:

Hi, I'm writing a notebook example to highlight some key differences between pandas and dask. Are you interested in such a PR? If so i have currently the following topics - (are there any additional topics that I should include?) :

  1. Dask does not update - thus no "inplace=True": (e.g. rename, reset_index, dropna,)
  2. reading/saving dataframes (with *)
  3. some gotcha's with index
  4. dd.Aggregatoin vs groupby.apply

— You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub https://github.com/dask/dask-examples/issues/68, or mute the thread https://github.com/notifications/unsubscribe-auth/AACKZTFHKJMVVGOSEZIZSWLPR4NHJANCNFSM4HHYTBCA .

sephib commented 5 years ago

Hi, I'll try and get some information from the stackoverflow data Once I analyze it I'll update to get your feedback.