apache / datafusion

Apache DataFusion SQL Query Engine
https://datafusion.apache.org/
Apache License 2.0
6.39k stars 1.21k forks source link

Write a case study about using DataFusion #7015

Open alamb opened 1 year ago

alamb commented 1 year ago

Is your feature request related to a problem or challenge?

Part of getting people to use DataFusion is to hear what other people are doing with it, why, and what success they found

This helps potential new users evaluate if they want to spend more time learning about DataFusion

Describe the solution you'd like

I would love it if people wrote some case studies (1-2 page):

  1. What their project is
  2. How they use DataFusion
  3. Why they chose DataFusion
  4. What features of DataFusion they use

Describe alternatives you've considered

No response

Additional context

This was suggested by @MrPowers

MrPowers commented 1 year ago

I think dask-sql would be a great case study because it was successful and allowed them to eliminate an unpleasant Java dependency.

alamb commented 1 year ago

@andygrove or @jdye64 I wonder if you know who might be interested in writing a case study for Dask?

jdye64 commented 1 year ago

@alamb I'd be glad to take this on. Do we have any existing case studies I could use as a template or do we know what platform/medium the case study would be hosted on as that might help spark what little creativity I have =)

I'll be honest I'm a little busy at the moment but could see myself freeing up in the next couple of weeks to have time for something like this unless someone else gets to it first.

alamb commented 1 year ago

@alamb I'd be glad to take this on. Do we have any existing case studies I could use as a template or do we know what platform/medium the case study would be hosted on as that might help spark what little creativity I have =)

There aren't any existing studies that I know of, but I plan to write one for IOx

I'll be honest I'm a little busy at the moment but could see myself freeing up in the next couple of weeks to have time for something like this unless someone else gets to it first.

That would be great -- thanks @jdye64 -- I don't think this is super urgent, I view it as part of a longer term push to make DataFusion easier to find and used by more projects