fornax-navo / fornax-demo-notebooks

Demo notebooks for the Fornax project
https://fornax-navo.github.io/fornax-demo-notebooks/
BSD 3-Clause "New" or "Revised" License
7 stars 19 forks source link

GSOC 2024 Dask starter problem solution. #256

Closed keerthikumanduri closed 1 month ago

keerthikumanduri commented 3 months ago

This pull request has a solution to implement the serial code using Dask. Leveraging Dask dataframes, I've explored a method where I convert a Pandas dataframe to a Dask dataframe and then compute it over a local Dask cluster. I considered this might be a good approach to efficiently handle large datasets.

Additionally, I plan to further investigate alternative methods to address the problem outlined in the accompanying text.md file.

Screenshot 2024-03-18 at 3 26 57 PM
jkrick commented 1 month ago

This was a starter problem for GSoC2024 and not meant to be merged.