coiled / dask-community

Issue tracker for the Dask community team
MIT License

[Stack Overflow] Operating large .csv file with pandas/dask Python #781

Open github-actions[bot] opened 2 years ago

github-actions[bot] commented 2 years ago

I have a large .csv file (5 GB) from the UK Land Registry. I need to find every property that has been bought or sold two or more times.

Each row of the table looks like this:

{F887F88E-7D15-4415-804E-52EAC2F10958},"70000","1995-07-07 00:00","MK15 9HP","…
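For readers skimming this thread, here is a minimal sketch of one way to approach the task described above. It is not taken from the linked question or its answers. It swaps pandas/dask for a standard-library streaming count so that it stays self-contained; the same counting idea should carry over to chunked reads with pandas `read_csv(chunksize=...)` or to a dask groupby. The function name `repeat_sales`, the `key_cols` parameter, and the sample rows are all hypothetical. In particular, which columns identify a single property is an assumption the reader has to settle, since the row excerpt above is truncated.

```python
import csv
import io
from collections import Counter


def repeat_sales(lines, key_cols):
    """Count rows per property key and keep keys seen two or more times.

    lines:    any iterable of CSV text lines (an open file works).
    key_cols: zero-based column positions that together identify a property.
              Which columns do that is an assumption the caller must supply.
    """
    counts = Counter()
    for row in csv.reader(lines):
        if not row:
            continue  # skip blank lines
        counts[tuple(row[i] for i in key_cols)] += 1
    # Keep only the keys that occur at least twice.
    return {key: n for key, n in counts.items() if n >= 2}


# Made-up rows for illustration only: id, price, date, postcode, address.
sample = io.StringIO(
    '{ID-1},"70000","1995-07-07 00:00","AA1 1AA","1 EXAMPLE ROAD"\n'
    '{ID-2},"85000","1999-03-01 00:00","AA1 1AA","1 EXAMPLE ROAD"\n'
    '{ID-3},"60000","1996-01-15 00:00","AA1 1AA","2 EXAMPLE ROAD"\n'
    '{ID-4},"99000","2004-10-20 00:00","BB2 2BB","5 SAMPLE LANE"\n'
    '{ID-5},"120000","2010-05-05 00:00","AA1 1AA","1 EXAMPLE ROAD"\n'
)
repeats = repeat_sales(sample, key_cols=(3, 4))
print(repeats)  # {('AA1 1AA', '1 EXAMPLE ROAD'): 3}
```

For a real file, passing an open file handle (`open(path, newline="")`) instead of the `StringIO` object reads one row at a time, so memory use grows with the number of distinct keys rather than with the 5 GB file size. Rows with fewer fields than `key_cols` expects would raise an `IndexError` in this sketch.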

Would you like to know more?

Read the full article on the following website:

https://stackoverflow.com/questions/71628546/operating-large-csv-file-with-pandas-dask-python

pavithraes commented 2 years ago

Sultan and Michael Delgado have engaged on this. We can keep an eye on it though. :)