Eventual-Inc / Daft

Distributed DataFrame for Python designed for the cloud, powered by Rust
https://getdaft.io
Apache License 2.0
1.79k stars 108 forks source link

Can you provide an example of large-scale text deduplication, such as the following example #2235

Open simplew2011 opened 3 weeks ago

simplew2011 commented 3 weeks ago
jaychia commented 3 weeks ago

Great idea! Let me work on something :)