mongulu-cm / tchoung-te

Map of Cameroonian associations in France
https://tchoung-te.mongulu.cm/
GNU General Public License v3.0

Replace pyspark by memory efficient pandas wrapper ? #16

Closed billmetangmo closed 1 year ago

billmetangmo commented 2 years ago

To compare pyspark with the other solutions, run a benchmark using either %time, as on the Modin GitHub page, or https://fastero.readthedocs.io/ and https://github.com/bloomberg/memray. Screenshots must be attached to complete the task.
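A minimal sketch of what such a benchmark could look like, using only the standard library (`timeit` and `tracemalloc`) as a stand-in for `%time`, fastero and memray. The helper names `benchmark` and `speedup` and the two workloads are hypothetical placeholders, not code from this repository; the real comparison would call the pyspark and pandas-like pipelines instead. Note that `tracemalloc` only sees allocations made through Python's allocator, so memray remains the better tool for libraries that allocate natively.

```python
import timeit
import tracemalloc


def benchmark(func, repeat=5):
    """Return (best wall-clock seconds, peak traced bytes) for one call of func."""
    # Best of several single-call runs, to reduce noise from the machine.
    best = min(timeit.repeat(func, number=1, repeat=repeat))
    # Separate run under tracemalloc so tracing overhead does not skew the timing.
    tracemalloc.start()
    try:
        func()
        _, peak = tracemalloc.get_traced_memory()
    finally:
        tracemalloc.stop()
    return best, peak


def speedup(baseline_seconds, candidate_seconds):
    """How many times faster the candidate is than the baseline."""
    return baseline_seconds / candidate_seconds


# Hypothetical workloads standing in for the real data pipelines.
def workload_loop():
    total = 0
    for i in range(200_000):
        total += i * i
    return total


def workload_builtin():
    return sum(i * i for i in range(200_000))


if __name__ == "__main__":
    base_time, base_peak = benchmark(workload_loop)
    cand_time, cand_peak = benchmark(workload_builtin)
    print(f"baseline:  {base_time:.4f} s, peak {base_peak} bytes")
    print(f"candidate: {cand_time:.4f} s, peak {cand_peak} bytes")
    print(f"speedup:   {speedup(base_time, cand_time):.2f}x")
```

The printed lines can be captured as the screenshot for each candidate library.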

billmetangmo commented 1 year ago

Apache Spark is 10 times faster than the previous option.

image

billmetangmo commented 1 year ago

Polars (pola.rs) is 83 times faster than pandas.

Image