fugue-project / tutorials

Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask without any rewrites.
https://fugue-tutorials.readthedocs.io/
Apache License 2.0
111 stars 19 forks source link

PyPI version PyPI pyversions PyPI license codecov Codacy Badge Downloads

Tutorials API Documentation Chat with us on slack!
Jupyter Book Badge Doc Slack Status

Fugue is a unified interface for distributed computing that lets users execute Python, pandas, and SQL code on Spark, Dask, and Ray with minimal rewrites.

Tutorials

The best way to get started with Fugue is to work through the 10 minute tutorials:

Running Tutorials Interactively

Using Binder

Binder

Note it runs slow on binder because the machine on binder isn't powerful enough for a distributed framework such as Spark. Parallel executions can become sequential, so some of the performance comparison examples will not give you the correct numbers.

Using Docker

Alternatively, you should get decent performance by running this Docker image on your own machine:

docker run -p 8888:8888 fugueproject/tutorials:latest

Community

Feel free to message us on Slack