data-apis / scipy-2023-presentation

Materials for the SciPy 2023 Data APIs Consortium presentation and proceedings paper
MIT License

Motivating example #2

Closed by asmeurer 1 year ago

asmeurer commented 1 year ago

Do we have a good motivating example for the talk/paper? I know we have @anirudhdagar's SciPy demo (https://github.com/scipy/scipy/compare/main...AnirudhDagar:scipy:array-api-demo) as well as @thomasjpfan's scikit-learn PR (https://github.com/scikit-learn/scikit-learn/pull/22554/files). I could crib some relevant parts from the diffs there. Or should we come up with a standalone script that does something? Some good things to show in the example would be:

I can demonstrate all of these using the above SciPy and scikit-learn PRs. So the question is whether it's better to show actual real-world usage or to make the example more coherent and self-contained.

Either way, we'll definitely mention the SciPy and scikit-learn efforts later, regardless of the example we choose.

thomasjpfan commented 1 year ago

For talks, I usually try to start with "something interesting enabled by the new tech" to get everyone excited. In this case: "Look at all the benefits SciPy and scikit-learn get from using the Array API". For scikit-learn, we have docs on Array API usage, and this benchmark notebook to show the performance benefits.

Afterwards, one can dive into the implementation details of getting the Array API to work in attendees' own projects. For implementation details, I think it's better to be more coherent and self-contained. This can include real-world usage if it is self-contained enough.
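To make "coherent and self-contained" concrete, here is a minimal sketch of what such an example might look like. It is not taken from the SciPy or scikit-learn diffs; the names `_namespace_of` and `standardize` are hypothetical, and the fallback to the `numpy` module is an assumption made so the script also runs on NumPy releases whose arrays do not expose `__array_namespace__`.

```python
import numpy as np


def _namespace_of(x):
    """Return the array API namespace for x (hypothetical helper)."""
    # Arrays that implement the standard expose __array_namespace__().
    # Assumption: if the method is missing, x is a NumPy array from an
    # older release, so the numpy module itself is used instead.
    if hasattr(x, "__array_namespace__"):
        return x.__array_namespace__()
    return np


def standardize(x):
    """Center and scale each column using only array API functions."""
    xp = _namespace_of(x)
    mean = xp.mean(x, axis=0)  # per-column mean
    std = xp.std(x, axis=0)    # per-column (population) standard deviation
    return (x - mean) / std


x = np.asarray([[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]])
z = standardize(x)
print(z)
```

The point of the sketch is that `standardize` never calls `np.` directly, so the same body could in principle run on another library's arrays. For libraries whose arrays don't expose `__array_namespace__`, a compatibility layer such as array-api-compat would be needed in place of the NumPy fallback.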

asmeurer commented 1 year ago

By the way, if you have any benchmarks with graphs or other nice figures that we could include, that would be useful.

thomasjpfan commented 1 year ago

I reran the scikit-learn benchmarks comparing CuPy, PyTorch+GPU, PyTorch+CPU, and NumPy. The results are in this gist, which contains a notebook and a CSV file.