iris-hep / analysis-grand-challenge

Repository dedicated to AGC preparations & execution
https://agc.readthedocs.io
MIT License
24 stars 39 forks source link

Core dependencies for running AGC implementation #199

Open alexander-held opened 1 year ago

alexander-held commented 1 year ago

We had some discussions in a few places (#140, #141, #144) about how to provide the relevant information for environment setup. This is meant to gather the core pieces that would allow running the latest non-notebook version (otherwise Jupyter is needed) with Python futures (see also #143, otherwise Dask is required as well).

The list currently includes:

coffea cabinetry servicex-clients scikit-learn xgboost

This will pick up a lot of core dependencies as well automatically.

Please report any issues and missing libraries encountered.

cc @eguiraud

alexander-held commented 1 year ago

Possible future pitfalls to be aware of: coffea versions beyond 0.7 ("coffea 2023") will only be compatible with future versions of the AGC implementation here (#116) and otherwise not work with the current notebook. The same holds for awkward>=2.

eguiraud commented 11 months ago

Hi, I tried just putting those 5 packages in a requirements.txt and run pip install over it in an empty virtualenv, and pip takes forever to find a set of versions that are compatible with each other.

I still find the conda environment.yml is the simplest way to provide an environment that works out of the box (as it also fixes the python version, see #144 ).

matthewfeickert commented 11 months ago

@eguiraud Can I have you look at https://iris-hep.org/analysis-systems-env-nightlies/ and check to see what additional packages would be needed to test a nightly AGC lock file build?