glatterf42 closed this 8 months ago
@meksor I haven't quite figured out how best to address even the first pandas warnings that I started to look at. In particular, please check out `data/db/meta/repository.py`: the original solution is now triggering this:
```
tests/core/test_meta.py::test_run_meta[test_sqlite_mp]
  /home/fridolin/ixmp4/ixmp4/data/db/meta/repository.py:176: DeprecationWarning: DataFrameGroupBy.apply operated on the grouping columns. This behavior is deprecated, and in a future version of pandas the grouping columns will be excluded from the operation. Either pass `include_groups=False` to exclude the groupings or explicitly select the grouping columns after groupby to silence this warning.
    return df.groupby("type", group_keys=False).apply(map_value_column)
```
Until now, I haven't been able to replicate the exact behaviour of this without triggering warnings. The suggested solutions don't work for me: `include_groups=False` leads to an error because `"type"` is not available to be changed in `map_value_column()`, and the only way I found to select a group (`get_group()`) can only ever select one group, while we want to work on all groups. Maybe looping over the groups is still the preferable solution here, since the others aren't quite working either.
The other solutions all come from the same question on StackOverflow, specifically these answers:
None of these are quite there yet; I feel the one using numba might even be closest, but `numba.jit` doesn't seem to support types generic enough to hold our `value` column, I'm afraid.
Please also note that my solution for the first FutureWarning was to just not use `pd.DataFrame.fillna()` at all, since the test dataframe already contains only `np.nan` and `None`, but this might not generally be true for future use cases.
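A minimal sketch of why skipping `fillna()` works here (the series content is illustrative, not the actual test data): both markers already count as missing, and if they ever need to be unified, an explicit rebuild sidesteps the downcasting FutureWarning that `fillna()` can emit on object-dtype columns.

```python
import numpy as np
import pandas as pd

# Both np.nan and None register as missing, so a check that only asks
# "is this value set?" needs no fillna() call at all:
s = pd.Series([1.0, np.nan, None], dtype=object)
missing = s.isna()

# If the two markers ever need to be unified for a future use case, an
# explicit rebuild avoids silent downcasting entirely:
unified = pd.Series([np.nan if pd.isna(v) else v for v in s], dtype=object)
```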
We also already receive warnings for the next big change announced for pandas 3.0.0: Copy-on-Write will be the default and only mode of operation. Fortunately, there already exists a migration guide, and we can invest some time to future-proof our code now.
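Opting in early is one way to do that: the `mode.copy_on_write` option has existed since pandas 2.0, and enabling it now surfaces chained-assignment problems before the 3.0 switch (the dataframe below is illustrative only):

```python
import pandas as pd

try:
    pd.set_option("mode.copy_on_write", True)
except Exception:
    pass  # pandas >= 3.0: CoW is always on and the option may be gone

df = pd.DataFrame({"a": [1, 2]})
column = df["a"]      # under CoW this behaves like an independent copy
df.loc[0, "a"] = 99   # writing to df no longer mutates `column`
```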
plf did and keep an eye on the project in case someone picks it back up. `apply` has always been very slow. But if you find a good solution, it's probably best to use it everywhere `apply` is used. `nan`s with `None`s, but it may be best to check with @danielhuppmann? It's probably also wise to bake those requirements into a test once clarified.

Thanks for the feedback @meksor, I'll try to incorporate it further soon.
@danielhuppmann, @phackstock: I want to prioritize this PR to support Python 3.12, so we should aim for a release soon after this PR. Hopefully, pyam-iamc can then follow suit and support 3.12, which would allow message-ix-models and message_data to support it. Currently, new users are running into issues because they see that message_ix and ixmp do support 3.12, but don't see that the rest of the stack doesn't necessarily. Unfortunately, though, I have a small presentation to set up before I can focus on this.
Thanks @glatterf42, sounds good to me to support python 3.12, I'll make sure that pyam will follow suit after the ixmp4 release. It would be critical to have the pyam-ixmp4-release before end of February 22 so that users can directly query the ixmp4 ssp-extensions database via our Python tools (and the public launch of that project will be end of February).
Wrt `lazy_fixtures`, I'm actively following that discussion. Pytest might incorporate `lazy_fixtures` into its core implementation, but that might also take some time.
I've reverted the pandas update and mypy is happy after the rebase, too.
The PR now also checks python 3.12 and it's going surprisingly well. Locally, I ran into another error with `test_benchmarks_filter` (in the api layer; I'll try to replicate it tomorrow or so), but it doesn't happen here, so I don't know if we're good. I'll also check if `dateutil` and `multimethod` can be updated, but apart from these two warnings (and potentially some documentation), we seem to be almost good to go for 3.12 :)
Re: `dateutil`: This is not a direct dependency of ours, and we don't call the deprecated function anywhere in our code. Dependencies like pandas and jupyter do depend on `dateutil`, though, whose last release was 2.5 years ago. Their current master branch already contains a fix for the DeprecationWarning we are seeing, but they haven't managed to publish a new release yet, despite popular demand:
So for now, if we can live with the DeprecationWarning, I'd suggest keeping track of it with a new issue, but merging this PR anyway (if the other things are dealt with).
Re: `multimethod`: same here, this is a transitive dependency of ours. Pygments and pandera are using it, and the issue arises from an incompatibility between pandera 0.18.0 and multimethod 1.11.0. This issue has already been raised with pandera and a version pin introduced on their main branch, but that was just ten hours ago, so they haven't had time for a new release yet. We might get a notification about a new release by watching this PR:
In this case, it seems more likely that a patch release is coming soon, so we might not need to keep track with a new issue.
Finally, this is the error I receive 12 times when running `test_filter_benchmarks` locally:
```
________ ERROR at teardown of test_filter_datapoints_benchmark[filters0-test_pgsql_mp_generated] ________

request = <SubRequest 'profiled' for <Function test_filter_datapoints_benchmark[filters0-test_pgsql_mp_generated]>>

    @pytest.fixture(scope="function")
    def profiled(request):
        testname = request.node.name
        pr = cProfile.Profile()

        @contextlib.contextmanager
        def profiled():
            pr.enable()
            yield
            pr.disable()

        yield profiled
>       ps = pstats.Stats(pr)

tests/conftest.py:95:
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
/usr/lib/python3.12/pstats.py:115: in __init__
    self.init(arg)
/usr/lib/python3.12/pstats.py:129: in init
    self.load_stats(arg)
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _

self = <pstats.Stats object at 0x7f81bd7662d0>, arg = <cProfile.Profile object at 0x7f81bdcd7b60>

    def load_stats(self, arg):
        if arg is None:
            self.stats = {}
            return
        elif isinstance(arg, str):
            with open(arg, 'rb') as f:
                self.stats = marshal.load(f)
            try:
                file_stats = os.stat(arg)
                arg = time.ctime(file_stats.st_mtime) + " " + arg
            except:  # in case this is not unix
                pass
            self.files = [arg]
        elif hasattr(arg, 'create_stats'):
            arg.create_stats()
            self.stats = arg.stats
            arg.stats = {}
        if not self.stats:
>           raise TypeError("Cannot create or construct a %r object from %r"
                % (self.__class__, arg))
E           TypeError: Cannot create or construct a <class 'pstats.Stats'> object from <cProfile.Profile object at 0x7f81bdcd7b60>

/usr/lib/python3.12/pstats.py:155: TypeError
```
Unfortunately, we will have to look into them, as they only appear successful here while being skipped. I receive the above error for every filter (0 through 5) for both `test_pgsql_mp_generated` and `test_api_pgsql_mp_generated`.
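One way to harden the fixture against this is sketched below (the `used` flag is an addition, not part of ixmp4; the structure mirrors the `profiled` fixture from `tests/conftest.py` shown in the traceback): record whether the profiler ever ran, and only build the `pstats.Stats` in teardown when it did.

```python
import contextlib
import cProfile
import pstats

def make_profiled():
    # Sketch of a hardened version of the `profiled` fixture body: the
    # `used` flag records whether the inner context manager ever ran, so
    # teardown can skip pstats.Stats() for tests that were skipped
    # before profiling started.
    pr = cProfile.Profile()
    used = False

    @contextlib.contextmanager
    def profiled():
        nonlocal used
        used = True
        pr.enable()
        try:
            yield
        finally:
            pr.disable()

    def teardown():
        # pstats.Stats() raises TypeError on a profile without collected
        # stats, which is exactly the teardown error in the traceback.
        return pstats.Stats(pr) if used else None

    return profiled, teardown
```

In the real fixture, the part before `yield profiled` stays the same and `teardown()` corresponds to the code after the yield.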
Here's some more information about the `test_benchmark_filter` tests:

- `generated_mp` is only triggered within the test function, which results in a `pytest.skip` statement. However, the `profiled` fixture is evaluated before the test function, too, and once the skip is triggered in the function, it results in the TypeError above (which is why I get two results locally for each `_pgsql_` test).
- The `pytest.skip` marker from `generated_mp` was still applied to the function, even though usually skip markers are evaluated before fixtures are run.
- We would need to set up the `_mp` fixtures (all of them) in a way that allows their `pytest.skip`s to stop other fixtures from running.

The current solution is to test once if a connection to a postgresql db can be established and skip all corresponding tests if it can't. This fix is now applied for all `pgsql_mp` fixtures. My main question for @meksor is: do we need https://github.com/iiasa/ixmp4/blob/1013429d436706453869c1846fccc299c6daa75b/tests/conftest.py#L31-L32 to safely disconnect from the db? If yes, mypy would not be happy about that, I think.
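The connectivity check can be sketched like this; the function name, host, port, and fixture wiring are assumptions for illustration, not the actual `conftest.py` code:

```python
import socket

def postgres_reachable(host: str = "localhost", port: int = 5432) -> bool:
    # One cheap probe: if a TCP connection to the server fails, every
    # pgsql-backed fixture can skip immediately instead of erroring
    # later during profiling teardown.
    try:
        with socket.create_connection((host, port), timeout=1.0):
            return True
    except OSError:
        return False

# In conftest.py this could back a session-scoped fixture, e.g.:
#
# @pytest.fixture(scope="session")
# def pgsql_available():
#     if not postgres_reachable():
#         pytest.skip("no PostgreSQL server available")
```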
Please check if we're missing stated-py3.12 support in the docs; apart from that, this might pretty much be good to go :)
Yes, we need `backend.close`! Why does mypy care?
Mypy is showing `"Backend" has no attribute "close"  [attr-defined]`. Curiously, this error is not shown e.g. here: https://github.com/iiasa/ixmp4/blob/55a4774b54cbb9c9b6b61dcdccb6b6c0b77ae5c7/tests/conftest.py#L115-L118
And when I insert a `yield mp` statement between my `mp =` line and my `mp.backend.close()` line, the mypy error goes away (even though `yield` is not allowed outside of a function). So my guess is that mypy assumes something is happening to `mp` after it's yielded, such that it does have `.backend.close()` at the time we call it. Which is probably not what we want; it should probably just recognize the function the first time around.

In other news, what do you mean by 'we need `backend.close`'? The tests seem to have been running fine even without it.
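For what it's worth, a generator-style fixture keeps setup, yield, and teardown in one function, so mypy type-checks `mp.backend.close()` against the very object it sees created. The `_Platform`/`_Backend` classes below are stand-ins for the real ixmp4 classes, assumed only for this sketch:

```python
# Stand-ins for the real ixmp4 Platform/Backend (assumptions).
class _Backend:
    def __init__(self) -> None:
        self.closed = False

    def close(self) -> None:
        self.closed = True

class _Platform:
    def __init__(self) -> None:
        self.backend = _Backend()

def platform_fixture():
    # In tests/conftest.py this function would carry @pytest.fixture.
    # Because setup and teardown share one scope, mypy checks
    # mp.backend.close() against the object created two lines above.
    mp = _Platform()
    yield mp            # the test body runs while the generator is paused
    mp.backend.close()  # teardown: release the db connection
```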
This PR should now be good to go; fixed the spelling mistake (that already existed before this PR) and updated all badges/docs in general.
Looks like the tests are currently failing because the network at https://api.dev.manager.ece.iiasa.ac.at/v1/ is unreachable.
I'm going to merge this now, please let me know if we need any cleanup and I'll set up a new PR.
While troubleshooting a message-ix-models CI failure, I noticed that I couldn't update to the latest versions of the dependencies because of outdated dependency versions in ixmp4 0.6.0. So this PR will eventually bump these/all dependencies.
At the moment, though, it still faces several warnings that need to be addressed. I'm not sure we can remove all of these warnings, though:
- `data/db/base.py:bulk_upsert_chunk()` (in the `meta` tests), as detailed here.
- `db/filters.py`, where the result of `field_info.json_schema_extra.get()` can be a multitude of types, while we only expect to deal with specific ones (though for line 189, I'm not sure whether we're expecting a dict or a list).

However, once these are clarified, I suggest releasing the next minor version.
Side note: I also want to try whether we can support python 3.12; we might need a more recent version of e.g. pluggy, but that is a transitive dependency, so I don't know how much work this might be.