Closed jacobtomlinson closed 7 months ago
Thanks for handling this @jacobtomlinson. +1 to merge from my perspective.
However, I'm also curious about the failing tests. If you have the appetite to fix more things and want to look into that, it would be welcome.
Leaving when to merge up to you.
Yeah I'm wondering if upgrading to `0.16` has caused some other issues. Let me dig a little.
Ok it looks like `deltalake` has changed how it reads a few dtypes from the `all_primitive_types` dataset.
binary       string[pyarrow]        # deltalake
binary       object                 # dask
timestamp    datetime64[ns, UTC]    # deltalake
timestamp    datetime64[ns]         # dask
It looks like the failing test is already smoothing out some dtype issues and assumes `deltalake` is correct, so I've updated the test to do the same here.
However, the test still fails, complaining that the dtypes are not correct, and I'm a little stumped.
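For reference, the kind of smoothing mentioned here could look roughly like the sketch below. It uses plain pandas and only covers the timestamp case; `localize_naive_timestamps` is a hypothetical helper name, not something taken from dask-deltatable or its test suite. The `binary` column would need a separate `astype`, which assumes pyarrow is installed.

```python
import pandas as pd


def localize_naive_timestamps(df: pd.DataFrame, tz: str = "UTC") -> pd.DataFrame:
    """Return a copy of df whose tz-naive datetime columns are localized to tz.

    Hypothetical helper for illustration only.
    """
    out = df.copy()
    for col in out.columns:
        # is_datetime64_dtype is True only for tz-naive datetime64 columns;
        # tz-aware columns use DatetimeTZDtype and are left untouched.
        if pd.api.types.is_datetime64_dtype(out[col].dtype):
            out[col] = out[col].dt.tz_localize(tz)
    return out


# Stand-in for the frame that would be adjusted before comparison.
expected = pd.DataFrame({"timestamp": pd.to_datetime(["2024-01-01"]), "n": [1]})
adjusted = localize_naive_timestamps(expected)
print(adjusted.dtypes)
```

Localizing (rather than converting) treats the naive values as already being in UTC, which is an assumption about the data, not something established in this thread.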
cc @phofl who might have thoughts on dtypes returned here (I suspect that deltalake is using arrow)
cc @jrbourbeau who also opened #74 about this issue.
The string thing seems fine; the UTC change is a little bit odd.
@jrbourbeau @phofl do either of you have thoughts on how I can move this PR forward? I'm a little stuck as to why the assertion is failing.
Do you have the dtypes of both objects?
Yeah, but when I print them out they appear identical. So something deeper must be happening.
Can you post them here?
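When printed dtypes look the same, it can help to compare them column by column programmatically instead of by eye. A minimal sketch, assuming both sides are pandas DataFrames (for a dask DataFrame one would presumably compute it first); `dtype_mismatches` is a hypothetical helper name.

```python
import pandas as pd


def dtype_mismatches(left: pd.DataFrame, right: pd.DataFrame) -> dict:
    """Map each column whose dtype differs to a (left, right) pair of dtype strings.

    Hypothetical helper for illustration only. Columns present on one side
    only are reported with None for the missing side.
    """
    mismatches = {}
    for col in sorted(set(left.columns) | set(right.columns)):
        l_dtype = left[col].dtype if col in left.columns else None
        r_dtype = right[col].dtype if col in right.columns else None
        if l_dtype is None or r_dtype is None:
            mismatches[col] = (
                None if l_dtype is None else str(l_dtype),
                None if r_dtype is None else str(r_dtype),
            )
        elif not pd.api.types.is_dtype_equal(l_dtype, r_dtype):
            mismatches[col] = (str(l_dtype), str(r_dtype))
    return mismatches


naive = pd.DataFrame({"x": [1], "ts": pd.to_datetime(["2024-01-01"])})
aware = naive.copy()
aware["ts"] = aware["ts"].dt.tz_localize("UTC")
print(dtype_mismatches(naive, aware))
```

If this reports nothing while the assertion still fails, the difference is probably somewhere other than the column dtypes, for example in the index or in metadata.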
Attention: Patch coverage is 25.00000%, with 3 lines in your changes missing coverage. Please review.
Project coverage is 70.90%. Comparing base (ec1c90c) to head (696cfd3).
Files | Patch % | Lines |
---|---|---|
dask_deltatable/write.py | 33.33% | 2 Missing :warning: |
dask_deltatable/core.py | 0.00% | 1 Missing :warning: |
I came to give my review, noticed some conflicts, and thought I'd make a PR to yours (https://github.com/jacobtomlinson/dask-deltatable/pull/1) to help resolve the conflicts caused by https://github.com/dask-contrib/dask-deltatable/pull/78. I then noticed that it did most of what was already done here. My apologies for stepping on your toes.
Thanks for the reviews and iterations on this everyone. I'm going to merge this in now.
There were some mypy issues in #71 that were unrelated. Following up here to fix things up. Closes #74
There seem to be two issues when running `pre-commit run --all-files`:
- `MAX_SUPPORTED_PYARROW_WRITER_VERSION` was renamed in `deltalake>=0.16`. Updated the name and bumped the minimum version.
- `**kwargs` to `tokenize` makes `mypy` unhappy. Explicitly ignoring that.