Closed Thomzoy closed 1 year ago
Base: 91.89% // Head: 91.92% // Increases project coverage by +0.02%
:tada:
Coverage data is based on head (
bd30bca
) compared to base (e29f4a5
). Patch coverage: 91.30% of modified lines in pull request are covered.
:mega: This organization is not using Codecov’s GitHub App Integration. We recommend you install it so Codecov can continue to function properly for your repositories. Learn more
:umbrella: View full report at Codecov.
:loudspeaker: Do you have feedback about the report comment? Let us know in this issue.
3 small fixes:
Saving tables locally
In spark client or cluster mode, saving a table as parquet won't work because of permission error: executors and driver aren't the same user.
How its solved:
Tables are first collected, and then saved locally by the driver only.
Incorrect timestamp error
When collecting tables,
pyarrow
throws an error when stumbling upon incorrect timestamps (smaller thanpd.Timestamp.min
or bigger thanpd.Timestamp.max
).How it's solved
A filtering is done in
HiveData
, which remplace incorrect timestampsBiology configuration file
When creating a configuration file via
create_config_from_stats
, one row per code AND care site is created, along with a line aggregating all care sites. We only want to keep this row (identified bydf.care_site_short_name == "ALL"
)How it's solved
Filtering, when necessary, on the
care_site_short_name
column