Closed krlmlr closed 2 weeks ago
Hi, just to add, I've found an issue that I think is related:
I was finding the maximum value for several variables using the following code:
vector_of_file_paths |>
map(\(path)
duckplyr_df_from_csv(path) |>
select(any_of(vector_of_numeric_col_names)) |>
rename_with(tolower) |>
head(5) |>
summarise(across(everything(), ~ max(.x, na.rm = T))) |>
bind_rows()
and it was failing at the last step when a column only happened to have NAs in the first 5 rows. If I didn't row bind, I could see the result column had a value of -Inf
. I can see from the output during run that these result columns were interpreted by duckplyr as "varchar"
I am unable to share logs from the environment I work in, sorry.
Thanks, @nicki-dese, this is a different problem. Can you please open a new issue?
A reprex would be very useful. This would be a self-contained example with toy data, see, e.g., https://reprex.tidyverse.org/articles/reprex-dos-and-donts.html .
Set operations are good now, joins need https://github.com/tidyverse/dplyr/pull/7029.
Sorry for the delayed reply. I tried generating a reprex and it didn’t error in the same way. If I can manage to reproduce, I’ll post a new issue.
Nicki Norris Assistant Director - Data Phone (02) 6240 8969
From: Kirill Müller @.> Sent: Tuesday, May 21, 2024 2:48 AM To: duckdblabs/duckplyr @.> Cc: NORRIS,Nicki @.>; Mention @.> Subject: Re: [duckdblabs/duckplyr] Need to be stricter about column compatibility (Issue #168)
CAUTION: This email originated from outside of the organisation. Do not click links or open attachments unless you recognise the sender and know the content is safe.
Thanks, @nicki-desehttps://github.com/nicki-dese, this is a different problem. Can you please open a new issue?
A reprex would be very useful. This would be a self-contained example with toy data, see, e.g., https://reprex.tidyverse.org/articles/reprex-dos-and-donts.html .
— Reply to this email directly, view it on GitHubhttps://github.com/duckdblabs/duckplyr/issues/168#issuecomment-2120822504, or unsubscribehttps://github.com/notifications/unsubscribe-auth/APJUET37UGV7JJ4GWTEVKZLZDISNNAVCNFSM6AAAAABHZJ4TVOVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDCMRQHAZDENJQGQ. You are receiving this because you were mentioned.Message ID: @.**@.>> Notice:
The information contained in this email message and any attached files may be confidential information, and may also be the subject of legal professional privilege. If you are not the intended recipient, any use, disclosure or copying of this email is unauthorised. If you received this email in error, please notify the sender by contacting the department's switchboard on 1300 566 046 during business hours (8:30am - 5pm Canberra time) and delete all copies of this transmission together with any attachments.
Done now.
Also, use simple identity instead of
r_base::==
for joins.Created on 2024-05-16 with reprex v2.1.0