GoogleCloudPlatform / covid-19-open-data

Datasets of daily time-series data related to COVID-19 for over 20,000 distinct locations around the world.
Apache License 2.0
472 stars 130 forks source link

For R, suggest using fread from data.table #380

Open demetrios1 opened 3 years ago

demetrios1 commented 3 years ago

Seeing how big this file is, I think data.table::fread('https://storage.googleapis.com/covid19-open-data/v2/main.csv', na.strings = c("", "NA", "#N/A")) may be more appropriate than read.csv for the R-documentation!

owahltinez commented 3 years ago

@demetrios1 sorry for the late reply. This sounds like a great suggestion! Admittedly I'm not an expert in R. If you are open to it, we'd absolute accept a pull request to improve the documentation.

Do you know if the data.table::fread() function is able to handle gzipped files?

demetrios1 commented 3 years ago

hi @owahltinez . Thanks for getting back! I'm not sure if it can to be honest.

owahltinez commented 3 years ago

@a27cheung will be looking into this as part of a big documentation update coming soon :-)

a27cheung commented 3 years ago

Hi @demetrios1 - will get to this as soon! Apologies for the delay - we’re working on a presentation to a large group this week :)