CSSEGISandData / COVID-19

Novel Coronavirus (COVID-19) Cases, provided by JHU CSSE
https://systems.jhu.edu/research/public-health/ncov/
29.11k stars, 18.39k forks

US Counties #1111

Open tghkwomack opened 4 years ago

tghkwomack commented 4 years ago

It would be nice to have the US Counties represented in a way that didn't include a comma. The comma throws off attempts to load the data as a comma-delimited file using methods that don't readily allow a text qualifier.

jcmartel commented 4 years ago

Where are you seeing data for US Counties? I was looking for it but just see select cities and counties in the master data files with the countries. Is there a separate file for US Counties?

tghkwomack commented 4 years ago

Thanks for your response! I see it in the time series files: the Confirmed, Recovered, and Deaths files (sample below).

[image: sample from the time series files]

JiPiBi commented 4 years ago

@tghkwomack please don't resend the entire mail you received; answer on the site.

GitHub4PeterRoss commented 4 years ago

Suggestion: download the CSV, reorder the columns, run "text to columns", pluck out the non-US rows, and then pivot. I'm working on a cleaner approach and will share it.
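For anyone who would rather script those steps than do them in a spreadsheet, here is a rough Python sketch using only the standard library. It assumes the time series files have the columns `Province/State`, `Country/Region`, `Lat`, `Long`, followed by one column per date, and that county names are quoted; the sample rows and the `us_long_format` helper are made up for illustration.

```python
import csv
import io

# Hypothetical sample in the assumed layout of the time series files.
SAMPLE = '''Province/State,Country/Region,Lat,Long,3/1/20,3/2/20
"King County, WA",US,47.6062,-122.3321,9,14
"Cook County, IL",US,41.7377,-87.6976,2,3
Hubei,China,30.9756,112.2707,66907,67103
'''

def us_long_format(text):
    """Keep US rows and reshape the date columns into one row per place per date."""
    reader = csv.DictReader(io.StringIO(text))
    fixed = ["Province/State", "Country/Region", "Lat", "Long"]
    # Every column that is not one of the fixed ones is assumed to be a date.
    date_cols = [c for c in reader.fieldnames if c not in fixed]
    rows = []
    for rec in reader:
        if rec["Country/Region"] != "US":
            continue  # "pluck out non-US"
        for d in date_cols:
            rows.append((rec["Province/State"], d, int(rec[d])))
    return rows

rows = us_long_format(SAMPLE)
for place, date, count in rows:
    print(place, date, count)
```

Because `csv.DictReader` honors the quotes, the comma inside the county name never splits the field, so no "text to columns" step is needed.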

dhmacq commented 4 years ago

> It would be nice to have the US Counties represented in a way that didn't include a comma. It throws off attempts to load the data as a comma-delimited file using methods that don't readily allow a text qualifier.

Actually, I just finished using the commas in code to extract the subset of counties. I'd rather the commas continue to be present.
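A minimal sketch of that kind of extraction, in Python, assuming county entries look like `County Name, ST` and state-level entries have no comma. The `split_county` helper and the sample names are hypothetical, not the code referred to above.

```python
def split_county(place):
    """Split 'County, ST' into (county, state); return None if there is no comma."""
    if "," not in place:
        return None
    # Split on the last comma so the state abbreviation is always the final part.
    county, state = place.rsplit(",", 1)
    return county.strip(), state.strip()

places = ["King County, WA", "Washington", "Cook County, IL"]
counties = [p for p in map(split_county, places) if p is not None]
print(counties)
```

Note that any entry containing a comma would be treated as a county by this test, so it is worth checking the matches by eye.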

I would support a change from comma delimiters to tab delimiters (csv to tsv). Excel and R both read the files without trouble. By "text qualifier", do you mean putting quotes around fields whose text includes a comma?
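To make the text qualifier question concrete, here is a small Python illustration of the difference between splitting on every comma and using a parser that honors quotes. The sample line is made up and assumes the county name is wrapped in double quotes.

```python
import csv

# Hypothetical line: the county field is quoted because it contains a comma.
line = '"King County, WA",US,47.6062,-122.3321,9'

# A loader without text-qualifier support effectively does this.
naive = line.split(",")

# A quote-aware parser keeps the county name as a single field.
parsed = next(csv.reader([line]))

print(len(naive), naive[:2])
print(len(parsed), parsed[0])
```

The naive split yields one field too many and leaves stray quote characters in the first two fields, which is the kind of misalignment described at the top of this thread.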

tghkwomack commented 4 years ago

Yes, that's what I mean by text qualifier. I'm using BULK INSERT to load the data into a table, but I'd rather not have to create a format file, and the version of SQL Server I'm using doesn't allow you to specify a text qualifier on the command line. Regardless, a tab-delimited format would be a great alternative. Thanks for your response!
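Until the files change, one possible workaround is to convert a downloaded CSV to tab-delimited text before loading it. The sketch below is Python with only the standard library; `csv_to_tsv` is a hypothetical helper, and it assumes no field contains a tab or a line break.

```python
import csv
import io

def csv_to_tsv(csv_text):
    """Re-write quoted CSV text as tab-delimited text."""
    reader = csv.reader(io.StringIO(csv_text))
    out = io.StringIO()
    # With a tab delimiter, fields containing commas no longer need quoting.
    writer = csv.writer(out, delimiter="\t", quoting=csv.QUOTE_MINIMAL,
                        lineterminator="\n")
    writer.writerows(reader)
    return out.getvalue()

tsv = csv_to_tsv('"King County, WA",US,9\nHubei,China,66907\n')
print(tsv)
```

The county name comes out unquoted with its comma intact, so a loader that splits only on tabs sees it as one field.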