covidatlas / li

Next-generation serverless crawler for COVID-19 data
Apache License 2.0
57 stars 33 forks source link

Translate Japanese patient status #423

Closed jzohrab closed 4 years ago

jzohrab commented 4 years ago

Original issue https://github.com/covidatlas/coronadatascraper/issues/833, transferred here on Thursday Apr 16, 2020 at 16:56 GMT


Japan's prefecture-level data appears to be a list of patients. If we get the status translated, we can probably get more than just cases:

image

jzohrab commented 4 years ago

(Transferred comment)

@camjc if you extract all the values in the fields that look like Japanese strings, I have a few friends I can send to to try to get translations.

jzohrab commented 4 years ago

(Transferred comment)

@shaperilio if you turn my grouping function from looking at prefectures to looking at the status and log out the keys that’ll be the list :)

jzohrab commented 4 years ago

(Transferred comment)

"Status": {
    "1週間以内": "Within a Week",
    "2週間より前": "Two Weeks Ago",
    "未確定": "Unconfirmed"
}
jzohrab commented 4 years ago

(Transferred comment)

Interesting; thanks for that!

jzohrab commented 4 years ago

(Transferred comment)

I guess they mean nothing much in terms of our current schema, and 2/3 can be derived from date anyway. Unconfirmed may be useful in the future though. Thanks heaps for the translation!

jzohrab commented 4 years ago

(Transferred comment)

Estonia provides patient level data too... We should be capturing this https://www.terviseamet.ee/et/koroonaviirus/avaandmed

jzohrab commented 4 years ago

(Transferred comment)

It looks like @qgolsteyn has already handled this in the EE scraper?

jzohrab commented 4 years ago

(Transferred comment)

Are we happy to close this issue?

jzohrab commented 4 years ago

(Transferred comment)

It looks like @qgolsteyn has already handled this in the EE scraper?

The estonian test result data looks like it's only being used to determine when a new positive case was detected.

This japanese Status field seems a little more vague than the estonian test data.

Are we happy to close this issue?

If the data's available and more regions start publishing it, it may as well be used semantically. :woman_shrugging: