openaq / openaq-fetch

A tool to collect data for the OpenAQ platform.
MIT License

Malaysia - Source (not yet useable, seeking help in contacting correct person for data) #11

Closed (RocketD0g closed 1 year ago)

RocketD0g commented 8 years ago

Listed as a priority given recent AQ issues in the region, but NOTE: there is an issue with these data as they are currently presented. They are shown as an index that combines pollutants into a single unitless value. The FAQs say to contact an individual for historical concentrations in non-index units, though I'm currently unsure how to do that.

After discussions with WRI/GFW, they are particularly interested in AQ data from countries affected by the fires in Indonesia.

AQ data for Malaysia: http://apims.doe.gov.my/v2/ (click 'apims' when given choice at portal)

FAQs (the 5th one down covers hourly concentration data): http://apims.doe.gov.my/v2/faq.html

Seeking help in identifying who to contact about access to hourly AQ data.

magsyg commented 4 years ago

It looks like the source mentioned previously has been updated to http://apims.doe.gov.my/public_v2/home.html. This source has metadata for locations, and it lists parameters and units for the values.

RocketD0g commented 4 years ago

Thanks, @magsyg! I'm missing where the table has physical values reported, still just seeing API values. Can you point me to them?

magsyg commented 4 years ago

@RocketD0g, in the API table screen there is an information button that shows which parameters and units the different symbols represent.

RocketD0g commented 4 years ago

Ahh, I see that, but I don't think it's actually pointing to any information with those units. The scale at the bottom is in the unitless Air Pollution Index (API). When I look at the map on that page or hover over the data points on it, I see no values reported (just a blank spot for API), and when I click on the hourly table, it is all in API too.

So I'm not really sure what the 'info' button is showing units for (took a screenshot below to make sure we're on the same page).

[Image: Screen Shot 2020-05-20 at 3 31 22 PM]

magsyg commented 4 years ago

The API map does not seem to work for this site at the moment, but the API Table [Hourly] does, on this page: http://apims.doe.gov.my/public_v2/api_table.html [image]

RocketD0g commented 4 years ago

I just looked over the manual for the API calculation, and it has the standard calculation that takes a physical AQ measurement and converts it into an API/AQI-type index value, here:

[Image: Screen Shot 2020-05-20 at 3 52 21 PM]

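So either the table is showing API values, or it is showing physical AQ measurements in the units listed in the info box.

It's not clear to me which, but I'd be willing to bet they are API values and the info box tells you the base units the stations measured the data in.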

If it is showing physical AQ measurements, we'd need a check on the superscript symbol to determine whether the value is for a single pollutant and, if so, which one. If the symbol indicates multiple pollutants, we'd ignore the value.

(If you have more insight on this, let me know. I'd like it to be clearer so we could just ingest the data :)).
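For reference, the superscript check described above could look roughly like the sketch below. It is only an illustration under assumptions: the marker legend (`MARKER_TO_POLLUTANT`, `MULTI_MARKER`) is a hypothetical placeholder and would have to be copied from the site's info box, `parse_cell` is a made-up helper name, and the cell format (a number followed by a marker) is assumed rather than confirmed.

```python
import re
from typing import NamedTuple, Optional

# HYPOTHETICAL legend: replace with the symbols shown in the site's info box.
MARKER_TO_POLLUTANT = {
    "*": "pm10",
    "**": "pm25",
    "a": "so2",
    "b": "no2",
    "c": "o3",
    "d": "co",
}
# HYPOTHETICAL marker meaning "more than one pollutant".
MULTI_MARKER = "&"


class Reading(NamedTuple):
    value: float
    pollutant: str


# A number, optional whitespace, then a non-numeric marker (assumed cell layout).
_CELL = re.compile(r"^\s*(\d+(?:\.\d+)?)\s*(\D+?)\s*$")


def parse_cell(cell: str) -> Optional[Reading]:
    """Return a Reading for single-pollutant cells, or None if the cell
    is empty, unmarked, unrecognized, or flagged as multi-pollutant."""
    match = _CELL.match(cell)
    if match is None:
        return None  # no value, or a value with no marker at all
    value = float(match.group(1))
    marker = match.group(2)
    if marker == MULTI_MARKER:
        return None  # multiple pollutants: ignore, as discussed above
    pollutant = MARKER_TO_POLLUTANT.get(marker)
    if pollutant is None:
        return None  # unknown marker: skip rather than guess
    return Reading(value, pollutant)
```

Returning `None` for anything ambiguous keeps the check conservative, so only values that can be tied to exactly one pollutant would ever be passed on.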

RocketD0g commented 4 years ago

Hey @magsyg - Just a heads up that I'm pinging a Malaysian scientist on our Slack channel to see if he can help us figure out what's shared on that table. Will comment here if I find out anything definitive!

(And also, thanks again for your many contributions to the platform already!)

RocketD0g commented 4 years ago

Hi @magsyg - Just got a reply from a contact who is a professor/scientist in Malaysia. He believes they are sharing Air Pollution Index values and not straight physical measurements. He also doesn't believe the gov-measured data are shared in a physical place anywhere (to his knowledge) in Malaysia.

magsyg commented 4 years ago

Okay @RocketD0g, so shall we not add this source at all, or should we write a method that converts the data into approximate concentrations using the API calculation manual? It looks like it is the same formula as the one Singapore is using.

RocketD0g commented 4 years ago

Thanks, @magsyg. I think we'll be unable to add this source at this time. We've resisted adding in data sources where the data has to be back-calculated from AQI/API values. The breakpoints change over time, and it's often not obvious when they change.
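To illustrate the concern about back-calculation, here is a minimal sketch that assumes a generic piecewise-linear index of the kind most AQI/API schemes use. The breakpoint tables `TABLE_OLD` and `TABLE_NEW` are invented numbers, not Malaysian or Singaporean values, and the function names are hypothetical. It only shows that the same index value maps back to different concentrations depending on which table was in force.

```python
from typing import List, Tuple

# (conc_lo, conc_hi, index_lo, index_hi) for one pollutant.
Breakpoint = Tuple[float, float, float, float]

# INVENTED tables standing in for "before" and "after" a breakpoint revision.
TABLE_OLD: List[Breakpoint] = [(0.0, 50.0, 0.0, 50.0), (50.0, 150.0, 50.0, 100.0)]
TABLE_NEW: List[Breakpoint] = [(0.0, 40.0, 0.0, 50.0), (40.0, 120.0, 50.0, 100.0)]


def to_index(conc: float, table: List[Breakpoint]) -> float:
    """Forward calculation: linear interpolation within the matching segment."""
    for c_lo, c_hi, i_lo, i_hi in table:
        if c_lo <= conc <= c_hi:
            return i_lo + (i_hi - i_lo) * (conc - c_lo) / (c_hi - c_lo)
    raise ValueError(f"concentration {conc} is outside the table")


def to_concentration(index: float, table: List[Breakpoint]) -> float:
    """Back-calculation: invert the same interpolation for one pollutant."""
    for c_lo, c_hi, i_lo, i_hi in table:
        if i_lo <= index <= i_hi:
            return c_lo + (c_hi - c_lo) * (index - i_lo) / (i_hi - i_lo)
    raise ValueError(f"index {index} is outside the table")


if __name__ == "__main__":
    # The same reported index gives two different concentrations.
    print(to_concentration(75.0, TABLE_OLD))  # 100.0
    print(to_concentration(75.0, TABLE_NEW))  # 80.0
```

With these made-up tables, an index of 75 back-calculates to 100.0 under one table and 80.0 under the other, so an undocumented breakpoint change would silently shift every ingested value.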