Open paigea58 opened 2 months ago
Thanks for posting the issue. This is an RSIG issue that you're noticing through pyrsig. Using the RSIG URL API directly demonstrates that this is independent of pyrsig.[1]
It looks like the FIPS code is both being truncated AND used as an ID. If you looked at the whole US, you'd see 72,283 unique FIPS prediction rows per day in the downloaded files. In the RSIG API coverage, you'd see just 47,455 prediction rows per day. Interestingly, 47,455 is also the number of unique FIPS codes in the downloaded files if you first truncate them to 9 characters.
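For anyone who wants to repeat that check, here is a minimal sketch of the counting logic. The helper name `count_unique_ids` and the fourth code in the sample are made up for illustration; the first three codes are the ones quoted in this thread. On the real data you would pass the FIPS column from the downloaded file instead of the toy list.

```python
def count_unique_ids(fips_codes, width=None):
    """Count distinct codes, optionally truncating each to `width` characters first."""
    if width is not None:
        fips_codes = (code[:width] for code in fips_codes)
    return len(set(fips_codes))


# Toy sample: three codes from this thread plus one hypothetical code.
sample = ["48201241101", "48201241102", "48201241103", "48201100000"]

full = count_unique_ids(sample)                # distinct full-length codes
truncated = count_unique_ids(sample, width=9)  # distinct codes after truncation

print(full, truncated)
```

If the truncated count on the full dataset matches the number of rows the API returns, that points to truncated codes being collapsed into a single ID.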
I am checking with the RSIG developer and then will get back to you with a resolution when available.
The expert is on the case. It has been tracked down to a data type issue (32-bit int vs 64-bit int). The fix has to be thoroughly tested for impacts on other parts of the system before it is rolled out. I'll keep you posted.
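To illustrate why the integer width matters, here is a small sketch using only the Python standard library. It shows that an 11-digit tract ID does not fit in a 4-byte signed integer but does fit in an 8-byte one. This is not RSIG's code, and it does not reproduce how RSIG actually mangled the value; the helper `fits` is hypothetical.

```python
import struct

INT32_MAX = 2**31 - 1   # largest value a signed 32-bit integer can hold
tract_id = 48201241101  # one of the tract IDs quoted in this thread


def fits(fmt, value):
    """Return True if `value` can be packed with the given struct format."""
    try:
        struct.pack(fmt, value)
        return True
    except struct.error:
        return False


fits_32 = fits(">i", tract_id)  # big-endian signed 4-byte integer
fits_64 = fits(">q", tract_id)  # big-endian signed 8-byte integer

print(tract_id > INT32_MAX, fits_32, fits_64)
```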
Sorry about the wait... For now, you should just use the downloaded files. The RSIG update may take a bit. If you download the file, you can use it along with other pyrsig data by using geopandas to open it.
Update! RSIG has a revised version of the SITE xdr format that supports longer ids (8-byte). As a result, STATION values now include the full FIPS code and the number of rows per day (72,283) is correct. And your example
4820124110 is subdivided into 48201241101, 48201241102, 48201241103
is now correctly subdivided! If you're using pyrsig, you'll need to delete the old files. Otherwise, it will try to reuse them.
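If it helps, here is one way to clear out stale files, as a rough sketch. The helper `clear_cached_files`, the file names, and the glob pattern are all assumptions for illustration; check what is actually in your working directory before deleting anything. The demo runs in a throwaway temporary directory so nothing real is touched.

```python
import tempfile
from pathlib import Path


def clear_cached_files(workdir, pattern):
    """Delete files in `workdir` whose names match `pattern`; return removed names."""
    removed = []
    for path in sorted(Path(workdir).glob(pattern)):
        if path.is_file():
            path.unlink()
            removed.append(path.name)
    return removed


# Demonstration with made-up file names in a temporary directory.
with tempfile.TemporaryDirectory() as tmp:
    for name in ("faqsd.pm25_daily_average_old.csv.gz", "other_product.csv.gz"):
        (Path(tmp) / name).touch()
    removed = clear_cached_files(tmp, "faqsd.pm25_daily_average*")
    remaining = sorted(p.name for p in Path(tmp).iterdir())

print(removed, remaining)
```

After the old files are gone, rerunning the same request should download fresh data rather than reuse the truncated results.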
Hi, I compared the faqsd.pm25_daily_average 2011 data (downloaded from https://www.epa.gov/hesc/rsig-related-downloadable-data-files#input) with the data retrieved from the API. I noticed that the last digit of the census tract ID is missing in the API data, whereas the website version includes it. For example, census tract 4820124110 is subdivided into 48201241101, 48201241102, 48201241103. The website data shows values for each of these. The API version does not. How can I use the API to get these values?
Code used: