jgliss / pyplis

Python toolbox for the analysis of UV SO2 camera data
GNU General Public License v3.0

NOAA volcano website no longer available #35

Open twVolc opened 3 years ago

twVolc commented 3 years ago

Hi @jgliss, I've been working through some of the small bug fixes on my fork to try to help with a PR. While fixing issue #32, I noticed that the NOAA URL used for retrieving volcano info in get_source_info_online() (line 561 of inout.py) no longer works.

An alternative may be https://www.ngdc.noaa.gov/hazel/view/hazards/volcano/loc-data, but the table seems to be embedded in that page in a different way: urlopen() didn't return any of the volcano data for this URL. I've had a look at how to retrieve the info presented on that page using Python, but I couldn't find an obvious solution (I know very little about website querying).

The data can be downloaded manually as a TSV file with the click of a button, so one option is to keep a copy of this file in the pyplis package and query that. I'm guessing this isn't preferable, though, since the bundled copy would go out of date if the NOAA database is updated with new volcanoes over time.

Alternatively, specific volcanoes can be searched for by name, for example https://www.ngdc.noaa.gov/hazel/view/hazards/volcano/loc-data?nameInclude=lascar for Lascar. If there's an easy way to get this information from the site directly, or to download the TSV programmatically and extract the info from it, that would be a solution that keeps using the online database. As with the entire dataset, though, I haven't found a way to extract the displayed info using Python.