Closed simonrolph closed 8 months ago
Thank you for pointing out this problem @simonrolph. If you want to suggest where in the documentation we should add these instructions, feel free to open a PR!
...and I hear you on the dependency reliance. This is a package still in development and, as such, sometimes the trimming of dependencies has to happen down the line.
Hey @simonrolph :) thanks for raising this. You're right, we should have a note about this if people are running the documentation themselves. I've added a comment about this in the DataProcessor page.
It would be preferable for the example to have as few dependencies as possible.
We need weather station data to demonstrate DeepSensor, and get-station-data is a very convenient way to get it. So I'd argue we are already at the minimum number of dependencies.
For some background:
The reason @scotthosking's get-station-data doesn't get installed alongside deepsensor is that PyPI doesn't accept packages that list GitHub repos as direct dependencies (https://github.com/pypi/warehouse/issues/7136). If get-station-data were itself on PyPI then we could depend on it, but unfortunately it is only a GitHub repo.
The best workaround I could think of was to prompt the user to manually install get-station-data if they run the function that depends on it, and to do this via a (hopefully) informative error message, like the one in your stack trace above. This is also mentioned as a note in the API reference for get_ghcnd_station_data.
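For anyone curious what this kind of workaround looks like in general, here is a minimal sketch of the "optional dependency with a helpful error" pattern. It is not the actual deepsensor code: the helper require_optional, the function load_station_data, the import name get_station_data, and the wording of the message are all assumptions made for illustration.

```python
import importlib


def require_optional(module_name, install_hint):
    """Import an optional module, or raise an ImportError explaining how to get it.

    Hypothetical helper for illustration only.
    """
    try:
        return importlib.import_module(module_name)
    except ImportError as err:
        # Re-raise with a message that tells the user what to do next.
        raise ImportError(
            f"'{module_name}' is required for this function but is not installed. "
            f"{install_hint}"
        ) from err


def load_station_data():
    """Hypothetical stand-in for a function that needs the optional package."""
    # The import happens at call time, so users who never call this function
    # never need the extra dependency installed.
    gsd = require_optional(
        "get_station_data",  # assumed import name, not confirmed in this thread
        "Install it manually from its GitHub repository.",
    )
    return gsd
```

The design choice here is that the main package still installs cleanly from PyPI, and only users who call the station-data function see the prompt to install the extra package.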
I would be against adding get-station-data to the Installation Instructions since many users will not require it and it may be confusing. However, hopefully you are happy with the addition to the DataProcessor page :)
Ok all makes sense, thanks for resolving this. Just holding you accountable that the user guide states "The pages of this guide are Jupyter notebooks and are fully-reproducible." 😉
As you should @simonrolph!
Documentation does not state that I needed to run this line:
in order to successfully run this line:
I tried running the station_raw_df line and got this error:
I was running this in a Google Colab notebook and had previously run these lines as specified in the installation instructions:
It would be preferable for the example to have as few dependencies as possible.