numpy / numpy.org

The NumPy home page
http://numpy.org/
BSD 3-Clause "New" or "Revised" License

ecosystem: feature geoscience (pangeo) #319

Closed mdeff closed 3 years ago

mdeff commented 4 years ago

The geosciences rely heavily on data and computation and certainly merit being featured. The community seems organized under https://pangeo.io. However, I'm not qualified to pick projects to highlight. There's a list of packages in the xarray doc. Maybe @jhamman or @rabernat can advise.

Or does the fact that these packages mostly use xarray disqualify them from NumPy's landing page?

rgommers commented 4 years ago

I'm kind of doubtful about having both "Geographic processing" and "Geosciences"; they're quite similar. We have Xarray in the array libraries and GeoPandas, Shapely and Folium in the scientific domains image. I think that's enough coverage, and AFAIK all the packages in that Xarray docs list, with the exception of Open Data Cube, are less prominent than the already featured packages.

mdeff commented 4 years ago

There's a difference between geoscience (i.e., the study of the physics and chemistry of the Earth system, data not necessarily geospatial) and GIS (i.e., the handling of geospatial data, whether for geoscience, social science, whatever), the current focus of the "Geographic processing" domain. Whether those can be reasonably merged is beyond my knowledge of those fields. It would be great to have input from experts.

Either way, I think Pangeo should be included as the federating project for geoscience. I agree that the packages from the list are already reasonably covered. There may be more, but I'm not a user.

rabernat commented 4 years ago

> There's a difference between geoscience (i.e., the study of the physics and chemistry of the Earth system, data not necessarily geospatial) and GIS (i.e., the handling of geospatial data, whether for geoscience, social science, whatever)

:+1:

Some prominent non xarray-based geoscience packages which use NumPy heavily are

jhamman commented 4 years ago

Just chiming in with a few quick thoughts. I first want to say the new website looks great so nice work to all involved. Also, I know from experience that navigating decisions around which 3rd party projects to highlight can be tricky and is likely to eventually leave someone disappointed. With the possible exception of the packages @rabernat listed, I don't think there is a huge need to pull packages from the xarray ecosystem into this list.

I was personally surprised to not find Xarray in the Scientific Domains or Data Science tabs. If I were to make an unsolicited suggestion on where Xarray could be highlighted in these sections:

Like I said above, balancing content in a section like this is hard, so I'll leave it up to you whether to take action here.

rgommers commented 3 years ago

Sorry for the huge delay. I added a policy/procedure on updating the Ecosystem tab in gh-313. tl;dr we're good to add Geoscience as a category.

We can add up to four projects. Looking at the suggestions here, my order of preference would be:

rgommers commented 3 years ago

> I was personally surprised to not find Xarray in the Scientific Domains or Data Science tabs. If I were to make an unsolicited suggestion on where Xarray could be highlighted in these sections:

So basically, next to Pandas :) To me, putting it under "Statistical computing" makes more sense than in the Data Science tab. The latter tab is really focused on data science as practiced by someone with a "data scientist" title in industry, dealing mostly with tabular data, not the broader "science with data" interpretation.

rgommers commented 3 years ago

All suggestions adopted:

image

Thanks for the input everyone.