pangeo-data / pangeo

Pangeo website + discussion of general issues related to the project.
http://pangeo.io
699 stars 188 forks source link

Engagement with industry startups in climate fields #204

Closed rabernat closed 6 years ago

rabernat commented 6 years ago

There is a growing list of tech-savvy companies / startups focused on climate-related issues. Many of these are oriented heavily towards remote sensing, but others are using models etc. in a way similar to academic scientists. There is a good opportunity here for Pangeo to partner with such organizations. We have common needs for cloud infrastructure and scalable computing tools. A great outcome would be for developers in those companies to contribute to pangeo-related open-source projects.

Some of the companies I have in mind are.

Who else should be on this list?

Should we try to engage with these companies? If so, what do they have to gain from collaborating with us?

niallrobinson commented 6 years ago

Hi @rabernat you should know that this https://www.exeter.ac.uk/business/news/articles/usingbigdatatogrowsmallbu.html group, the "Impacts Lab" has just started in the same room as us, and directly includes three of our staff, including Pangeo lynch-pin @jacobtomlinson. Their job is to work with umpteed small/medium enterprises over the next couple of years. Pangeo is top of our list of tools to suggest to them! We're gettign the first SMEs, probably around end of summer.

I guess my point is that if you need to point at stuff for funding/evidence of success then hopefully we'll have some stuff

kmpaul commented 6 years ago

Several ex-NCAR people are at Jupiter Intel, and NCAR has a fledgling project with them to set up data analysis in the cloud. We are talking with them about setting up a Pangeo JupyterHub on AWS to test with the LENS dataset. My understanding is that some of this has already been done on the Pangeo side, but I haven’t had time to catch up on it.

darothen commented 6 years ago

The company where I currently work, ClimaCell, is leaning heavily on the Pangeo stack on our science/tech teams. We're integrating components of the stack for background for various different purposes, particularly for more easily working with large datasets split up into collections of GeoTiffs and stored on Google Storage. There's been parallel talk on the xarray forum about a GeoTiff writer/backend beyond what rasterio currently offers, and I'm seeing if we might be able to open source our internal solution there, as well as finding other ways we can contribute back to xarray and the other core projects in the stack.

I evangelize Pangeo every chance I can, and have won a lot of converts internally. We've even got a day carved out next week to set up the Pangeo JupyterHub instance on our dev project on GCP :)

NickMortimer commented 6 years ago

@darothen ClimaCell looks interesting do you do much ocean weather prediction?

jgerardsimcock commented 6 years ago

Maybe some of these companies too? https://darksky.net/about http://echoparklabs.io/ https://new.surfline.com/

rabernat commented 6 years ago

There's been parallel talk on the xarray forum about a GeoTiff writer/backend beyond what rasterio currently offers, and I'm seeing if we might be able to open source our internal solution there, as well as finding other ways we can contribute back to xarray and the other core projects in the stack.

This comment from @darothen exemplifies what I hope to get out of partnership with such companies. They have similar problems to us, strong motivation to solve them, and substantial developer resources. They can potentially move the dial significantly on important features.

Thoughts on how to most effectively engage? Mine are as follows:

stale[bot] commented 6 years ago

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

stale[bot] commented 6 years ago

This issue has been automatically closed because it had not seen recent activity. The issue can always be reopened at a later date.