openghg / openghg

A cloud platform for greenhouse gas (GHG) data analysis and collaboration.
https://www.openghg.org
Apache License 2.0
30 stars 4 forks source link

Implement "insert" or "combine" case for adding new data #881

Open rt17603 opened 10 months ago

rt17603 commented 10 months ago

What is your issue?

See #591 for original details on this. At the standardisation layer (standarise_* functions) we want to ensure that the user can specify how to add / combine new data with currently existing data in the object store.

For the new if_exists input flag (within a standardise_* function) we want to implement a case for an "insert" option.

rt17603 commented 2 weeks ago

PR #1166 is working to address this issue, including laying groundwork to support this.

For this the suggestion is to approach this with 3 goals in mind that build up this functionality:

  1. Update version handling (neutral, restructure change)
    • Would be useful to include some diagrams of how the classes (e.g. ABC, Store, VersionedStore, LocalZarrStore etc. build on and relate to each other)
  2. Formalise and update index comparison (may be neutral or may include updated functionality)
  3. Use these new structures to add the new mode for combining data