cat-cfs / twobilliontoolkit

This repo serves as a single place for all the tools that will be developed for processing 2 Billion Trees data and information.
MIT License

SpatialTransformer - handle vast amounts of aspatial data #16

nesdolya closed 4 months ago

nesdolya commented 6 months ago

Two issues need to be addressed:

  1. Need to determine how we are going to handle the vast amounts of aspatial data. The 2BT data team did not separate the spatial files from the aspatial files, so there will be tonnes of aspatial records in the raw_data_tracker.
  2. Need to determine how RecordsReviser will display only the most relevant information - e.g. dropped = False and planting year = YYYY. Or some sort of sorting or filtering in the GUI itself.
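
The filter in point 2 could look roughly like the sketch below. It only illustrates the idea: the `TrackerRecord` class, its field names, and the `relevant_records` helper are hypothetical and are not taken from the toolkit's actual schema or from RecordsReviser.

```python
from dataclasses import dataclass


@dataclass
class TrackerRecord:
    # Hypothetical record shape; these field names are assumptions,
    # not the real raw_data_tracker columns.
    record_id: str
    dropped: bool
    planting_year: int


def relevant_records(records, planting_year):
    """Keep records that are not dropped and match the given planting year."""
    return [
        r for r in records
        if not r.dropped and r.planting_year == planting_year
    ]


# Made-up sample data for illustration only.
records = [
    TrackerRecord("A001", dropped=False, planting_year=2023),
    TrackerRecord("A002", dropped=True, planting_year=2023),
    TrackerRecord("A003", dropped=False, planting_year=2022),
]

for record in relevant_records(records, 2023):
    print(record.record_id)
```

The same predicate could instead back a sort or filter control in the GUI; which option fits RecordsReviser better is still the open question here.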

nesdolya commented 6 months ago

Addressing point #1: We decided to add an event_type field with a spatial/aspatial flag.

To address point #2: @AnthonyRodway - How difficult would it be to filter the RecordsReviser to display only the records that were entered within the last 24 hours? This would apply ONLY when the RecordsReviser pops up automatically at the end of a SpatialTransformer run.
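
As a rough sketch of the 24-hour filter being asked about, assuming each record carries a creation timestamp: the `created_at` key, the dict-based record shape, and the `entered_recently` helper are hypothetical and only illustrate the idea, not how RecordsReviser actually stores or loads its data.

```python
from datetime import datetime, timedelta


def entered_recently(records, now=None, window=timedelta(hours=24), event_type=None):
    """Return records created within `window` of `now`.

    If `event_type` is given (e.g. "spatial"), also require a matching flag.
    The 'created_at' and 'event_type' keys are assumed names.
    """
    now = now or datetime.now()
    cutoff = now - window
    return [
        r for r in records
        if r["created_at"] >= cutoff
        and (event_type is None or r["event_type"] == event_type)
    ]


# Made-up sample data; a fixed "now" keeps the example reproducible.
now = datetime(2024, 5, 2, 12, 0)
records = [
    {"id": 1, "event_type": "spatial", "created_at": datetime(2024, 5, 2, 9, 0)},
    {"id": 2, "event_type": "aspatial", "created_at": datetime(2024, 4, 30, 9, 0)},
    {"id": 3, "event_type": "aspatial", "created_at": datetime(2024, 5, 1, 18, 0)},
]

recent = entered_recently(records, now=now)
recent_spatial = entered_recently(records, now=now, event_type="spatial")
```

Applying this only when the reviser opens automatically would mean passing the filter from the SpatialTransformer call site and leaving manual launches unfiltered.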

AnthonyRodway commented 5 months ago

Changes were made in this commit: https://github.com/cat-cfs/twobilliontoolkit/commit/c97d71018ca74810367038626a15b2a6dab668d0

AnthonyRodway commented 5 months ago

The entry type was added quite a while ago in a previous commit, but I forgot to document which one, sorry about that. It is implemented in the new spatial logic branch.

nesdolya commented 4 months ago

Tested and functioning