lemma-osu / naip-cnn

Modeling forest attributes from aerial imagery using CNNs

Sample and train with other years and attributes #6

Open aazuspan opened 1 year ago

aazuspan commented 1 year ago

Radiometric variability between NAIP acquisitions may make it difficult for the model to generalize between years. Training and/or validating with NAIP imagery and corresponding LiDAR data from multiple years should a) show us how well we can predict for years the model wasn't trained on, and b) hopefully allow the model to generalize better.

Additionally, we should look at other LiDAR metrics, e.g. RH95 or understory cover, to see how well we can predict other attributes.

This should all be doable with the current sampling and modeling workflows just by modifying the notebooks. If we end up extracting a dozen attributes over a dozen years, though, there may be some convenience features we can add to simplify that process.
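To make the scale of "a dozen attributes over a dozen years" concrete, here is a minimal sketch of one possible convenience feature: enumerating every (year, attribute) combination up front so a notebook can loop over them. `SamplingJob` and `build_jobs` are hypothetical names, not part of the current codebase, and the years and attribute keys are placeholders.

```python
from dataclasses import dataclass
from itertools import product


@dataclass(frozen=True)
class SamplingJob:
    """One (acquisition year, LiDAR attribute) combination to sample and model."""

    year: int
    attribute: str


def build_jobs(years, attributes):
    """Return one job per year/attribute pair, ordered by year, then attribute."""
    return [SamplingJob(year, attr) for year, attr in product(years, attributes)]


# Placeholder values only; the real acquisition years and attribute names may differ.
jobs = build_jobs(years=[2016, 2018, 2020], attributes=["rh95", "understory_cover"])

for job in jobs:
    print(f"sample {job.attribute} for {job.year}")
```

With 12 years and 12 attributes this yields 144 jobs, which is probably where editing notebooks by hand stops being practical.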

aazuspan commented 11 months ago

With the dataset update in #11, we'll have one HDF5 dataset per LiDAR/NAIP acquisition that includes all relevant LiDAR attributes. To train on multiple acquisitions at once, we may want to use `interleave`, although I'm not 100% sure what that gives us over `concatenate`. In either case, we'll need to ensure that any merged datasets are shuffled.
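One possible difference between the two, sketched below in plain Python rather than with any particular data library: if the merged dataset is shuffled through a fixed-size buffer (as streaming pipelines such as `tf.data` do), concatenation only mixes samples locally, so early batches come entirely from the first acquisition, while interleaving mixes acquisitions from the start. This assumes `interleave` and `concatenate` refer to stream-style dataset operations. The helper functions and the `(year, index)` samples are illustrative stand-ins, not the project's API or data.

```python
import itertools
import random


def concatenate(*datasets):
    """Yield every sample of the first dataset, then the second, and so on."""
    return itertools.chain(*datasets)


def interleave(*datasets):
    """Yield one sample from each dataset in turn until all are exhausted."""
    sentinel = object()
    for group in itertools.zip_longest(*datasets, fillvalue=sentinel):
        for sample in group:
            if sample is not sentinel:
                yield sample


def buffered_shuffle(samples, buffer_size, seed=0):
    """Approximate shuffle that only ever holds buffer_size + 1 samples in memory."""
    rng = random.Random(seed)
    buffer = []
    for sample in samples:
        buffer.append(sample)
        if len(buffer) > buffer_size:
            # Emit a randomly chosen sample from the buffer
            idx = rng.randrange(len(buffer))
            buffer[idx], buffer[-1] = buffer[-1], buffer[idx]
            yield buffer.pop()
    # Flush whatever is left once the input is exhausted
    rng.shuffle(buffer)
    yield from buffer


def years_in_first_batch(stream, batch_size=32):
    """Return the set of acquisition years represented in the first batch."""
    return {year for year, _ in stream[:batch_size]}


# Hypothetical acquisitions: (year, sample index) stands in for a NAIP/LiDAR pair
acq_2016 = [(2016, i) for i in range(100)]
acq_2020 = [(2020, i) for i in range(100)]

concat_stream = list(buffered_shuffle(concatenate(acq_2016, acq_2020), buffer_size=10))
inter_stream = list(buffered_shuffle(interleave(acq_2016, acq_2020), buffer_size=10))

print(years_in_first_batch(concat_stream))  # only 2016
print(years_in_first_batch(inter_stream))   # both 2016 and 2020
```

If the shuffle buffer is at least as large as the merged dataset, the two approaches should behave the same after shuffling, so the distinction mainly matters when the data doesn't fit in the buffer.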