developmentseed / satTS

ML pipeline to classify crop types with multi-spectral and multi-temporal EO data
MIT License
18 stars 8 forks source link

Project outline: Temporal crop classification #1

Open JmeCS opened 6 years ago

JmeCS commented 6 years ago

Background

Accurate and timely (e.g. within season) crop classification using remotely sensed data presents many unique challenges. Unlike static objects such as roads and buildings, crop phenology is dynamic within a growing season, and unique crop classes are generally not distinguishable by sight, even with high-resolution satellite imagery. The complexity of the problem increases in developing countries in which plot sizes are small (limiting the effectiveness of low- and medium-resolution satellite imagery), species-level crop diversity is high, and ground-truth training data for classification algorithms are limited.

Accurate, in-season spatial extent and yield estimates could prove beneficial to a variety of audiences and will be critical in the development of insurance and financial services products for small-scale agriculturalists in developing countries.

This project explores possible approaches to addressing certain problems inherent to crop classification in developing countries. Specifically, a time-series of pixels from Sentinel-2 tiles will first be clustered using an unsupervised learning algorithm. Within cluster time-series curves (e.g. NDVI), along with their spatial distribution, will then be examined and matched with individual crop and other landcover classes. The crop and other land cover classes labeled in the unsupervised step will then be used as training data in a supervised classification model. Tanzania has been identified as a target country due to it's relatively simple crop mix (dominated by maize production), low maize productivity and potential synergies with ongoing projects.

Team

(TO BE FINALIZED)

Project phases

  1. Area of Interest (AOI) identification and data wrangling
  2. Data pre-processing and clustering
  3. Supervised classification

Project phases in detail

1. AOI identification and data wrangling:

Phase 1 tasks:

pct_crop_barplot

Phase 1 deliverables:

AOI identified, and an associated time-series of Sentinel-2 tiles selected. Vector data for generic land-cover sampling generated. Various time-series to be used in the development of the clustering algorithm generated. A write-up will be provided describing successes and obstacles, and goals and procedures for subsequent steps will be updated, if necessary.

2. Data pre-processing and clustering

Phase 2 tasks:

Phase 2 deliverables:

Labeled training data corresponding to one or more crop classes as well as other land cover classes, if they can be identified. A write-up will be provided describing successes and obstacles, and goals and procedures for subsequent steps will be updated, if necessary.

3. Supervised classification

Phase 3 tasks:

Phase 3 deliverables:

Trained classification model with a focus on predicting the spatial extent of maize in the Southern Highlands of Tanzania.

abarciauskas-bgse commented 6 years ago

cc @developmentseed/all

Jamey will be presenting the project outline Friday, June 22 at 10am ET for team feedback and questions. Let me know if you are interested in joining and I will add you to the invite - perhaps just add a 👍 to this comment.