hellodfan / GrainSpace

GrainSpace: A Large-scale Dataset for Fine-grained and Domain-adaptive Recognition of Cereal Grains. CVPR2022
8 stars 1 forks source link

GrainSpace

Paper GrainSpace: A Large-scale Dataset for Fine-grained and Domain-adaptive Recognition of Cereal Grains

Work done when interning in Gaozhe Technology

Introduction

This is the project page for GrainSpace, to the best of our knowledge, this is the first publicly released dataset for cereal grain inspections. Cereal grains are a vital part of human diets and are important commodities for people’s livelihood and international trade. Grain Appearance Inspection (GAI) serves as one of the crucial steps for the determination of grain quality and grain stratification for proper circulation, storage and food processing, etc. GAI is routinely performed manually by qualified inspectors with the aid of some hand tools. Therefore, our goal is to build automated GAI that has the benefit of greatly assisting inspectors with their jobs.

We believe that smart agriculture is a critical research field for social good and a large dataset can lead to significant advancement in CV techniques for this field. We hope that GrainSpace can stimulate and draw more attention to the development of intelligent agriculture, triggering more researchers to devote themselves to smart agriculture, helping countries with food security, quality of food, etc. We believe computer vision techniques can revolutionize GAI-related applications.

Dataset

GrainSpace includes a total of 5.25 million images determined by professional inspectors, since all extra matters (e.g., impurities or foreign cereals) in raw grain samples are removed manually. The grain samples including wheat, maize and rice are collected from five countries and more than 30 regions. Considering the manufacturing cost and reproducibility of data devices, we construct three types of device prototypes for data acquisition: Professional-600 (P600), General-600 (G600) and Mobile-600 (M600).

TRAIN & VAL

The training and val sets can be accessed after contacting with us.

The GrainSpace dataset is licensed under the Creative Commons BY-NC-SA 4.0 license. Note that All data must not be used for commercial purposes.

TEST

FAQ

FAQ1: About data imbalance

FAQ2: About Unlabeled data

FAQ3: About Wheat region grouping standard

Benchmark

We formulate GAI as three ubiquitous computer vision tasks: fine-grained recognition, domain adaptation and out-of-distribution recognition. Detailed benchmarking results can be found in the results folder.

Fine-grained Recognition

ResNet50:

DCL

SwimT

MixMatch

MoCo

Domain Adaptation

CDAN

MCD

MCC

Out-of-distribution

Deep SVDD

Rot

CSI

Citation

@InProceedings{GrainSpace_2022_CVPR,
author = {Lei Fan, Yiwen Ding, Dongdong Fan, Donglin Di, Maurice Pagnucco, Yang Song},
title = {GrainSpace: A Large-scale Dataset for Fine-grained and Domain-adaptive Recognition of Cereal Grains},
booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
month = {June},
year = {2022},
}

TODO