Paper GrainSpace: A Large-scale Dataset for Fine-grained and Domain-adaptive Recognition of Cereal Grains
Work done when interning in Gaozhe Technology
This is the project page for GrainSpace, to the best of our knowledge, this is the first publicly released dataset for cereal grain inspections. Cereal grains are a vital part of human diets and are important commodities for people’s livelihood and international trade. Grain Appearance Inspection (GAI) serves as one of the crucial steps for the determination of grain quality and grain stratification for proper circulation, storage and food processing, etc. GAI is routinely performed manually by qualified inspectors with the aid of some hand tools. Therefore, our goal is to build automated GAI that has the benefit of greatly assisting inspectors with their jobs.
We believe that smart agriculture is a critical research field for social good and a large dataset can lead to significant advancement in CV techniques for this field. We hope that GrainSpace can stimulate and draw more attention to the development of intelligent agriculture, triggering more researchers to devote themselves to smart agriculture, helping countries with food security, quality of food, etc. We believe computer vision techniques can revolutionize GAI-related applications.
GrainSpace includes a total of 5.25 million images determined by professional inspectors, since all extra matters (e.g., impurities or foreign cereals) in raw grain samples are removed manually. The grain samples including wheat, maize and rice are collected from five countries and more than 30 regions. Considering the manufacturing cost and reproducibility of data devices, we construct three types of device prototypes for data acquisition: Professional-600 (P600), General-600 (G600) and Mobile-600 (M600).
The training and val sets can be accessed after contacting with us.
The GrainSpace dataset is licensed under the Creative Commons BY-NC-SA 4.0 license. Note that All data must not be used for commercial purposes.
FAQ1: About data imbalance
FAQ2: About Unlabeled data
FAQ3: About Wheat region grouping standard
We formulate GAI as three ubiquitous computer vision tasks: fine-grained recognition, domain adaptation and out-of-distribution recognition. Detailed benchmarking results can be found in the results folder.
ResNet50:
DCL
SwimT
MixMatch
MoCo
@InProceedings{GrainSpace_2022_CVPR,
author = {Lei Fan, Yiwen Ding, Dongdong Fan, Donglin Di, Maurice Pagnucco, Yang Song},
title = {GrainSpace: A Large-scale Dataset for Fine-grained and Domain-adaptive Recognition of Cereal Grains},
booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
month = {June},
year = {2022},
}