Utility for generating Lucene-indexed datasets for collections of emails. Includes the following components:
The datasets generated by this library are structured simply as a directory containing multiple files and/or subdirectories:
- The index directory contains all files used by Apache Lucene for creating and searching over indexes.
- The database.mv.db file is the self-contained H2 relational database that contains all emails and any associated tags.
- The metadata.properties file contains meta information about the dataset. Currently stores the version number.
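
The layout described above can be checked with a few lines of standard Java. The following is a minimal sketch, not part of this library: the class name DatasetLayout and its methods are hypothetical, and the default path "dataset" is only a placeholder. Because the property key used for the version number is not stated here, the sketch prints whatever keys metadata.properties contains instead of assuming one.

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.List;
import java.util.Properties;

// Hypothetical helper for illustration only; it is not an API of this library.
public class DatasetLayout {

    /** Returns the names of the expected entries that are missing from the dataset directory. */
    static List<String> missingEntries(Path dataset) {
        List<String> missing = new ArrayList<>();
        if (!Files.isDirectory(dataset.resolve("index"))) {
            missing.add("index");
        }
        if (!Files.isRegularFile(dataset.resolve("database.mv.db"))) {
            missing.add("database.mv.db");
        }
        if (!Files.isRegularFile(dataset.resolve("metadata.properties"))) {
            missing.add("metadata.properties");
        }
        return missing;
    }

    /** Loads metadata.properties with the standard java.util.Properties parser. */
    static Properties readMetadata(Path dataset) throws IOException {
        Properties props = new Properties();
        try (InputStream in = Files.newInputStream(dataset.resolve("metadata.properties"))) {
            props.load(in);
        }
        return props;
    }

    public static void main(String[] args) throws IOException {
        // "dataset" is a placeholder; pass the real dataset directory as the first argument.
        Path dataset = Paths.get(args.length > 0 ? args[0] : "dataset");

        List<String> missing = missingEntries(dataset);
        if (!missing.isEmpty()) {
            System.out.println("Missing entries: " + missing);
            return;
        }

        // Print every key/value pair rather than assuming the name of the version key.
        Properties metadata = readMetadata(dataset);
        for (String key : metadata.stringPropertyNames()) {
            System.out.println(key + " = " + metadata.getProperty(key));
        }
    }
}
```

Run it with the dataset directory as the only argument. Opening the index itself (with Lucene) or the database (with H2) requires those libraries on the classpath and is not shown here.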