The open tool development kit provides java libraries for efficient implementation of data processing tools. Primarily the libraries have been build to close gaps within the process of test automation, but they can be helpful for implementing any application that needs to collect data from different sources, transform the data and generate output in different formats (textual, graphical etc.).
With the setMetaData method of the DataContainer class, it is possible to define columns and values the will be assigned to each dataset which is added to the container. The metadata needs to be declared before the rows are added to the container from the source. After metadata is set using the setMetaData method, the defined column and values will be added to each dataset afterwards.
With the current implementation, there are several open issues like:
automatically attach metadata to existing datasets - all datasets or a filtered subset
replace the metadata values for existing datasets - all datasets or a filtered subset
using metadata with tree formats like XMLDataContainer, JSONDataContainer or YAMLDataContainer has not been tested and is probably not implemented
verification, if filters also work with metadata
implementation of regression tests for metadata
A concept is required for fully implement the metadata functionality, so that also:
metadata can be added to DataContainer with existing data
either for all newly added rows, so that only an empty column will be added to existing rows
or for all existing rows and all newly added rows
getter methods for records including metadata and excluding metadata are available
methods for deleting metadata column from the records are available
etc.
Sample code using DataContainer with metadata with current implementation:
DataContainer dc = new DataContainer(";", EHeader.COLUMN);
dc.setMetaData("Company", "happy inc.");
dc.readData("c:\\data\\happy-inc.csv");
With the setMetaData method of the DataContainer class, it is possible to define columns and values the will be assigned to each dataset which is added to the container. The metadata needs to be declared before the rows are added to the container from the source. After metadata is set using the setMetaData method, the defined column and values will be added to each dataset afterwards.
With the current implementation, there are several open issues like:
A concept is required for fully implement the metadata functionality, so that also:
Sample code using DataContainer with metadata with current implementation:
Content of happy-inc.csv
Content within DataContainer dc after performing the readData method