MOA is an open source framework for Big Data stream mining. It includes a collection of machine learning algorithms (classification, regression, clustering, outlier detection, concept drift detection and recommender systems) and tools for evaluation.
The instanceLimitOption in the EvaluateClustering task (line 43) describes its function as
Maximum number of instances to test/train on (-1 = no limit).
If "-1" is passed as an argument, however, (for example if the streamOption is set as a FileStream and the user wants the whole ARFF file processed) then no instances are passed to the learner and the results produced in the dumpFile are NULL outside of the header.
This is because the BatchCmd run method (line 147) uses the following while loop to determine if another instance should be passed:
The totalInstances variable is the local representation of the user's selected value for EvaluateClustering instanceLimitOption, but it is created with no understanding of the meaning of "-1." Instead totalInstances is set to -1 in the BatchCmd constructor method (line 68) and then the condition in the above while loop immediately evaluates as false.
The instanceLimitOption in the EvaluateClustering task (line 43) describes its function as
If "-1" is passed as an argument, however, (for example if the streamOption is set as a FileStream and the user wants the whole ARFF file processed) then no instances are passed to the learner and the results produced in the dumpFile are NULL outside of the header.
This is because the BatchCmd run method (line 147) uses the following while loop to determine if another instance should be passed:
The totalInstances variable is the local representation of the user's selected value for EvaluateClustering instanceLimitOption, but it is created with no understanding of the meaning of "-1." Instead totalInstances is set to -1 in the BatchCmd constructor method (line 68) and then the condition in the above while loop immediately evaluates as false.