tsegall / fta

Metadata/data identification Java library. Identifies Semantic Type information (e.g. Gender, Age, Color, Country,...). Extensive country/language support. Extensible via user-defined plugins. Comprehensive Profiling support.
Apache License 2.0
26 stars 3 forks source link

Extend support for tracking entire set metrics #13

Closed tsegall closed 2 years ago

tsegall commented 2 years ago

With the exception of totalCount and distinctCount FTA has no concept of tracking metrics across an entire data set as opposed to the sample set that FTA observes. Add support for total versions of NullCount, BlankCount, Mean, Standard Deviation, Min, Max, MinLength, and MaxLength.