We have years of historical data, plus data streams that are updated daily.
Can I use Deequ to view metrics over time? For us, data quality is a trajectory, not a point. Based on the examples I see, I'd have to create a separate DataFrame for each day of data (hundreds, actually thousands, of them) and run the analysis on each one. Is there no way to run the metrics over all my data at once and disaggregate the results by date?
TIA.