Open shawncao opened 3 years ago
Some use cases need sampling support during data ingestion: for very heavy data sources, users can get insights without scanning the full data.
A few initial thoughts:
There is more to sampling when it comes to data science and statistical (the old name for ML) use cases. We can start with a simple one-pass algorithm.
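To make "simple one-pass algorithm" concrete, here is a minimal sketch of reservoir sampling (Algorithm R), one common technique that fits this description. The issue does not name a specific algorithm, so this choice is an assumption. The function name `reservoir_sample` and its parameters `rows`, `k`, and `seed` are hypothetical and not part of any existing ingestion API.

```python
import random
from typing import Iterable, List, Optional, TypeVar

T = TypeVar("T")


def reservoir_sample(rows: Iterable[T], k: int, seed: Optional[int] = None) -> List[T]:
    """Return up to k rows chosen uniformly at random in a single pass.

    Hypothetical helper for illustration only: it keeps at most k rows in
    memory and never needs to know the total size of the source.
    """
    if k < 0:
        raise ValueError("k must be non-negative")
    rng = random.Random(seed)  # a seed makes the sample reproducible
    reservoir: List[T] = []
    for i, row in enumerate(rows):
        if i < k:
            # Fill the reservoir with the first k rows.
            reservoir.append(row)
        else:
            # Pick a slot in [0, i]; the row is kept with probability k / (i + 1).
            j = rng.randint(0, i)
            if j < k:
                reservoir[j] = row
    return reservoir


# Usage: sample 5 rows from a stream that is read only once.
sample = reservoir_sample((n * n for n in range(100_000)), k=5, seed=42)
```

Each row ends up in the sample with equal probability, and memory use is bounded by `k`, which is why this style of algorithm could suit ingestion from a heavy source. Note that every row is still read once; the saving is in what gets stored and queried afterwards.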