Open MarcelBeining opened 1 year ago
Hi @MarcelBeining thank you for sharing this with us. So this is the old, block config-driven way of utilizing data assistant that we no longer support, so issues will not be bound to surface. We have moved on to the Fluent Data Source style, please see the docs on it here: https://docs.greatexpectations.io/docs/guides/expectations/data_assistants/how_to_create_an_expectation_suite_with_the_onboarding_data_assistant/
Same error using the fluent version:
import great_expectations as ge
import pandas as pd
ge_context = ge.data_context.DataContext()
datasource = ge_context.sources.add_pandas(name="validation_datasource")
data_asset = datasource.add_dataframe_asset(name="asset")
batch_request = data_asset.build_batch_request(dataframe=pd.DataFrame([{'a': True, 'b': 0},
{'a': False, 'b': 0},
{'b': 0}]))
result = ge_context.assistants.onboarding.run(
batch_request=batch_request,
exclude_column_names=[],
)
ValueError: zero-size array to reduction operation minimum which has no identity
The error does not seem to appear with newest version 0.17.9, however I am forced to keep with version 0.16.5 as long as the problem from #8387 / #8392 persists :-/
Describe the bug I wanted to use the Assistant for profiling my Data in Kedro, however most of the columns produce errors during profiling because they also contain None values. Wondering why I am the first one reporting this.
To Reproduce Run this code
_greatexpectations.yml
Error Traceback_
Expected behavior Fill my expectation suite with valuable expectations about column a and b
Environment (please complete the following information):