Closed neubig closed 1 year ago
@neubig In the example above, X_test
and y_test
are type numpy.ndarray
. For the explainaboard_client.wrap_tabular_data
function, was the input expected to be a numpy.ndarray
or a pandas.Dataframe
?
In the custom feature JSON example, most custom features are categorical, which wouldn't work for numpy.ndarray
. So I am thinking, instead of passing in X_test
and y_test
, maybe we should let the users pass in dataset
(type pandas.Dataframe
) instead. What do you think?
wrapped_dataset = explainaboard_client.wrap_tabular_dataset(
dataset
columns_to_analyze=['sepal-length', 'sepal-width', 'petal-length', 'petal-width'],
)
Yep, that sounds great, thanks @noelchen90 !
Currently classification and regression over tabular data (extracted features) are supported through the
tabular-regression
andtabular-classification
tasks. However in the processor for these tasks, they use basically no input features for analysis by default.Because of this, any features that you want to analyze need to be declared as custom features in a JSON file.
It'd be nice to make this process as easy as possible. Here is an example for a front-end interface we could aim for, similar to a combination of
The only additional thing that would need to be implemented would be the
explainaboard_client.wrap_tabular_data
function.