filter_words with token_frequency method

Summary of bug

When using box.explain.token_frequency, the filter_words option has no visible effect: even when a list of stop words is passed, those words still appear at the top of the ranking.

Environment information

Platform: Windows 11
Python version: 3.10.4
explabox version: ?

Reproducing the bug

Steps to reproduce the behavior:

Take any dataset with more than two classes, provided as two dataframes df_train and df_test with text and label columns,
apply all desired cleaning and vectorizing steps inside a classifier, here called pipe (an sklearn Pipeline),
provide a dictionary labels_dict = {0: name_class_0, ...},
choose any sample input_text = "your text here about class 0 but not about class 3",
and run the following code:

from explabox import Explabox
from explabox import import_data
from explabox import import_model

data = import_data({'train': df_train, 'test': df_test}, data_cols='text', label_cols='label', 
                   label_map=labels_dict)
model = import_model(pipe, data, label_map=labels_dict)

box = Explabox(data=data,
               model=model,
               splits={'train': 'train', 'test': 'test'})

import string
from spacy.lang.en.stop_words import STOP_WORDS

# Filter out spaCy's English stop words and all ASCII punctuation marks
punctuations = string.punctuation
stop_words = STOP_WORDS  # use the imported name; the spacy module itself is never imported

filter_list = list(stop_words) + list(punctuations)

box.explain.token_frequency(splits='test', explain_model=False, labelwise=True, filter_words=filter_list)

Solutions Attempted

I tried several different word lists, but none of them were filtered out of the ranking.
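For comparison, here is a minimal sketch of the behavior I expected from filter_words, written with the standard library only. It is not explabox's implementation: the function name labelwise_token_frequency, the regex tokenizer, the case-insensitive matching and the toy texts are all my own assumptions, used only to illustrate label-wise counting with a filter list.

```python
import re
import string
from collections import Counter, defaultdict


def labelwise_token_frequency(texts, labels, filter_words=(), k=3):
    """Return the top-k tokens per label, skipping tokens in filter_words.

    Hypothetical helper for illustration; matching is case-insensitive.
    """
    blocked = {word.lower() for word in filter_words}
    counts = defaultdict(Counter)
    for text, label in zip(texts, labels):
        # Assumed tokenizer: runs of word characters, or single punctuation marks
        tokens = re.findall(r"\w+|[^\w\s]", text.lower())
        counts[label].update(t for t in tokens if t not in blocked)
    return {label: counter.most_common(k) for label, counter in counts.items()}


# Toy data standing in for df_test['text'] and df_test['label']
texts = [
    "The cat sat on the mat.",
    "The dog chased the cat!",
    "A stock fell, and the market fell.",
]
labels = ["pets", "pets", "finance"]

# Small stand-in for the spaCy stop word list, plus punctuation
filter_list = ["the", "a", "and", "on"] + list(string.punctuation)

top = labelwise_token_frequency(texts, labels, filter_words=filter_list)
print(top)
```

With this sketch, none of the entries in filter_list appear in the per-label rankings, which is what I expected token_frequency to do with filter_words.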

MarcelRobeer / explabox

filter_words with token_frequency method #11
