I am in learning scattertext. I have a dataset of English documents which are categorized and labeled as D1, D2, D3, etc. One category such as D1 can have multiple documents. So, following is the sketch of dataset:
The parsed_col argument needs to be a column in the data frame which contains spaCy Doc objects or something equivalent. Please refer to the first example in the readme.
Hi,
I am in learning scattertext. I have a dataset of English documents which are categorized and labeled as D1, D2, D3, etc. One category such as D1 can have multiple documents. So, following is the sketch of dataset:
Category; Text D1; abc sdf....... D1; jhs dgf.... D2; sdf dfh..... . . . . . . DN; xyz jha....
Now, I would like to plot the corpus content in terms of scattertext. But, when I run the following code, I am getting an error:
{AttributeError}("'numpy.str_' object has no attribute 'sents'", 'occurred at index 0')
Please guide me how can I resolve it?