explosion / spacy-stanza

💥 Use the latest Stanza (StanfordNLP) research models directly in spaCy
MIT License

Can't use Spacy-Stanza in a databricks/spark UDF #86

Open DanTyackDB opened 1 year ago

DanTyackDB commented 1 year ago

I am using spaCy inside a UDF on a PySpark DataFrame. Regular spaCy works fine, but with spacy-stanza I get a pickling error, which suggests to me that the library's pipeline object is not serializable. Here's the error: PicklingError: Could not serialize object: TypeError: can't pickle _thread.RLock objects
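
For illustration, here is a minimal standard-library sketch of what the error seems to point at, and one common way around it. It assumes the failure comes from an object that holds a `threading.RLock` being captured and pickled along with the UDF. `FakePipeline`, `LazyPipeline`, and `try_pickle` are hypothetical names made up for this sketch; they are not part of spaCy, spacy-stanza, or PySpark, and `FakePipeline` only stands in for the real pipeline.

```python
import pickle
import threading


class FakePipeline:
    """Hypothetical stand-in for a pipeline that holds a lock internally."""

    def __init__(self):
        self._lock = threading.RLock()  # RLock objects cannot be pickled

    def __call__(self, text):
        with self._lock:
            return text.split()


class LazyPipeline:
    """Picklable wrapper: ships only the factory, builds the pipeline on first use."""

    def __init__(self, factory):
        self._factory = factory  # callable that creates the real pipeline
        self._pipeline = None    # created lazily, never serialized

    def __getstate__(self):
        # Drop the live pipeline so only the factory is serialized.
        return {"_factory": self._factory, "_pipeline": None}

    def __call__(self, text):
        if self._pipeline is None:
            self._pipeline = self._factory()
        return self._pipeline(text)


def try_pickle(obj):
    """Return True if obj can be pickled, False otherwise."""
    try:
        pickle.dumps(obj)
        return True
    except (TypeError, pickle.PicklingError):
        return False


if __name__ == "__main__":
    print("raw pipeline picklable: ", try_pickle(FakePipeline()))
    lazy = LazyPipeline(FakePipeline)
    lazy("warm up the pipeline")  # pipeline now exists locally
    print("lazy wrapper picklable: ", try_pickle(lazy))
```

The idea, under the assumption above, is that the UDF would capture only the wrapper (or load the pipeline inside the function body), so that each worker builds its own pipeline instead of receiving a pickled copy from the driver. Whether this is enough for spacy-stanza on Databricks is not confirmed here.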