m3hrdadfi / sentence-transformers

Sentence Embeddings with ParsBERT
Apache License 2.0

Clustering notebook doesn't work #9

Open myprogrammerpersonality opened 6 months ago

myprogrammerpersonality commented 6 months ago

There seems to be a package version conflict between the new sentence-transformers release and the model loaded in the notebook. I first resolved the mismatch by installing emoji==1.7.0, but then ran into this:

ValueError                                Traceback (most recent call last)
in <cell line: 1>()
----> 1 from preprocessing import cleaning
      2 from IPython import display
      3
      4 import numpy as np
      5 import pandas as pd

3 frames
/content/preprocessing.py in
      5
      6 normalizer = hazm.Normalizer()
----> 7 tagger = hazm.POSTagger(model='./resources/postagger.model')
      8
      9

/usr/local/lib/python3.10/dist-packages/hazm/pos_tagger.py in __init__(self, model, data_maker, universal_tag)
     38         data_maker = self.data_maker if data_maker is None else data_maker
     39         self.is_universal = universal_tag
---> 40         super().__init__(model, data_maker)
     41
     42     def __universal_converter(self: "POSTagger", tagged_list):

/usr/local/lib/python3.10/dist-packages/hazm/sequence_tagger.py in __init__(self, model, data_maker)
     65     def __init__(self: "SequenceTagger", model=None, data_maker=data_maker) -> None:
     66         if model is not None:
---> 67             self.load_model(model)
     68         else:
     69             self.model = None

/usr/local/lib/python3.10/dist-packages/hazm/sequence_tagger.py in load_model(self, model)
    111         """
    112         tagger = Tagger()
--> 113         tagger.open(model)
    114         self.model = tagger
    115

pycrfsuite/_pycrfsuite.pyx in pycrfsuite._pycrfsuite.Tagger.open()

pycrfsuite/_pycrfsuite.pyx in pycrfsuite._pycrfsuite.Tagger._check_model()

pycrfsuite/_pycrfsuite.pyx in pycrfsuite._pycrfsuite.Tagger._check_model()

ValueError: Invalid model file './resources/postagger.model'
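
For anyone trying to reproduce this, it may help to record which versions are actually installed in the runtime. The snippet below is a minimal sketch using only the standard library. The distribution names in `PACKAGES` are assumptions inferred from the traceback and the pin mentioned above (`python-crfsuite` is the pip name of the `pycrfsuite` module), and `installed_versions` is a hypothetical helper, not part of this repository.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_versions(packages):
    """Map each distribution name to its installed version, or None if absent."""
    found = {}
    for name in packages:
        try:
            found[name] = version(name)
        except PackageNotFoundError:
            found[name] = None
    return found


# Assumed distribution names; adjust to whatever the notebook really installs.
PACKAGES = ["sentence-transformers", "hazm", "python-crfsuite", "emoji"]

for name, ver in installed_versions(PACKAGES).items():
    print(f"{name}: {ver or 'not installed'}")
```

Posting this output together with the traceback makes it easier to tell whether the model file or one of the libraries is the side that changed.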

myprogrammerpersonality commented 6 months ago

I fixed it by replacing the pos_tagger.model with a newer version downloaded from hazm.
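
The fix described above could be sketched roughly as follows. This is only an illustration under stated assumptions: `replace_model` and `can_load` are hypothetical helpers, the path of the downloaded file is up to you, and the target path is taken from the traceback. `can_load` relies on `pycrfsuite.Tagger.open`, the same call that raised the `ValueError` in the traceback, and imports the library lazily so the rest of the snippet works without it.

```python
import shutil
from pathlib import Path


def replace_model(new_model, target):
    """Copy new_model over target, keeping a .bak copy of any existing file.

    Returns the backup path, or None if there was nothing to back up.
    """
    new_model, target = Path(new_model), Path(target)
    target.parent.mkdir(parents=True, exist_ok=True)
    backup = None
    if target.exists():
        backup = target.with_name(target.name + ".bak")
        shutil.copy2(target, backup)
    shutil.copy2(new_model, target)
    return backup


def can_load(model_path):
    """Return True if python-crfsuite can open the model file."""
    import pycrfsuite  # imported lazily; requires python-crfsuite

    tagger = pycrfsuite.Tagger()
    try:
        tagger.open(str(model_path))
    except ValueError:
        # Same error as in the traceback: the file is not a usable model.
        return False
    tagger.close()
    return True
```

A possible way to use it: download the newer model from hazm, call `replace_model(<downloaded file>, "./resources/postagger.model")`, then confirm `can_load("./resources/postagger.model")` returns `True` before re-running `from preprocessing import cleaning`.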