lisja / KIK-LG211_HALM

Our group project for the course KIK-LG211 Building NLP Applications
2 stars 0 forks source link

#15 checking parser test #15

Closed haraldsDev closed 1 year ago

haraldsDev commented 1 year ago

Hi, @mikzolot I'm just checking the parser part you built.

There seems to be some edge cases:

I guess it all boils down to this line: article = re.sub(r"[^a-z\s]", "", article)

I couldn't find exaclty what happens with instances like "California.", "opposed;" etc where there is not a space but another character after the word - but it seems that the parser will just skip them.