WolfgangFahl / ProceedingsTitleParser

Shallow Semantic Parser to extract metadata from scientific proceedings titles
Apache License 2.0
3 stars 1 forks source link

Use Aho Corasic algorithm #68

Open WolfgangFahl opened 3 years ago

WolfgangFahl commented 3 years ago

see e.g. https://en.wikipedia.org/wiki/Aho%E2%80%93Corasick_algorithm

and a pure python library at: https://github.com/abusix/ahocorapy

compare to https://www.kaggle.com/c/cprod1/data?select=CPROD1_baseline.120707.pl