GateNLP / python-gatenlp

Python text processing, pattern matching, and NLP framework
https://gatenlp.github.io/python-gatenlp/
Apache License 2.0
62 stars 8 forks source link

orthomatcher python support #205

Open luisenriqueramos1977 opened 11 months ago

luisenriqueramos1977 commented 11 months ago

currently in Gate editor the orthomatcher is included by default, so I can identify aliases like: ibm -> International Business Machines.

Thus, I wonder if such functionality is currently available for python library?, or not?.

If available, where I could find related doc?

Luis Ramos

johann-petrak commented 11 months ago

There is currently no component in GateNLP which provides equivalent functionality to the GATE Orthomatcher. However, the orthomatcher provides more functionality than just the detection of aliases. If finding aliases from a list of known entity names is the major task, then perhaps something similar can be accomplished by using the string or token gazetteer.

luisenriqueramos1977 commented 11 months ago

HI Johann,

Many thanks for your answer.

Actually, my current goal is finding aliases from a list of known entity names.

As you say I can accomplish this with string or token gazetteer, the question is how?, is there any document?, code sample? that I could use.

Because in the documentation https://gatenlp.github.io/python-gatenlp/, there is explanation how to identify terms from gazetteers, but I do not find how to get aliases identified.

Best regards

El mié, 22 nov 2023 a las 12:56, Johann Petrak @.***>) escribió:

There is currently no component in GateNLP which provides equivalent functionality to the GATE Orthomatcher. However, the orthomatcher provides more functionality than just the detection of aliases. If finding aliases from a list of known entity names is the major task, then perhaps something similar can be accomplished by using the string or token gazetteer.

— Reply to this email directly, view it on GitHub https://github.com/GateNLP/python-gatenlp/issues/205#issuecomment-1822633322, or unsubscribe https://github.com/notifications/unsubscribe-auth/ALS7GADMFINBG45HUFKB2CDYFXR57AVCNFSM6AAAAAA7R3S42CVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMYTQMRSGYZTGMZSGI . You are receiving this because you authored the thread.Message ID: @.***>