[ ] list companynya perlu dibuat alias/nama yg simpel/sering muncul di berita supaya bisa menangkap company mentions secara luas. contoh: "Telkom Indonesia (Persero) Tbk" --> "Telkom" (case sensitive), "Bank Pembangunan Daerah Jawa T" --> "BPD Jatim" atau "BPD Jawa Timur". also no need for "Tbk" and/or "Persero" at the end. 1 company bisa >=1 alias
[x] sepertinya nama2nya banyak yg terpotong? contoh: "BJTM,Bank Pembangunan Daerah Jawa T"
Need a function in
nlp/ner.py
that:Each company mention consists of (start, end of the text position; company ID).
Alternatively:
@reinhack 23 Aug 2023 RE: pull request list company: