fhightower / ioc-finder

Simple, effective, and modular package for parsing observables (indicators of compromise (IOCs), network data, and other, security related information) from text. It uses grammars rather than regexes which makes it more readable, maintainable, and hackable. Explore our interactive documentation here: https://hightower.space/ioc-finder/
GNU Lesser General Public License v3.0
158 stars 42 forks source link

IOC Finder is not able to parse Unicode characters #298

Open shashank1503-cipher opened 2 months ago

shashank1503-cipher commented 2 months ago

Pyparsing library has the support for Unicode characters but we are not using it. Resulting iocs like warrıors[dot]com not getting parsed.

Need to add a keyword argument that accepts unicode characters and alters the set of characters used for parsing in ioc grammer

fhightower commented 1 month ago

Good call - thanks for noting this. I'll take a look into this.

shashank1503-cipher commented 1 month ago

Hi @fhightower , I would like to contribute to this issue. Please assign it to me.

fhightower commented 1 month ago

Done - thanks!