mitre / cti

Cyber Threat Intelligence Repository expressed in STIX 2.0
Other
1.71k stars 410 forks source link

enterprise-attack.json contains formatting issues (unescaped quotes) and illegal Unicode chars #201

Open kentmanley opened 2 years ago

kentmanley commented 2 years ago

I was trying to use the latest copy of https://github.com/vmapps/attack2neo project to load MITRE Enterprise ATT&CK into a Neo4j database, and received numerous errors.

1) There are embedded Unicode characters (Japanese or Korean) in a few of the descriptions. 2) There are Microsoft "smart quotes" both single and double in a few of the descriptions. 3) Numerous "description" entries (values) contain either multiple, un-escaped single and double quotes, which causes parsers to fail on those lines.

I can make a diff file if that would be helpful.

I have not double checked the Mobile or ICS versions yet.