mhtang1995 / BOPN

6 stars 1 forks source link

请问作者能否提供数据集或者这些数据集处理成json格式的方式呢?我在复现您的代码时在打开json格式的数据时总是报错,因为我自己下载的比如这些中文数据集的原始格式并不是json #1

Open Alieziar opened 9 months ago

Shaun-Wong commented 8 months ago

[ { "sentence": [ "AFP_ENG_20030428", ".", "0720", "NEWS", "STORY", "20030428", "NKorea", "offers", "to", "scrap", "nuke", ",", "missile", "programs", "but", "wants", "big", "concessions", ":", "US", "by", "Matthew", "Lee", "ATTENTION", "-", "UPDATES", "///", "WASHINGTON", ",", "April", "28", "(", "AFP", ")", "-", "North", "Korea", "has", "offered", "to", "scrap", "its", "nuclear", "weapons", "and", "missile", "programs", ",", "but", "only", "in", "return", "for", "\"", "considerable", "\"", "diplomatic", ",", "political", "and", "economic", "concessions", ",", "the", "United", "States", "said", "Monday", "." ], "ner": [ { "index": [ 6 ], "type": "GPE" }, { "index": [ 10 ], "type": "WEA" }, { "index": [ 12 ], "type": "WEA" }, { "index": [ 19 ], "type": "GPE" }, { "index": [ 21, 22 ], "type": "PER" }, { "index": [ 27 ], "type": "GPE" }, { "index": [ 32 ], "type": "ORG" }, { "index": [ 35, 36 ], "type": "GPE" }, { "index": [ 41 ], "type": "GPE" }, { "index": [ 41, 42, 43 ], "type": "WEA" }, { "index": [ 45 ], "type": "WEA" }, { "index": [ 63, 64, 65 ], "type": "GPE" } ] }] 这种格式的