Question about result evaluation

light8lee commented 4 years ago

Why does the gold data you generated for the CDR dataset contain 1:NR:2 pairs? I think it should only include pairs of type 1:CID:2. For example, you generated the following lines as gold data:

8701013|D015738|D003693|NON-CROSS|1:CID:2
8701013|D015738|D014456|NON-CROSS|1:NR:2
439781|D007213|D007022|NON-CROSS|1:CID:2
439781|D012964|D007022|NON-CROSS|1:NR:2
439781|D011453|D007022|CROSS|1:NR:2
439781|D000809|D007022|CROSS|1:NR:2

Why not just:

8701013|D015738|D003693|NON-CROSS|1:CID:2
439781|D007213|D007022|NON-CROSS|1:CID:2

fenchri commented 4 years ago

During evaluation I consider only the positive pairs; I generated the NR pairs for some analysis I did. If you check the evaluation.py script, you will see that the NR pairs are actually skipped.
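To illustrate the idea, here is a minimal sketch of how negative pairs can be skipped when scoring. It is not the code from evaluation.py: the function names `positive_pairs` and `prf` are hypothetical, and it assumes each line has the five `|`-separated fields shown in the question (document id, two entity ids, CROSS/NON-CROSS flag, label).

```python
def positive_pairs(lines, negative_label="1:NR:2"):
    """Collect (doc_id, arg1, arg2) keys for every line that is not a negative pair."""
    pairs = set()
    for line in lines:
        line = line.strip()
        if not line:
            continue
        doc_id, arg1, arg2, _cross, label = line.split("|")
        if label == negative_label:
            continue  # NR pairs are ignored during scoring
        pairs.add((doc_id, arg1, arg2))
    return pairs


def prf(gold, pred):
    """Precision, recall and F1 over two sets of positive pairs."""
    tp = len(gold & pred)
    precision = tp / len(pred) if pred else 0.0
    recall = tp / len(gold) if gold else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1


gold_lines = [
    "8701013|D015738|D003693|NON-CROSS|1:CID:2",
    "8701013|D015738|D014456|NON-CROSS|1:NR:2",
    "439781|D007213|D007022|NON-CROSS|1:CID:2",
    "439781|D012964|D007022|NON-CROSS|1:NR:2",
    "439781|D011453|D007022|CROSS|1:NR:2",
    "439781|D000809|D007022|CROSS|1:NR:2",
]

# Only the two CID pairs survive the filter.
gold = positive_pairs(gold_lines)
```

With this filtering, a gold file that contains NR lines gives the same scores as one that lists only the CID lines.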

BTW, the script currently works only for binary classification problems; I will make it more general soon.

light8lee commented 4 years ago

OK, I found that part. You are right.

fenchri / edge-oriented-graph

Question about result evaluation #11