11693-2 / project-team02

Software Method Team Project
4 stars 1 forks source link

Error Analysis #14

Open xiliuhk opened 9 years ago

xiliuhk commented 9 years ago

Methodology

First of all, we need to evaluate the overall performance of the system.
Secondly, we need to evaluate the performance of each classifier.
Thirdly, we need to do error analysis for each stage on the pipeline, and adjust the model.
Finally, we iterate the three steps above until a satisfying performance is achieved.

Evaluation Targets

We need to perform evaluation on the following items.

Retrieved items Unordered retrieval measures Ordered retrieval measures
concepts mean percision, recall, F-measure MAP,GMAP
articles mean percision, recall, F-measure MAP,GMAP
triples mean percision, recall, F-measure MAP,GMAP

Flat Evaluation

We need to perform evaluations for each classifier. The following measures should be taken.

The classification is hierarchical so flat evaluation measures do not work sufficiently. In the multiple levels of classification, once there is an error in one classifier the final result is incorrect. Flat measures fail for this case because we could not tell the cause of the error by its evaluation. So we need to design a hierarchy of measurements, taking the relations and performance for each classifier into consideration. Kiritchenko et al. proposed a hierarchical precision as:

For the general evaluation, flat micro-F1 measure will be used.