HazyResearch / mindbender

Tools for iterative knowledge base development with DeepDive

is_correct counts disappear after restarting mindtagger #54

Closed xiaoling closed 8 years ago

xiaoling commented 8 years ago

I suppose mindtagger reloads the annotations from tags.json every time it restarts. The is_correct annotations are still there, but the distribution of is_correct values is missing. Please see the attached screenshot.

-Xiao

[screenshot: screen shot 2015-09-29 at 7 13 32 pm]

xiaoling commented 8 years ago

Ok. I noticed that my tags.json file contained additional annotations on instances other than the ones in input.csv. After I removed these annotations, the is_correct stats re-appeared.

zhangce commented 8 years ago

@xiaoling Sorry for the late reply. I think @netj is traveling right now, and we will get back to you later today.

netj commented 8 years ago

@xiaoling @raphaelhoffmann Sorry this took so long!

It was partly due to the UNKNOWN value for the tag, which caused it to be recognized as a free-text type tag, so the histogram was dropped. Now, tags with up to 10 distinct values will be treated as "simple" tags, keeping these histograms.
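
For illustration only, the rule described above could be sketched roughly as follows. This is not Mindtagger's actual code: the function name `summarize_tag`, the constant `MAX_SIMPLE_VALUES`, and the shape of the returned dictionary are all hypothetical, and only the threshold of 10 distinct values comes from the comment above.

```python
from collections import Counter

# Threshold taken from the comment above; the name is hypothetical.
MAX_SIMPLE_VALUES = 10


def summarize_tag(values, max_simple_values=MAX_SIMPLE_VALUES):
    """Classify a tag by its number of distinct values.

    Tags with at most `max_simple_values` distinct values are treated as
    "simple" and keep a histogram; anything else is treated as free text
    and gets no histogram. Missing values (None) are ignored.
    """
    counts = Counter(v for v in values if v is not None)
    if len(counts) <= max_simple_values:
        return {"type": "simple", "histogram": dict(counts)}
    return {"type": "free-text", "histogram": None}


# An is_correct tag that also carries an UNKNOWN value still has only
# three distinct values, so it stays "simple" and keeps its histogram.
is_correct_values = [True, False, True, "UNKNOWN", None, True]
summary = summarize_tag(is_correct_values)

# A tag with 11 distinct values is over the threshold and is treated as free text.
many_values = [f"note {i}" for i in range(11)]
free_text_summary = summarize_tag(many_values)
```

Under this sketch, adding an UNKNOWN value to a boolean tag only raises the distinct-value count from two to three, which stays well under the threshold.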