Open dilekc opened 7 years ago
Has this been resolved?
Here is my response (sent by email), posted here for everyone else as well:
Thanks so much for raising the flag. I think in this case it is not a bug, but due to some idiosyncrasies of the data:
All paragraphs that contain exactly the same (textual) content share the same id (i.e., they are indistinguishable)
This happens quite a bit, especially for short paragraphs such as "We use the example below."
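To illustrate why identical text ends up with one id, here is a minimal sketch. It assumes the id is a SHA-1 hex digest of the UTF-8 paragraph text, which matches the 40-character hex format of the ids but is not confirmed in this thread; the function name `paragraph_id` is hypothetical, and the real pipeline may normalize text differently.

```python
import hashlib


def paragraph_id(text: str) -> str:
    # Assumption: the id is derived purely from the paragraph text.
    # SHA-1 is a guess based on the 40-hex-character ids in the data.
    return hashlib.sha1(text.encode("utf-8")).hexdigest()


# Two paragraphs with the same text, e.g. taken from different pages.
a = paragraph_id("We use the example below.")
b = paragraph_id("We use the example below.")
# A paragraph that differs by a single character.
c = paragraph_id("We use the example below:")

same_id = a == b        # identical text cannot be told apart by id
different_id = a != c   # any textual difference gives a new id
```

Under this assumption, a short sentence that occurs on several pages is a single paragraph as far as the qrels are concerned.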
You may want to look at the contents of "83423c198b6099edba08f185f940042d5dba3b79" to see if this is the case here also.
Please do let me know if you see something suspicious.
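One way to look for such cases is to group the qrels by paragraph id and list every paragraph that is marked relevant for more than one section. The sketch below assumes the usual four-column TREC qrels layout (`section_id 0 paragraph_id relevance`, whitespace separated); the function names and the sample ids are made up for illustration.

```python
from collections import defaultdict


def sections_per_paragraph(qrels_lines):
    """Map each paragraph id to the set of section ids it is relevant for."""
    sections = defaultdict(set)
    for line in qrels_lines:
        parts = line.split()
        if len(parts) != 4:
            continue  # skip blank or malformed lines
        section_id, _, para_id, rel = parts
        if int(rel) > 0:
            sections[para_id].add(section_id)
    return sections


def multi_section_paragraphs(qrels_lines):
    """Keep only paragraphs that are relevant for more than one section."""
    grouped = sections_per_paragraph(qrels_lines)
    return {p: s for p, s in grouped.items() if len(s) > 1}


# Made-up sample lines, not taken from the real qrels file.
sample = [
    "PageA/Heading1 0 para1 1",
    "PageB/Heading2 0 para1 1",
    "PageA/Heading1/Heading1.1 0 para2 1",
]
dupes = multi_section_paragraphs(sample)
```

Because the functions accept any iterable of lines, an open file handle for the qrels file can be passed in directly. Paragraphs reported this way are candidates for the shared-content effect described above, so their text is worth checking before treating them as errors.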
The paragraph 83423c198b6099edba08f185f940042d5dba3b79 is annotated as relevant to more than one section_id, although the track web page states the following:
*.cbor.hierarchical.qrels: every paragraph is relevant only for its leaf most specific section (example: PageName/Heading1/Heading1.1 - but not PageName/Heading1!)
yields the following output