allenai / bff

Apache License 2.0
37 stars 8 forks source link

bff_duplicate_spans are index in bytes not characters #5

Open IanMagnusson opened 1 year ago

IanMagnusson commented 1 year ago

One thing that might be worth documenting when we get a chance is that the "bff_duplicate_spans" that are created by the --annotate-only are byte spans rather than character spans as a python person such as myself might first assume.