Closed tahmedge closed 4 years ago
Hi Tahmid, it might be better to ask this question on the BERT baseline repository, rather than this dataset repository.
I believe that the original BERT baseline ignored HTML tokens, but I'm not sure how that was implemented.
I cannot understand the char offset included in the BERT-BASELINE code. Why HTML tokens were not considered while calculating the offset values. Can you please elaborate?