Open ghostfly23333 opened 8 months ago
Hi @lu-wang-g
Asking here for code review as I'm not allowed to assign you as reviewer in the reviewers sidebar.
Also if the example is ought to be the origin version, feel free to comment below! Have a good day! ghostfly23333
In the Bert QA testcase provided by README.md, I think the context is seperated by different types of whitespace. I also print the results of the subwords of tokens. It shows these.
So the tokens are not correctly been seperated by the absl::StrSplit() in https://github.com/tensorflow/tflite-support/blob/2ab77502e1f2937923ef105547c1196a1e81a1c4/tensorflow_lite_support/cc/task/text/bert_question_answerer.cc#L205
I check the hex values of the origin context string it shows typos about whitespace in the context and I patch the issue.