UAlbertaALTLab / korp-config

0 stars 0 forks source link

HTML rendition of forward slash appears as HTML code, not as the actual character. #4

Open aarppe opened 1 month ago

aarppe commented 1 month ago

In some exploratory search, the HTML code / for forward slash / appears as-is in the corpora, rather than as /. For instance:

image

I'm wondering if the conversion of special characters to HTML code should not apply to the first column with the token, but for all the other fields?

fbanados commented 6 days ago

Now it appears as / in corpus search. However, it seems that the simple search conversions are not working right. The following advanced searches work:

[word = "/"]

[lemma = "/"]

But simple search is generating [word = "/"], which is failing.