Closed BrightXiaoHan closed 1 year ago
I'm working on a machine translation task. When I encode my corpus with bpemb, the output is always lowercase. Is it possible to retain case information after encoding my corpus?
Sorry, I somehow missed this issue. But no, all embeddings are uncased (lowercase only).
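Since the embeddings are uncased, one possible workaround is to keep the case information outside of bpemb: record a case tag per word before lowercasing, then reapply the tags afterwards. The sketch below is only an illustration of that idea in plain Python. It is not part of bpemb, and the helper names `split_case` and `restore_case` and the tag letters are hypothetical.

```python
def split_case(text):
    """Lowercase whitespace-separated words and record a case tag for each.

    Hypothetical helper, not part of bpemb.
    """
    words, tags = [], []
    for word in text.split():
        if word.isupper() and len(word) > 1:
            tag = "U"  # all caps, e.g. "NASA"
        elif word[0].isupper():
            tag = "T"  # leading capital, e.g. "Berlin" or "I"
        else:
            tag = "L"  # lowercase; mixed case like "iPhone" also lands here
        words.append(word.lower())
        tags.append(tag)
    return words, tags


def restore_case(words, tags):
    """Reapply the recorded case tags to lowercased words."""
    out = []
    for word, tag in zip(words, tags):
        if tag == "U":
            out.append(word.upper())
        elif tag == "T":
            out.append(word[:1].upper() + word[1:])
        else:
            out.append(word)
    return " ".join(out)


words, tags = split_case("Hello NASA from Berlin")
# The lowercased words can be joined and passed to the encoder;
# the tags are kept alongside them.
restored = restore_case(words, tags)
```

This only round-trips a single sentence and is lossy for mixed-case words such as "iPhone". For translation, the output words differ from the input words, so the case of the output would have to be predicted by the model or restored by a separate truecasing step, which this sketch does not cover.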