caikit / caikit-nlp

Apache License 2.0

Tokenization with TGIS models resulting in bad wiretype #350

evaline-ju closed this issue 4 months ago

evaline-ju commented 4 months ago

Describe the bug

Tokenization tasks were added in #325 for modules that use TGIS, in order to leverage the TGIS Tokenize endpoint. However, a gRPC TokenizeTaskPredict call against a TGIS model fails with "grpc: failed to unmarshal the received message proto: bad wiretype". Over REST, a request to /tokenization returns a generation result rather than a tokenization result. The TGIS Generate endpoint appears to be used instead of the expected Tokenize one.
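One plausible reading of the gRPC error is that a generation response is being decoded as a tokenization response. In the protobuf wire format each field is prefixed by a tag equal to `(field_number << 3) | wire_type`, so two messages that reuse a field number with different types cannot be decoded as one another. The sketch below illustrates that mechanism in plain Python. The two schemas and the helper names (`encode_field`, `check_wire_types`) are hypothetical and are not the actual caikit or TGIS message definitions.

```python
# Protobuf wire types used in this sketch (values from the encoding spec).
VARINT, LEN = 0, 2

# Hypothetical schemas mapping field number -> expected wire type.
# They are NOT the real caikit / TGIS messages.
GENERATION_SCHEMA = {1: LEN}       # field 1: generated text (string)
TOKENIZATION_SCHEMA = {1: VARINT}  # field 1: token count (integer)


def encode_varint(value):
    """Encode a non-negative integer as a base-128 varint."""
    out = bytearray()
    while True:
        byte = value & 0x7F
        value >>= 7
        if value:
            out.append(byte | 0x80)  # more bytes follow
        else:
            out.append(byte)
            return bytes(out)


def decode_varint(buf, pos):
    """Decode a varint starting at pos; return (value, next_pos)."""
    result = 0
    shift = 0
    while True:
        byte = buf[pos]
        pos += 1
        result |= (byte & 0x7F) << shift
        if not byte & 0x80:
            return result, pos
        shift += 7


def encode_field(field_number, wire_type, payload):
    """Encode one field: tag, then a varint or a length-delimited payload."""
    tag = encode_varint((field_number << 3) | wire_type)
    if wire_type == LEN:
        return tag + encode_varint(len(payload)) + payload
    return tag + encode_varint(payload)


def check_wire_types(buf, schema):
    """Walk the fields in buf and reject any wire type the schema does not expect."""
    pos = 0
    while pos < len(buf):
        tag, pos = decode_varint(buf, pos)
        field_number, wire_type = tag >> 3, tag & 0x7
        expected = schema.get(field_number)
        if expected is not None and expected != wire_type:
            raise ValueError(
                f"bad wiretype for field {field_number}: "
                f"got {wire_type}, expected {expected}"
            )
        # Skip over the payload so the next tag can be read.
        if wire_type == VARINT:
            _, pos = decode_varint(buf, pos)
        elif wire_type == LEN:
            length, pos = decode_varint(buf, pos)
            pos += length
        else:
            raise ValueError(f"wire type {wire_type} not handled in this sketch")


# A "generation" payload decodes under its own schema...
generation_bytes = encode_field(1, LEN, b"hello")
check_wire_types(generation_bytes, GENERATION_SCHEMA)

# ...but is rejected when read as a "tokenization" message.
try:
    check_wire_types(generation_bytes, TOKENIZATION_SCHEMA)
except ValueError as err:
    print(err)
```

If this is what happens here, the error would be a symptom of the request being routed to Generate, consistent with the REST behaviour described above, and not a problem in the tokenization message definition itself.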

Platform

Please provide details about the environment you are using, including the following:

Sample Code

Call TokenizeTaskPredict with a TGIS model specified as the mm-model-id.

Expected behavior

A tokenization result is returned.

Observed behavior

A "bad wiretype" error (gRPC) or a generation result (REST) is returned instead.