Xilinx / inference-server

https://xilinx.github.io/inference-server/
Apache License 2.0
43 stars 13 forks source link

Support raw content in gRPC #78

Open varunsh-xilinx opened 2 years ago

varunsh-xilinx commented 2 years ago

The KServe spec for gRPC uses a field called raw_*_content to manage data that can't be easily represented by native types in gRPC (e.g. FP16). For such fields, the inference server currently converts the type to/from the closest valid type when data is converted from/to gRPC. Using the raw content field can allow native processing of these data types but it needs special handling logic on the server since the data is always assumed to be part of the InputTensor rather than in the request itself.