Closed eolivelli closed 1 year ago
Summary: when running in Docker, we use threads to simulate pods, so the pipelines share a single JVM. In this setup there are some issues with PyTorch/DJL and classloading. This patch fixes them and moves the Ollama example to HuggingFace.