elastic / eland

Python Client and Toolkit for DataFrames, Big Data, Machine Learning and ETL in Elasticsearch
https://eland.readthedocs.io
Apache License 2.0
635 stars 98 forks source link

Add the required packages for BERT Japanese Tokenizer to the Eland docker image #709

Open davidkyle opened 2 months ago

davidkyle commented 2 months ago

The BERT Japanese Tokenizer requires the following packages that are not including in the Docker image.

The list comes from the PR description where BERT Japanese was added https://github.com/elastic/eland/pull/529