ADAH-EviDENce / NewsReader

Docker build of full NewsReader pipeline in Dutch.
Apache License 2.0
2 stars 4 forks source link

Dockerfile: Remove redundant non-Dutch data and models #20

Closed MartineDeVos closed 6 years ago

MartineDeVos commented 6 years ago

The resulting Docker container of the Newsreader pipeline will probably very big, as it contains data and models on multi languages. We expect the container size to be a lot smaller if we only keep the data and models that are needed for the processing of Dutch documents

wmkouw commented 6 years ago

Dockerfile now removes irrelevant languages and models.