opensemanticsearch / open-semantic-etl

Python based Open Source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (Entity Extraction & Named Entity Recognition) & data enrichment (annotation) pipelines & ingestor to Solr or Elastic search index & linked data graph database
https://opensemanticsearch.org/etl
GNU General Public License v3.0
255 stars 69 forks source link

Document Crawl have not changed for days #134

Open movanet opened 3 years ago

movanet commented 3 years ago

It has been like this for days, the number of imported document have not changed. The system doesn't freeze, it just that it stays on 584029 documents to extract and analyze and 2491 documents to OCR for days.

image

I am running this on Proxmox

image