opensemanticsearch / open-semantic-etl

Python based Open Source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (Entity Extraction & Named Entity Recognition) & data enrichment (annotation) pipelines & ingestor to Solr or Elastic search index & linked data graph database
https://opensemanticsearch.org/etl
GNU General Public License v3.0
254 stars 69 forks source link

No additional ETL errors by following plugins, if main plugin failed #104

Closed opensemanticsearch closed 4 years ago

opensemanticsearch commented 4 years ago

We dont need additional ETL errors for plugins like enhance_contenttype_group, enhance_warc and so on, if error is result of yet statused failure of main plugin like Tika extraction.

Mandalka commented 4 years ago

Implemented for plugin enhance_contenttype_group

Mandalka commented 4 years ago

Implemented for plugin enhance_warc

Mandalka commented 4 years ago

Implemented for plugin enhance_rdf

Mandalka commented 4 years ago

Implemented for optional plugin enhance_pdf_ocr

Mandalka commented 4 years ago

Implemented for optional plugin enhance_pdf_page