Python based Open Source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (Entity Extraction & Named Entity Recognition) & data enrichment (annotation) pipelines & ingestor to Solr or Elastic search index & linked data graph database
Law code subcodes in text like "a b c § 123 Abs. 3 d e f" should be extracted to multiple law codes "§ 123" and "§ 123 Abs. 3", so texts can be overviewed/filtered more general and deeper by law code taxonomy.
Law code subcodes in text like "a b c § 123 Abs. 3 d e f" should be extracted to multiple law codes "§ 123" and "§ 123 Abs. 3", so texts can be overviewed/filtered more general and deeper by law code taxonomy.