opensemanticsearch / open-semantic-etl

Python-based open source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (entity extraction and named entity recognition), and data enrichment (annotation) pipelines, with ingestion into a Solr or Elasticsearch index and a linked data graph database
https://opensemanticsearch.org/etl
GNU General Public License v3.0
254 stars 69 forks

Law code subreferences / taxonomy #124

Open opensemanticsearch opened 4 years ago

opensemanticsearch commented 4 years ago

Law code subreferences in text such as "a b c § 123 Abs. 3 d e f" should be extracted as multiple law codes, "§ 123" and "§ 123 Abs. 3", so that texts can be browsed and filtered through the law code taxonomy at both the general level (the section) and the more specific level (the subsection).
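A minimal sketch of the requested behaviour, not the project's actual implementation: the regular expression and the function name `extract_law_codes` are hypothetical, and the pattern only covers the "§ <number> Abs. <number>" form from the example above.

```python
import re

# Hypothetical pattern (an assumption, not taken from open-semantic-etl):
# "§ <number>" optionally followed by "Abs. <number>".
LAW_CODE = re.compile(r"§\s*(\d+[a-z]?)(?:\s+Abs\.\s*(\d+))?")


def extract_law_codes(text):
    """Return every law code found in text, from general to specific.

    A reference with a subsection yields both the parent section and the
    full subreference, so both taxonomy levels can be used as filters.
    """
    codes = []
    for match in LAW_CODE.finditer(text):
        section, subsection = match.group(1), match.group(2)
        levels = [f"§ {section}"]
        if subsection:
            levels.append(f"§ {section} Abs. {subsection}")
        for code in levels:
            # Keep first-seen order and skip duplicates.
            if code not in codes:
                codes.append(code)
    return codes


print(extract_law_codes("a b c § 123 Abs. 3 d e f"))
# → ['§ 123', '§ 123 Abs. 3']
```

Deeper levels (for example sentence or number references) would need further optional groups in the pattern, each adding one more entry to the list of levels.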