Solr doesn't provide upsert behavior out of the box. Added a `dedupe_id` field in Solr that stores `SHA256(crawl_id-url)`.
Also commented out `overrides: id` in solr-schema-map.yaml so that `id` is included in the field formatting for Solr. Note that this `id` field comes from Tika metadata.
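
For reviewers, a minimal sketch of how such a `dedupe_id` value could be derived. This is illustrative only and is not the code in this PR: the function name, the literal `-` separator, UTF-8 encoding, and the hex-encoded digest are all assumptions read from `SHA256(crawl_id-url)`.

```python
import hashlib


def make_dedupe_id(crawl_id: str, url: str) -> str:
    """Hypothetical helper: hex SHA-256 digest of '<crawl_id>-<url>'."""
    # Assumption: crawl_id and url are joined with a single hyphen.
    key = f"{crawl_id}-{url}"
    # Assumption: the key is UTF-8 encoded and the digest stored as hex.
    return hashlib.sha256(key.encode("utf-8")).hexdigest()


if __name__ == "__main__":
    # The same (crawl_id, url) pair always yields the same id, so a
    # re-fetched page within one crawl maps to the same dedupe_id.
    print(make_dedupe_id("crawl-1", "http://example.com/"))
```

Because the crawl id is part of the hashed key, the same URL fetched in a different crawl gets a different `dedupe_id`.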
Closes #71 and closes #72