Closed jshifrin25 closed 4 years ago
I cannot reproduce. Which version are you using? Can you send me a sample config that has the minimum settings to reproduce your issue?
I am way using version 2.9.0 of the importer with the following in the configuration.
<tagger class="${taggerBase}.impl.DOMTagger">
<restrictTo field="document.contentType">text/html</restrictTo>
<dom selector="meta#MetaKeywords" toField="keywords" extract="attr(content)" overwrite="true"/>
</tagger>
<tagger class="${taggerBase}.impl.SplitTagger">
<split fromField="keywords"
regex="true">
<separator>\s*,\s*</separator>
</split>
</tagger>
I tried reproducing again without success. Can you please attach an HTML causing the issue for you?
I would like to use the SplitTagger to replace a metadata field as multiple values and remove the original comma delimited string. Currently, the SplitTagger appends the array of strings to a list containing the original String. I would like there to be an option to remove the original string from the list.