larsga / Duke

Duke is a fast and flexible deduplication engine written in Java
Apache License 2.0

Supporting <param> with value taken from the child text node #233

Open marco-brandizi opened 7 years ago

marco-brandizi commented 7 years ago

I have a problem using '<' and '>' with the SPARQL data source:

<sparql>
  <param name = "query" value = "prefix dc: <http://purl.org/dc/elements/1.1/>..." />
 ...
</sparql>

This isn't valid XML (a literal '<' is not allowed inside an attribute value), and in fact it yields a parse error. Escaping the characters as &lt; and &gt; fixes it, but that is very inconvenient, even with search/replace. A quick solution might be to allow the param value to be specified via a child text node, i.e., <param name = "query">VALUE</param>.
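
To make the proposal concrete, here is a minimal sketch of the fallback logic, written against the JDK's DOM API rather than Duke's actual configuration loader. The class `ParamValueSketch` and the helpers `paramValue` and `parse` are hypothetical names used only for illustration; they are not part of Duke. The sketch assumes the query would be wrapped in a CDATA section, since a literal '<' in plain element text would still need escaping.

```java
import java.io.StringReader;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Element;
import org.xml.sax.InputSource;

public class ParamValueSketch {

    // Hypothetical helper (not Duke's API): use the 'value' attribute when
    // it is present, otherwise fall back to the element's text content.
    static String paramValue(Element param) {
        if (param.hasAttribute("value")) {
            return param.getAttribute("value");
        }
        // getTextContent() concatenates text and CDATA children.
        return param.getTextContent().trim();
    }

    // Parses an XML string and returns its root element.
    static Element parse(String xml) throws Exception {
        return DocumentBuilderFactory.newInstance()
                .newDocumentBuilder()
                .parse(new InputSource(new StringReader(xml)))
                .getDocumentElement();
    }

    public static void main(String[] args) throws Exception {
        // Existing style: value in an attribute, '<' and '>' escaped.
        Element viaAttribute = parse(
            "<param name=\"query\" value=\"prefix dc: "
            + "&lt;http://purl.org/dc/elements/1.1/&gt;...\"/>");

        // Proposed style: value as child text, wrapped in CDATA so that
        // '<' and '>' can be written literally.
        Element viaText = parse(
            "<param name=\"query\"><![CDATA[prefix dc: "
            + "<http://purl.org/dc/elements/1.1/>...]]></param>");

        System.out.println(paramValue(viaAttribute));
        System.out.println(paramValue(viaText));
    }
}
```

Both forms yield the same string, so existing configurations that use the `value` attribute would keep working under this scheme.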