While using Combine to analyze ~26k XML files of a relatively unknown structure, got the following from a naive field mapping:
Unfortunately, this XML contains elements that only serve a presentation function, e.g. <italic>, which don't provide any semantic meaning.
It would be nice if field mappings configurations, xml2kvp, would accept some kind of configuration to ignore child elements of a targeted element. Or, better yet, take all text and child elements of a target node and convert to string.
In this example, it would be beneficial to stop at:
While using Combine to analyze ~26k XML files of a relatively unknown structure, got the following from a naive field mapping:
Unfortunately, this XML contains elements that only serve a presentation function, e.g.
<italic>
, which don't provide any semantic meaning.It would be nice if field mappings configurations, xml2kvp, would accept some kind of configuration to ignore child elements of a targeted element. Or, better yet, take all text and child elements of a target node and convert to string.
In this example, it would be beneficial to stop at:
and produce only raw text for all child text and elements.