gchq / stroom

Stroom is a highly scalable data storage, processing and analysis platform.
https://gchq.github.io/stroom-docs/
Apache License 2.0
424 stars 58 forks source link

Split TextConverter into DataSplitter and XMLFragmentTemplate #4113

Open at055612 opened 4 months ago

at055612 commented 4 months ago

(or some better names)

The current approach of TextConverter being used for both DataSplitter and XMLFragmentParser is not ideal. It means if the user is adding a doc to their DSParser element then they need to rely on document names to ignore the frag parser ones. It also causes problems for stroom knowing what editor snippets to use as it doesn't know what a TC is to be used for unless the type has been set.

Much better to have two dedicated doc types. We can then deprecate the TextConverter and make it so you can't create it. CombinedParser should also be deprecated.