Closed stefan-pdx closed 8 years ago
Ah, after looking at the implementation of the default txt Chunker, I see it treats each line as a separate zone. I assume that a custom Chunker has to be written.
For others who come across a similar question, the documentation briefly talks about how to create additional workers for accomplishing something like this.
Hi,
I'm relatively new to Treat and am trying to figure out how to chunk text across multiple lines. For example, given the document:
When chunking that text, Treat treats (da boom CHING) each line as a separate paragraph:
I would expect there to be two paragraphs. Does Treat support this parsing behavior? Are there any strategies that could be used in pre-processing to join line returns?
Thanks!
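For the pre-processing route, here is a minimal sketch in plain Ruby. It assumes paragraphs in the source file are separated by at least one blank line, and that every other line return is just a hard wrap. `join_wrapped_lines` is a hypothetical helper name, not part of Treat, and the sample text is made up for illustration.

```ruby
# Hypothetical helper (not part of Treat): joins hard-wrapped lines so that
# each paragraph ends up on a single line.
# Assumption: paragraphs are separated by one or more blank lines.
def join_wrapped_lines(text)
  text
    .gsub(/\r\n?/, "\n")        # normalize Windows/old-Mac line endings
    .split(/\n[ \t]*\n+/)       # split into paragraphs on blank lines
    .map do |para|
      # Collapse the wrapped lines of one paragraph into a single line.
      para.split("\n").map(&:strip).reject(&:empty?).join(' ')
    end
    .reject(&:empty?)           # drop paragraphs that were only whitespace
    .join("\n")                 # one paragraph per line
end

sample = "First line\nof paragraph one.\n\nSecond paragraph\ncontinues here.\n"
puts join_wrapped_lines(sample)
```

Since the default txt Chunker appears to treat each line as a separate zone, writing the joined text back out to a .txt file before handing it to Treat should give one zone per paragraph. I have not verified this against every Treat version, so treat it as a starting point rather than a confirmed fix.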