Closed azu closed 1 year ago
I noticed that some case is difficult.
彼は「コレ」と読んだ
I think that 「コレ」
is not a sentence.
sentence-splitter can not detect it.
It should be opt-in feature.
We can not detect which is better.
Probably, These are rule implemetation bug. Not this library
We are talking about pens.
He said "This is a pen. I like it".
I could relate to that statement.
Current parser parse it following sentences.
Second sentence has "This is a pen. I like it"
, but we can not split it into new sentence.
The conversation text is just Str
node.
HTML does not have suitable semantics for conversation.
As a result, sentence-splitter can not support nesting sentence. Probably, rule implementation should handble the quote text after parsing sentences by sentence-splitter.
We will close this issue by adding current behavior.
Currently, sentence-splitter parse the text as follows:
We want to support nesting sentences.
PairMarker is related https://github.com/azu/sentence-splitter/blob/master/src/parser/PairMaker.ts
AST Design
Related