Currently, the function splits a given text into pieces based on Chinese and English punctuation marks. The punctuation marks include
{ ",", "。", "!", "?", "、", ";", ":", "“", "”", "‘", "’", "(", ")", "【", "】", "《", "》", "——", "……", "·", ",", ".", "!", "?", ";", ":", "\"", "\"", "'", "'", "(", ")", "[", "]", "<", ">", "-", "...", ".", "\n", "\t", "\r", };
However, we have found that certain symbols, including quotation marks and parentheses, do not need to split sentences and only need to be filtered. Therefore, we will revies the logic accordingly.
Currently, the function splits a given text into pieces based on Chinese and English punctuation marks. The punctuation marks include
{ ",", "。", "!", "?", "、", ";", ":", "“", "”", "‘", "’", "(", ")", "【", "】", "《", "》", "——", "……", "·", ",", ".", "!", "?", ";", ":", "\"", "\"", "'", "'", "(", ")", "[", "]", "<", ">", "-", "...", ".", "\n", "\t", "\r", };
However, we have found that certain symbols, including quotation marks and parentheses, do not need to split sentences and only need to be filtered. Therefore, we will revies the logic accordingly.