apinge / MeloTTS.cpp

A lightweight pure C++ Text-to-Speech (TTS) pipeline with OpenVINO, supporting mixed English and Chinese languages.
Apache License 2.0
16 stars 4 forks source link

Fix TTS::split_sentences_into_pieces to avoid splitting at certain punctuation marks #7

Open apinge opened 2 weeks ago

apinge commented 2 weeks ago

Currently, the function splits a given text into pieces based on Chinese and English punctuation marks. The punctuation marks include { ",", "。", "!", "?", "、", ";", ":", "“", "”", "‘", "’", "(", ")", "【", "】", "《", "》", "——", "……", "·", ",", ".", "!", "?", ";", ":", "\"", "\"", "'", "'", "(", ")", "[", "]", "<", ">", "-", "...", ".", "\n", "\t", "\r", }; However, we have found that certain symbols, including quotation marks and parentheses, do not need to split sentences and only need to be filtered. Therefore, we will revies the logic accordingly.