segment-any-text / wtpsplit

Toolkit to segment text into sentences or other semantic units in a robust, efficient and adaptable way.
MIT License
695 stars 39 forks source link

remove_repetition #111

Closed mmichelli closed 9 months ago

mmichelli commented 9 months ago

When I run remove_repetition(4) I sometimes get an error:

remove_repetition self.remove_word(all_words[j], False, verbose=False)

line 1545, in remove_word raise ValueError('word not in result') ValueError: word not in result

bminixhofer commented 9 months ago

What is remove_repetition? This does not look related to the wtpsplit code to me? Please reopen if I am missing something.

mmichelli commented 9 months ago

So sorry, I posted the issue to the wrong repo.