patrickenfuego / Chapterize-Audiobooks

Split a single, monolithic mp3 audiobook file into chapters using Machine Learning and ffmpeg.
Apache License 2.0
86 stars 14 forks source link

Add Additional Language Support #11

Open patrickenfuego opened 11 months ago

patrickenfuego commented 11 months ago

In a previous release, I modularized the project so it can leverage multiple different languages dynamically. I need help from people who speak those languages to fill out the excluded phrases and chapter separators so more people can use this tool.

Gugna commented 9 months ago

Hello! How exactly could I help? I'm Greek and I have rudimentary coding knowledge(working on it) and would love to help if you could let me know how.

patrickenfuego commented 6 months ago

Sorry for the delay! I've been very busy unfortunately. I would be very happy to help you contribute Greek! The way I've done things is by creating a separate list of standard chapter break keywords, and another list of words that may contain parts of the words in the original list but are actually false positive results. Let me know if that makes sense, and if so, I'll send you a few examples of what it might look like!