glut23 / webvtt-py

Read, write, convert and segment WebVTT caption files in Python.
MIT License
192 stars 56 forks source link

does not support unicode characters #40

Closed TorhamDev closed 5 months ago

TorhamDev commented 2 years ago

hi, I have been using this library for some time and this library has difficulty reading Unicode characters.

For example, in languages like fa, ko, ar and el, we get the following error :

raise MalformedFileError('The file does not have a valid format') webvtt.errors.MalformedFileError: The file does not have a valid format

I do a lot of work so that I can finally use the subtitles of these languages with this library.

Can this problem be solved? :)

glut23 commented 5 months ago

Hi @TorhamDev apologies for the very late reply. I tested caption files in all those languages that you listed and the library seems to be fine not raising any exception. Please give a try with the latest version 0.5.0. I am closing this issue. Open a new issue if needed providing a faulty captions file for evaluation. Thanks!

TorhamDev commented 5 months ago

Hi @glut23 :), that problem was from my first job as backend dev XD, I have no idea what they are doing there rn. but thank u so much for ur response! and I'm sorry for my bad issue I should have given more information.