Open qgustavor opened 2 years ago
About finding a better fingerprint storage format, I did some tests:
Here's my proposal:
[...[tcode, hcode]]
.flat()
, so [tcode, hcode, tcode, hcode...]
Uint32Array
DOUKI
Decoding follow the following:
DOUKI
, reject if not, then drop those bytesWhy caring about adding metadata: because it allow versioning, it allows changing the fingerprinter parameters in case in future someone finds better values than the current ones, it allow adding info about the file used to generate the fingerprints.
console.log
inside the library[[tcode, hcode], [tcode, hcode], ...]
to{[hcode]: [tcodes, tcodes]}
to compress repeated hcodes and make fingerprint matching faster?.douki
for it?[x] Create function to merge the synchronized subtitles with existent ones that can handle overlapping with multiple modes
translation
,translation-overlap
andkaraoke
then lines withtranslation
style would be switched totranslation-overlap
when those overlaps with the existing subtitles and, since there is nokaraoke-overlap
style defined, lines withkaraoke
style would be removedMost of what is needed, including some of the ideas above, were implemented here: https://github.com/qgustavor/subtitle-tools