KBlixt / subcleaner

removes ads from subtitle files cleanly.
284 stars 12 forks source link

New Regex to add to english (or others) config #52

Closed Zxurian closed 8 months ago

Zxurian commented 8 months ago

Found the following ads in some subs that Bazarr was pulling in

          |     [---------Removed Blocks----------]
          |     33
          |     00:01:19,000 --> 00:01:25,074
          |     ENJOY ALL VOD IN HIGH QUALITY @ 4KVOD.TV
          |     GET LIVE TV,MOVIES, SHOWS IN ONE PACKAGE
          |     reasons: (en_purge6, global_purge2, global_warn1)
          |     
          |     1223
          |     01:36:55,305 --> 01:37:55,664
          |     Watch Movies, TV Series and Live Sports
          |     Signup Here -> WWW.ADMIT1.APP
          |     reasons: (en_purge6, global_warn1, global_warn1, close_to_end)
          |     [---------------------------------]

Created the following regex and tested it it works to remove. Probably a good idea to add to english config. en_purge6: \b(admit1\.app|4kvod\.tv)\b

KBlixt commented 8 months ago

I've added it but the 33 block seems to get removed by global_purge2 already? 🤔

Have you seen som instances where "something.tv" isn't removed?

Zxurian commented 8 months ago

Interesting. I had it running with Bazarr on a custom post-processing command, however subs were still coming in with those two ads. It wasn't until I edited english config to add extra regex were they removed, re-ran it against my libraries and ~300 subs were updated.