guessit-io / guessit

GuessIt is a python library that extracts as much information as possible from a video filename.
https://guessit-io.github.io/guessit
GNU Lesser General Public License v3.0

Incorrect release group guess #718

Closed by duramato 1 year ago

duramato commented 2 years ago

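A language gets detected as the release group. The filename lists three languages before SUBS, so I would expect three "subtitle_language" values (Italian, English and Portuguese), but POR ends up as "release_group" instead.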

For: Gloria.S01E01-10.DLMux.1080p.E-AC3-AC3.ITA.ENG.POR.SUBS
GuessIt found: {
    "title": "Gloria",
    "season": 1,
    "episode": [
        1,
        2,
        3,
        4,
        5,
        6,
        7,
        8,
        9,
        10
    ],
    "source": "Web",
    "other": "Mux",
    "screen_size": "1080p",
    "audio_codec": "Dolby Digital",
    "language": "Italian",
    "subtitle_language": "English",
    "release_group": "POR",
    "type": "episode"
}
+-------------------------------------------------------+
+                   GuessIt 3.4.2                       +
+-------------------------------------------------------+
+                   Rebulk 3.1.0                        +
+-------------------------------------------------------+
|      Please report any bug or feature request at      |
|     https://github.com/guessit-io/guessit/issues.     |
+-------------------------------------------------------+
Toilal commented 1 year ago

This is because por is in the common_words configuration setting, as it's a common Spanish word (por favor :p). You can remove this word in your own configuration, but I won't do it upstream because it breaks a test:

? Por Trece Razones - Temporada 2 [HDTV 720p][Cap.201][AC3 5.1 Castellano]/Por Trece Razones 2x01 [des202].mkv
: title: Por Trece Razones
  season: 2
  source: HDTV
  screen_size: 720p
  episode: 1
  audio_codec: Dolby Digital
  audio_channels: '5.1'
  language: Catalan
  release_group: des202
  container: mkv
  type: episode
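
For anyone who wants to try the local workaround mentioned above, here is a minimal sketch of dropping por from a common_words list. It works on a plain dictionary and does not call guessit. The `advanced_config` -> `common_words` layout, the short word list, and the helper name `without_common_word` are assumptions made for illustration, so check them against the options file shipped with your guessit version.

```python
import copy
import json

# Hypothetical excerpt of a guessit-style configuration. The key layout
# and the word list are illustrative assumptions, not the shipped defaults.
DEFAULT_CONFIG = {
    "advanced_config": {
        "common_words": ["de", "it", "no", "por"],
    }
}


def without_common_word(config, word):
    """Return a deep copy of config with word removed from common_words.

    The comparison is case-insensitive and the input is left untouched.
    """
    patched = copy.deepcopy(config)
    advanced = patched.setdefault("advanced_config", {})
    words = advanced.get("common_words", [])
    advanced["common_words"] = [w for w in words if w.lower() != word.lower()]
    return patched


# Build the customised configuration and serialise it as JSON text.
custom_config = without_common_word(DEFAULT_CONFIG, "por")
custom_json = json.dumps(custom_config, indent=2)
print(custom_json)
```

How the resulting JSON is loaded (for example through a user configuration file), and whether list settings replace the defaults or are merged with them, depends on the guessit version, so verify that in the guessit configuration documentation. As the test case above shows, removing por can change how titles such as Por Trece Razones are guessed.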