guessit-io / guessit

GuessIt is a python library that extracts as much information as possible from a video filename.
https://guessit-io.github.io/guessit
GNU Lesser General Public License v3.0
824 stars 92 forks source link

Issue matching "Gold Rush: White Water" #765

Open Sparhawk76 opened 8 months ago

Sparhawk76 commented 8 months ago

I am a medusa user and am having issues where it is getting confused between Gold Rush: White Water and Gold Rush.

When it searches for an episode of Gold Rush: White Water, and a file is found, the filename is sent through guessit and it returns the show name "Gold Rush", confusing medusa and resulting in no episode being downloaded.

For more information my reply to this bug report contains logs showing this problem. https://github.com/pymedusa/Medusa/issues/11644

VeNoMouS commented 8 months ago

Appears to parse fine?

>>> from guessit import guessit
>>> print(json.dumps(guessit('Gold.Rush.White.Water.S07E01.1080p.HEVC.x265-MeGusta.mkv'), indent=4))
{
    "title": "Gold Rush White Water",
    "season": 7,
    "episode": 1,
    "screen_size": "1080p",
    "video_codec": "H.265",
    "video_profile": "High Efficiency Video Coding",
    "release_group": "MeGusta",
    "container": "mkv",
    "mimetype": "video/x-matroska",
    "type": "episode"
}
>>>
>>> import guessit
>>> guessit.__version__
'3.7.1'
>>>
Sparhawk76 commented 5 months ago

Getting the same error now for episodes of Gold Rush: Parkers Trail. In the log below it finds a file named "Gold.Rush.Parkers.Trail.S07E01.1080p.HEVC.x265-MeGusta" (bottom line of log), then you can see it gets confused and at the top two lines of the log it thinks it's found an episode of the show "Gold Rush".

As I stated before I'm just a user of medusa, with very little python knowledge. I'm not the only medusa user seeing this error.

Log Snippet:

2024-06-02 17:03:44 DEBUG    FORCEDSEARCHQUEUE-BACKLOG-379221 :: [DigitalCore-JKT] :: [34a67cf] No episodes [1] of season 7 are needed with this quality for Gold Rush
2024-06-02 17:03:44 DEBUG    FORCEDSEARCHQUEUE-BACKLOG-379221 :: [DigitalCore-JKT] :: [34a67cf] 208111: 'Gold Rush' S07E01 status is: 'Downloaded'. Ignoring result with quality '1080p WEB-DL'. Reason: Existing quality is already a preferred quality. Ignoring new quality
2024-06-02 17:03:44 DEBUG    FORCEDSEARCHQUEUE-BACKLOG-379221 :: [DigitalCore-JKT] :: [34a67cf] 208111: Allowed, Preferred = [ SDTV, SD DVD, 720p HDTV, 1080p HDTV, 720p WEB-DL, 1080p WEB-DL, 720p BluRay, 1080p BluRay ] [ 720p HDTV, 1080p HDTV, 720p WEB-DL, 1080p WEB-DL, 720p BluRay, 1080p BluRay ] Found = [ 1080p WEB-DL ]
2024-06-02 17:03:44 INFO     FORCEDSEARCHQUEUE-BACKLOG-379221 :: [DigitalCore-JKT] :: [34a67cf] Quality of Gold.Rush.Parkers.Trail.S07E01.1080p.WEB.h264-EDITH is 1080p WEB-DL
2024-06-02 17:03:44 DEBUG    FORCEDSEARCHQUEUE-BACKLOG-379221 :: [DigitalCore-JKT] :: [34a67cf] Transaction with 6 queries executed
2024-06-02 17:03:44 DEBUG    FORCEDSEARCHQUEUE-BACKLOG-379221 :: [DigitalCore-JKT] :: [34a67cf] Found single episode result Gold.Rush.Parkers.Trail.S07E01.1080p.HEVC.x265-MeGusta at http://10.69.0.1:9117/dl/digitalcore/?jackett_apikey=**********&path=Q2ZESjhHYmY3RGpaNXhaQWpWOW9nUHQ5bEh5MEZuMy04cnEwYVFpR052SDRiYWdXMkMxTFFmVUVDZjdWcEVrZXRJbzFWMEVRcTJfTHNGS19jbExaV2pOT19HY2o2TjZGTFJaS0pNMmNlT1NXQVF6d08xeUFQbWFwY3hJUTdFOTVwc2ZlS3pYbFlvU0lWTTFjUVBQc3RpN0ZXMHJtZzk0Xy1GZXh3a2dHaWc0WXRWVmg0T1ZTdXZBakh0N1k4Q0JyN2diSU1R&file=Gold.Rush.Parkers.Trail.S07E01.1080p.HEVC.x265-MeGusta
VeNoMouS commented 5 months ago

Gold.Rush.Parkers.Trail.S07E01.1080p.HEVC.x265-MeGusta

>>> from guessit import guessit
>>> import json
>>> print(json.dumps(guessit('Gold.Rush.Parkers.Trail.S07E01.1080p.HEVC.x265-MeGusta'), indent=4))
{
    "title": "Gold Rush Parkers Trail",
    "season": 7,
    "episode": 1,
    "screen_size": "1080p",
    "video_codec": "H.265",
    "video_profile": "High Efficiency Video Coding",
    "release_group": "MeGusta",
    "type": "episode"
}
>>>

As you can see it parses the show title fine, the issue will be with how medusa is looking for the string "Gold Rush"