PAhelper / PhoenixAdult.bundle

Plex Meta-Data Agent for scene videos from multiple adult sites

Index out of range on some She's New scenes #283

Closed ordinarygulp closed 5 years ago

ordinarygulp commented 5 years ago
2019-07-14 20:17:44,726 (1481999ed700) :  INFO (__init__:46) - *******MEDIA TITLE****** Ssn 16 06 12 Zoe Doll
2019-07-14 20:17:44,726 (1481999ed700) :  INFO (__init__:53) - Getting Search Settings for: Ssn 16 06 12 Zoe Doll
2019-07-14 20:17:44,726 (1481999ed700) :  INFO (PAsearchSites:885) - mediaTitle w/ possible abbrieviation: Ssn 16 06 12 Zoe Doll
2019-07-14 20:17:44,737 (1481999ed700) :  INFO (PAsearchSites:1045) - mediaTitle w/ possible abbrieviation fixed: ShesNew 16 06 12 Zoe Doll
2019-07-14 20:17:44,738 (1481999ed700) :  INFO (PAsearchSites:1060) - ^^^^^^^ siteID: 195
2019-07-14 20:17:44,739 (1481999ed700) :  INFO (PAsearchSites:1062) - ^^^^^^^ Shortening Title
2019-07-14 20:17:44,739 (1481999ed700) :  INFO (PAsearchSites:1063) - shesnew  vs shes new
2019-07-14 20:17:44,739 (1481999ed700) :  INFO (PAsearchSites:1069) - 2
2019-07-14 20:17:44,739 (1481999ed700) :  INFO (PAsearchSites:1072) - 3
2019-07-14 20:17:44,739 (1481999ed700) :  INFO (PAsearchSites:1079) - 6
2019-07-14 20:17:44,740 (1481999ed700) :  INFO (PAsearchSites:1091) - searchTitle (before date processing): 16 06 12 Zoe Doll
2019-07-14 20:17:44,740 (1481999ed700) :  INFO (__init__:63) - Search Title: Zoe Doll
2019-07-14 20:17:44,740 (1481999ed700) :  INFO (__init__:67) - Search Date: 2016-06-12
2019-07-14 20:17:44,740 (1481999ed700) :  INFO (__init__:72) - Zoe%20Doll
2019-07-14 20:17:44,743 (1481999ed700) :  DEBUG (networking:161) - Fetching 'https://www.teamskeet.com/t1/search/results/?query=Zoe%20Doll' from the HTTP cache
2019-07-14 20:17:44,752 (1481999ed700) :  DEBUG (networking:161) - Fetching 'https://www.teamskeet.com/t1/trailer/view/zoes_fantasy/zoe_doll?thumb=02.jpg' from the HTTP cache
2019-07-14 20:17:44,763 (1481999ed700) :  CRITICAL (agentkit:1014) - Exception in the search function of agent named 'PhoenixAdult', called with keyword arguments {'year': None, 'id': '106284', 'name': 'Ssn 16 06 12 Zoe Doll'} (most recent call last):
  File "/usr/lib/plexmediaserver/Resources/Plug-ins-4b7ace214/Framework.bundle/Contents/Resources/Versions/2/Python/Framework/api/agentkit.py", line 1007, in _search
    agent.search(*f_args, **f_kwargs)
  File "/config/Library/Application Support/Plex Media Server/Plug-ins/PhoenixAdult.bundle/Contents/Code/__init__.py", line 378, in search
    results = PAsearchSites.siteTeamSkeet.search(results,encodedTitle,title,searchTitle,siteNum,lang,searchByDateActor,searchDate, searchSiteID)
  File "/config/Library/Application Support/Plex Media Server/Plug-ins/PhoenixAdult.bundle/Contents/Code/siteTeamSkeet.py", line 11, in search
    titleNoFormatting = scenePage.xpath('//title')[0].text_content().split(" | ")[1]
  File "/usr/lib/plexmediaserver/Resources/Plug-ins-4b7ace214/Framework.bundle/Contents/Resources/Versions/2/Python/Framework/code/sandbox.py", line 108, in <lambda>
    _getitem_           = lambda x, y: x.__getitem__(y),
IndexError: list index out of range
2019-07-14 20:20:18,232 (1481999ed700) :  INFO (__init__:46) - *******MEDIA TITLE****** Ssn 14 01 10 Kiara Knight
2019-07-14 20:20:18,233 (1481999ed700) :  INFO (__init__:53) - Getting Search Settings for: Ssn 14 01 10 Kiara Knight
2019-07-14 20:20:18,233 (1481999ed700) :  INFO (PAsearchSites:885) - mediaTitle w/ possible abbrieviation: Ssn 14 01 10 Kiara Knight
2019-07-14 20:20:18,243 (1481999ed700) :  INFO (PAsearchSites:1045) - mediaTitle w/ possible abbrieviation fixed: ShesNew 14 01 10 Kiara Knight
2019-07-14 20:20:18,244 (1481999ed700) :  INFO (PAsearchSites:1060) - ^^^^^^^ siteID: 195
2019-07-14 20:20:18,245 (1481999ed700) :  INFO (PAsearchSites:1062) - ^^^^^^^ Shortening Title
2019-07-14 20:20:18,245 (1481999ed700) :  INFO (PAsearchSites:1063) - shesnew  vs shes new
2019-07-14 20:20:18,245 (1481999ed700) :  INFO (PAsearchSites:1069) - 2
2019-07-14 20:20:18,245 (1481999ed700) :  INFO (PAsearchSites:1072) - 3
2019-07-14 20:20:18,245 (1481999ed700) :  INFO (PAsearchSites:1079) - 6
2019-07-14 20:20:18,246 (1481999ed700) :  INFO (PAsearchSites:1091) - searchTitle (before date processing): 14 01 10 Kiara Knight
2019-07-14 20:20:18,246 (1481999ed700) :  INFO (__init__:63) - Search Title: Kiara Knight
2019-07-14 20:20:18,246 (1481999ed700) :  INFO (__init__:67) - Search Date: 2014-01-10
2019-07-14 20:20:18,246 (1481999ed700) :  INFO (__init__:72) - Kiara%20Knight
2019-07-14 20:20:18,248 (1481999ed700) :  DEBUG (networking:166) - Requesting 'https://www.teamskeet.com/t1/search/results/?query=Kiara%20Knight'
2019-07-14 20:20:19,583 (1481999ed700) :  DEBUG (networking:166) - Requesting 'https://www.teamskeet.com/t1/trailer/view/get_caught_get_boned/kiera_knight?thumb=05.jpg'
2019-07-14 20:20:20,718 (1481999ed700) :  CRITICAL (agentkit:1014) - Exception in the search function of agent named 'PhoenixAdult', called with keyword arguments {'openSubtitlesHash': '1979ba9e0c79e0ab', 'name': 'Ssn 14 01 10 Kiara Knight', 'filename': '%2Fmisc%2FSites%2FShesNew%2Fssn%2E14%2E01%2E10%2Ekiara%2Eknight%2Emp4', 'plexHash': 'e8059658abea1107d59684db9277a469d9c3f358', 'duration': '1033957', 'id': '106203'} (most recent call last):
  File "/usr/lib/plexmediaserver/Resources/Plug-ins-4b7ace214/Framework.bundle/Contents/Resources/Versions/2/Python/Framework/api/agentkit.py", line 1007, in _search
    agent.search(*f_args, **f_kwargs)
  File "/config/Library/Application Support/Plex Media Server/Plug-ins/PhoenixAdult.bundle/Contents/Code/__init__.py", line 378, in search
    results = PAsearchSites.siteTeamSkeet.search(results,encodedTitle,title,searchTitle,siteNum,lang,searchByDateActor,searchDate, searchSiteID)
  File "/config/Library/Application Support/Plex Media Server/Plug-ins/PhoenixAdult.bundle/Contents/Code/siteTeamSkeet.py", line 11, in search
    titleNoFormatting = scenePage.xpath('//title')[0].text_content().split(" | ")[1]
  File "/usr/lib/plexmediaserver/Resources/Plug-ins-4b7ace214/Framework.bundle/Contents/Resources/Versions/2/Python/Framework/code/sandbox.py", line 108, in <lambda>
    _getitem_           = lambda x, y: x.__getitem__(y),
IndexError: list index out of range
2019-07-14 20:21:47,837 (1481999ed700) :  INFO (__init__:46) - *******MEDIA TITLE****** Ssn 13 09 06 Ariana Marie
2019-07-14 20:21:47,837 (1481999ed700) :  INFO (__init__:53) - Getting Search Settings for: Ssn 13 09 06 Ariana Marie
2019-07-14 20:21:47,837 (1481999ed700) :  INFO (PAsearchSites:885) - mediaTitle w/ possible abbrieviation: Ssn 13 09 06 Ariana Marie
2019-07-14 20:21:47,848 (1481999ed700) :  INFO (PAsearchSites:1045) - mediaTitle w/ possible abbrieviation fixed: ShesNew 13 09 06 Ariana Marie
2019-07-14 20:21:47,850 (1481999ed700) :  INFO (PAsearchSites:1060) - ^^^^^^^ siteID: 195
2019-07-14 20:21:47,850 (1481999ed700) :  INFO (PAsearchSites:1062) - ^^^^^^^ Shortening Title
2019-07-14 20:21:47,850 (1481999ed700) :  INFO (PAsearchSites:1063) - shesnew  vs shes new
2019-07-14 20:21:47,850 (1481999ed700) :  INFO (PAsearchSites:1069) - 2
2019-07-14 20:21:47,850 (1481999ed700) :  INFO (PAsearchSites:1072) - 3
2019-07-14 20:21:47,851 (1481999ed700) :  INFO (PAsearchSites:1079) - 6
2019-07-14 20:21:47,851 (1481999ed700) :  INFO (PAsearchSites:1091) - searchTitle (before date processing): 13 09 06 Ariana Marie
2019-07-14 20:21:47,851 (1481999ed700) :  INFO (__init__:63) - Search Title: Ariana Marie
2019-07-14 20:21:47,851 (1481999ed700) :  INFO (__init__:67) - Search Date: 2013-09-06
2019-07-14 20:21:47,851 (1481999ed700) :  INFO (__init__:72) - Ariana%20Marie
2019-07-14 20:21:47,855 (1481999ed700) :  DEBUG (networking:161) - Fetching 'https://www.teamskeet.com/t1/search/results/?query=Ariana%20Marie' from the HTTP cache
2019-07-14 20:21:47,861 (1481999ed700) :  DEBUG (networking:166) - Requesting 'https://www.teamskeet.com/t1/trailer/view/mister,_am_i_accepted_now/ariana_marie?thumb=04.jpg'
2019-07-14 20:21:48,664 (1481999ed700) :  INFO (siteTeamSkeet:12) - Result Title: Mister, Am I Accepted Now?
2019-07-14 20:21:48,664 (1481999ed700) :  INFO (siteTeamSkeet:15) - ID: www.teamskeet.com+t1+trailer+view+mister,_am_i_accepted_now+ariana_marie
2019-07-14 20:21:48,665 (1481999ed700) :  INFO (siteTeamSkeet:17) - 2013-08-19
2019-07-14 20:21:48,665 (1481999ed700) :  INFO (siteTeamSkeet:18) - www.teamskeet.com+t1+trailer+view+mister,_am_i_accepted_now+ariana_marie
2019-07-14 20:21:48,667 (1481999ed700) :  DEBUG (networking:166) - Requesting 'https://www.teamskeet.com/t1/trailer/view/lets_make_porn_babies/ariana_marie?thumb=04.jpg'
2019-07-14 20:21:49,808 (1481999ed700) :  CRITICAL (agentkit:1014) - Exception in the search function of agent named 'PhoenixAdult', called with keyword arguments {'year': None, 'id': '106189', 'name': 'Ssn 13 09 06 Ariana Marie'} (most recent call last):
  File "/usr/lib/plexmediaserver/Resources/Plug-ins-4b7ace214/Framework.bundle/Contents/Resources/Versions/2/Python/Framework/api/agentkit.py", line 1007, in _search
    agent.search(*f_args, **f_kwargs)
  File "/config/Library/Application Support/Plex Media Server/Plug-ins/PhoenixAdult.bundle/Contents/Code/__init__.py", line 378, in search
    results = PAsearchSites.siteTeamSkeet.search(results,encodedTitle,title,searchTitle,siteNum,lang,searchByDateActor,searchDate, searchSiteID)
  File "/config/Library/Application Support/Plex Media Server/Plug-ins/PhoenixAdult.bundle/Contents/Code/siteTeamSkeet.py", line 11, in search
    titleNoFormatting = scenePage.xpath('//title')[0].text_content().split(" | ")[1]
  File "/usr/lib/plexmediaserver/Resources/Plug-ins-4b7ace214/Framework.bundle/Contents/Resources/Versions/2/Python/Framework/code/sandbox.py", line 108, in <lambda>
    _getitem_           = lambda x, y: x.__getitem__(y),
IndexError: list index out of range
2019-07-14 20:36:01,537 (1481999ed700) :  INFO (__init__:46) - *******MEDIA TITLE****** Shesnew 13 11 29 Marina Angel
2019-07-14 20:36:01,538 (1481999ed700) :  INFO (__init__:53) - Getting Search Settings for: Shesnew 13 11 29 Marina Angel
2019-07-14 20:36:01,538 (1481999ed700) :  INFO (PAsearchSites:885) - mediaTitle w/ possible abbrieviation: Shesnew 13 11 29 Marina Angel
2019-07-14 20:36:01,548 (1481999ed700) :  INFO (PAsearchSites:1045) - mediaTitle w/ possible abbrieviation fixed: Shesnew 13 11 29 Marina Angel
2019-07-14 20:36:01,550 (1481999ed700) :  INFO (PAsearchSites:1060) - ^^^^^^^ siteID: 195
2019-07-14 20:36:01,550 (1481999ed700) :  INFO (PAsearchSites:1062) - ^^^^^^^ Shortening Title
2019-07-14 20:36:01,550 (1481999ed700) :  INFO (PAsearchSites:1063) - shesnew  vs shes new
2019-07-14 20:36:01,551 (1481999ed700) :  INFO (PAsearchSites:1069) - 2
2019-07-14 20:36:01,551 (1481999ed700) :  INFO (PAsearchSites:1072) - 3
2019-07-14 20:36:01,551 (1481999ed700) :  INFO (PAsearchSites:1079) - 6
2019-07-14 20:36:01,551 (1481999ed700) :  INFO (PAsearchSites:1091) - searchTitle (before date processing): 13 11 29 Marina Angel
2019-07-14 20:36:01,552 (1481999ed700) :  INFO (__init__:63) - Search Title: Marina Angel
2019-07-14 20:36:01,552 (1481999ed700) :  INFO (__init__:67) - Search Date: 2013-11-29
2019-07-14 20:36:01,552 (1481999ed700) :  INFO (__init__:72) - Marina%20Angel
2019-07-14 20:36:01,555 (1481999ed700) :  DEBUG (networking:161) - Fetching 'https://www.teamskeet.com/t1/search/results/?query=Marina%20Angel' from the HTTP cache
2019-07-14 20:36:01,562 (1481999ed700) :  DEBUG (networking:161) - Fetching 'https://join.teamskeet.com/signup/signup.php' from the HTTP cache
2019-07-14 20:36:01,567 (1481999ed700) :  CRITICAL (agentkit:1014) - Exception in the search function of agent named 'PhoenixAdult', called with keyword arguments {'year': None, 'id': '106124', 'name': 'Shesnew 13 11 29 Marina Angel'} (most recent call last):
  File "/usr/lib/plexmediaserver/Resources/Plug-ins-4b7ace214/Framework.bundle/Contents/Resources/Versions/2/Python/Framework/api/agentkit.py", line 1007, in _search
    agent.search(*f_args, **f_kwargs)
  File "/config/Library/Application Support/Plex Media Server/Plug-ins/PhoenixAdult.bundle/Contents/Code/__init__.py", line 378, in search
    results = PAsearchSites.siteTeamSkeet.search(results,encodedTitle,title,searchTitle,siteNum,lang,searchByDateActor,searchDate, searchSiteID)
  File "/config/Library/Application Support/Plex Media Server/Plug-ins/PhoenixAdult.bundle/Contents/Code/siteTeamSkeet.py", line 11, in search
    titleNoFormatting = scenePage.xpath('//title')[0].text_content().split(" | ")[1]
  File "/usr/lib/plexmediaserver/Resources/Plug-ins-4b7ace214/Framework.bundle/Contents/Resources/Versions/2/Python/Framework/code/sandbox.py", line 108, in <lambda>
    _getitem_           = lambda x, y: x.__getitem__(y),
IndexError: list index out of range
claygoldfinch commented 5 years ago

The agent doesn't scrape info from the TeamSkeet search page. Instead, it opens each result listed on the search page and scrapes info from the scene page itself. This allows the agent to do two things: scrape full scene titles (many are truncated on the search page), and use Date Match (dates are listed on each scene page, but not on the search page).

Sometimes the results on the search page don't have valid URLs attached to them. Occasionally, this is just a case of the webmaster not attaching a valid URL. In other cases, the scene page doesn't actually exist because the webmaster never created it.

I've added an exception to the agent so that it no longer errors out when it encounters these scenes. Additionally, I've added valid URLs for a few scenes.

  1. Resolved. Added an exception for "Zoe's Fantasy".
  2. Can't fully resolve. The scene page for "Get Caught Get Boned" redirects to a "join now" page; a valid scene page does not exist.
  3. Again, can't fully resolve. The scene page for "Let's Make Porn Babies" redirects to a "join now" page; a valid scene page does not exist.
  4. Resolved. Added an exception for "She Has Her Ways".
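
For anyone hitting the same traceback elsewhere: the actual change isn't shown in this thread, so the following is only a minimal sketch of one way to guard the failing line in `siteTeamSkeet.py`. It assumes the `IndexError` comes from the `.split(" | ")[1]` lookup when the fetched page (e.g. a "join now" redirect) has a `<title>` without a ` | ` separator. The helper names `parse_scene_title` and `collect_titles` and the sample title strings are hypothetical, not part of the agent.

```python
def parse_scene_title(page_title, separator=" | "):
    """Return the scene title from a page <title> string, or None.

    Assumption: as in the traceback, the scene title is the second
    separator-delimited field. Pages without that field (such as a
    signup redirect) yield None instead of raising IndexError.
    """
    parts = page_title.split(separator)
    if len(parts) < 2:
        return None
    return parts[1].strip()


def collect_titles(page_titles):
    """Keep only the titles that could be parsed, skipping the rest."""
    titles = []
    for page_title in page_titles:
        title = parse_scene_title(page_title)
        if title is None:
            # Invalid or missing scene page: skip this search result
            # rather than aborting the whole search.
            continue
        titles.append(title)
    return titles


if __name__ == "__main__":
    # Hypothetical <title> strings for illustration only.
    samples = [
        "TeamSkeet | Mister, Am I Accepted Now? | She's New",
        "Join Now",
    ]
    print(collect_titles(samples))
```

The same length check would apply to the `scenePage.xpath('//title')` result before taking `[0]`, since an empty list there raises the same error. Skipping a bad result lets the remaining search results still be scored, which matches the behaviour described above for scenes whose pages don't exist.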