stashapp / CommunityScrapers

This is a public repository containing scrapers created by the Stash Community.
https://stashapp.github.io/CommunityScrapers/
GNU Affero General Public License v3.0

Broken Scrapers #123

Closed by bnkai 4 months ago

bnkai commented 4 years ago

Any issues with scrapers not working should be mentioned here. The name of the scraper and the xpath or part that is not working would be appreciated.


Known Issues

updated 2022-09-25

ferengi82 commented 3 years ago

ThePornDB.yml: the metadataapi.net API changed, "parse" no longer delivers tags. If I understand it right, you first use parse to find the scene, then use the id / url to get the rest of the scene data.

The scraper was updated 4 days ago #509 for the tag issue, are you sure you have latest version?

Sorry, somehow I missed the "The" and only saw PornDB.yml.

jb19774 commented 3 years ago

Clips4Sale has changed the date format on their sites to DD/MM/YY and so the date parsing is now broken.
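A minimal Python sketch of handling the new format, assuming the site now emits dates such as 25/09/21 (DD/MM/YY). The function name is hypothetical and this is not the scraper's actual parsing code, which is not shown in this thread.

```python
from datetime import datetime


def normalize_clip_date(raw: str) -> str:
    """Convert a DD/MM/YY string to ISO YYYY-MM-DD.

    Hypothetical helper for illustration; %y maps two-digit years
    00-68 to 2000-2068 and 69-99 to 1969-1999.
    """
    return datetime.strptime(raw.strip(), "%d/%m/%y").strftime("%Y-%m-%d")


print(normalize_clip_date("25/09/21"))  # 2021-09-25
```

Day-first parsing matters here: a month-first format would either fail on days above 12 or silently swap day and month.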

Jerrk commented 3 years ago

javdb is currently broken. This was confirmed by bnkai on Discord, and I'm just posting it here for posterity.

nerfdaderp commented 3 years ago

Thirdmovies.com does not seem to return any data. ztod, which is contained within the same scraper, still seems to work without issue.

OGustavo30 commented 3 years ago

RealityKings is not working (with Brazzers.yml); it returns an error: exec: "google-chrome": executable file not found in %PATH%. VixenNetwork.yml is also not working (I don't know how to use the .py provided).

bnkai commented 3 years ago

Neither of the scrapers mentioned is broken. https://github.com/stashapp/CommunityScrapers#communityscrapers provides instructions for both cases (CDP setup and where to put the scraper files). For more info, either have a look at the in-app help section of stash or visit the discord channel.

ddragon3 commented 3 years ago

The MyDirtyHobby scraper is not returning results. When I click the scrape button next to the URL, it spins, then nothing happens.

malibustacynewhat commented 3 years ago

transangels and loveherfeet no longer scraping cover images

bnkai commented 3 years ago

Can you provide a URL where loveherfeet doesn't work? The image part seems fine from a few scenes I tried.

malibustacynewhat commented 3 years ago

https://www.loveherfeet.com/tour/trailers/Did-you-post-my-feet-pictures-kayla-kayden-footjob.html
https://www.loveherfeet.com/tour/trailers/Aunt-Lexis-feet-lexi-luna-footjob.html
https://www.loveherfeet.com/tour/trailers/Suck-My-Naughty-Feet-Underneath-The-Tree-Violet-Starr-Feet.html

These all missed the image when scraped.

bnkai commented 3 years ago

All of the above work fine for me. Can you double check that you have the latest version of the loveherfeet scraper (# Last Updated May 16, 2021)?

malibustacynewhat commented 3 years ago

That was the problem, I missed the updated scraper. Sorry and thanks, bnkai

nobofernandez commented 3 years ago

When loading stash with the latest scrapers I see these errors.

ERRO[2021-07-01 11:11:58] Error loading scraper <path>\HelixStudios.yml: invalid post-process action
ERRO[2021-07-01 11:11:58] Error loading scraper <path>\MaxHardcore.yml: invalid post-process action

Might not be the right thread but figured it was worth pointing out.

peolic commented 3 years ago

@nobofernandez some scrapers require development releases of Stash, so this is expected. v0.8.0 should be released soon, when you update it will clear those errors.

PhantomEight commented 3 years ago

TeamskeetAPI.py - Seems to be an issue with some date conversion. I'm trying to see if I can teach myself python and figure it out, but I've always coded for fun so I don't know how far I'll get.

ERRO[2021-07-01 21:07:12] scraper: Asking the API...
ERRO[2021-07-01 21:07:12] scraper: Traceback (most recent call last):
ERRO[2021-07-01 21:07:12] scraper:   File "C:\Users\<user>\.stash\scrapers\TeamskeetAPI.py", line 90, in <module>
ERRO[2021-07-01 21:07:12] scraper:     date = datetime.strptime(scene_api_json.get(
ERRO[2021-07-01 21:07:12] scraper:   File "C:\Users\<user>\AppData\Local\Programs\Python\Python39\lib\_strptime.py", line 568, in _strptime_datetime
ERRO[2021-07-01 21:07:12] scraper:     tt, fraction, gmtoff_fraction = _strptime(data_string, format)
ERRO[2021-07-01 21:07:12] scraper:   File "C:\Users\<user>\AppData\Local\Programs\Python\Python39\lib\_strptime.py", line 349, in _strptime
ERRO[2021-07-01 21:07:12] scraper:     raise ValueError("time data %r does not match format %r" %
ERRO[2021-07-01 21:07:12] scraper: ValueError: time data '2017-08-25T18:00:00-04:00' does not match format '%Y-%m-%dT%H:%M:%S.%f%z'
ERRO[2021-07-01 21:07:12] could not unmarshal json: EOF
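
The ValueError in the traceback comes from the timestamp having no fractional seconds, while the format string requires `.%f`. A minimal sketch of one possible workaround, assuming the API returns timestamps either with or without fractional seconds; the function name and the format list are hypothetical and not taken from TeamskeetAPI.py.

```python
from datetime import datetime

# Formats tried in order. The first is the one shown in the traceback;
# the second is the same without fractional seconds (an assumption about
# what the API may return).
DATE_FORMATS = (
    "%Y-%m-%dT%H:%M:%S.%f%z",
    "%Y-%m-%dT%H:%M:%S%z",
)


def parse_api_date(raw: str) -> str:
    """Return the date part (YYYY-MM-DD) of an API timestamp."""
    for fmt in DATE_FORMATS:
        try:
            return datetime.strptime(raw, fmt).strftime("%Y-%m-%d")
        except ValueError:
            continue  # try the next known format
    raise ValueError(f"unrecognised date format: {raw!r}")


print(parse_api_date("2017-08-25T18:00:00-04:00"))  # 2017-08-25
```

Trying a short list of known formats keeps the failure explicit if the API changes again, rather than silently returning a wrong date.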

bnkai commented 3 years ago

Fit18 is broken, as reported in #700. A python scraper from #702 should fix the issue.

jthrow0451 commented 2 years ago

nympho.yml is not working for scraping descriptions (at least on swallowed.com). I can't work out the XPath for it.

An example URL can be found here: https://tour.swallowed.com/view/56/cock-gargling-fun-with-morgan-karlee-and-aj

The text seems to be separated into 3 divs.
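
A rough Python illustration of collecting a description that is split across several divs, assuming a hypothetical class name `desc` and a made-up sample snippet (the real markup of the page is not shown in this thread). It uses the standard library's ElementTree, which only handles well-formed markup, as a stand-in for whatever XPath engine the scraper uses.

```python
import xml.etree.ElementTree as ET

# Made-up, well-formed sample; the class name "desc" is an assumption.
SAMPLE = """
<div class="content">
  <div class="desc">First part.</div>
  <div class="desc">Second <b>part</b>.</div>
  <div class="desc">Third part.</div>
</div>
""".strip()


def join_description(root: ET.Element) -> str:
    """Concatenate the text of every matching div into one string."""
    parts = []
    for div in root.findall(".//div[@class='desc']"):
        # itertext() also picks up text inside nested tags such as <b>.
        text = " ".join("".join(div.itertext()).split())
        if text:
            parts.append(text)
    return " ".join(parts)


print(join_description(ET.fromstring(SAMPLE)))
```

The idea is to select all matching nodes rather than a single one and join their text afterwards.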

brnkaj commented 2 years ago

JacquieEtMichelTV.py has some errors.

[Scrape / JacquieEtMicaelTV] IndexError: list index out of range
[Scrape / JacquieEtMicaelTV]     details = tree.xpath("//div[@class='video-description']/p")[0]
[Scrape / JacquieEtMicaelTV]   File "JacquieEtMichelTV.py", line 51, in <module>
[Scrape / JacquieEtMicaelTV] Traceback (most recent call last):
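
The IndexError in this log is what happens when an XPath query matches nothing and the result list is indexed directly. A minimal sketch of guarding against that, assuming the query simply returns an empty list on some pages; `first_or_none` is a hypothetical helper, and the standard library's ElementTree stands in here for the `tree.xpath` call in the log.

```python
import xml.etree.ElementTree as ET
from typing import Optional, Sequence, TypeVar

T = TypeVar("T")


def first_or_none(matches: Sequence[T]) -> Optional[T]:
    """Return the first match, or None when the query matched nothing."""
    return matches[0] if matches else None


# Made-up page that lacks the 'video-description' div.
page = ET.fromstring("<html><body><div class='other'><p>text</p></div></body></html>")
matches = page.findall(".//div[@class='video-description']/p")

node = first_or_none(matches)
details = "".join(node.itertext()).strip() if node is not None else None
print(details)  # None
```

Returning None lets the scraper skip the details field on pages where the element is missing instead of aborting the whole scrape.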

When I change line 51 from [0] to [1] it sometimes works. However, sometimes it doesn't, and the log can show 2 errors:

adudeirl commented 2 years ago

I reported this in the discord a few days ago, but a few of the sites covered by the AdultEmpireCash scraper (MyPervyFamily, TouchMyWife, and FilthyKings at least) changed layouts and url structures, so they no longer work.

bnkai commented 2 years ago

Tushy Raw is broken. The page now loads some JS first, so the python scraper doesn't have access to the needed data.