Closed j-ktz closed 1 year ago
I will check it out soon.. Thanks for finding this
On Wed, 24 May 2023, 14:09 j-ktz, @.***> wrote:
Mandatory Describe the bug:
This issue seems to keep popping up for me. I copy/paste the title directly from browser, add ( ) around studio and year, but if there are " " in the title, it doesn't match. Specific Agent(s) Causing the Issue:
Fagalicious Index Site URL(s) Attempting to Match:
https://fagalicious.com/cutlersden-drew-sebastian-boy-david-where-he-belongs/ Log Attached:
com.plexapp.agents.Fagalicious.log https://github.com/CodyBerenson/PGMA-Modernized/files/11554356/com.plexapp.agents.Fagalicious.log Optional Screenshot(s) with all nudity redacted: Additional Context: Desktop (please complete the following information):
- OS: [e.g. iOS] Mac OS 12.6.3 (21G419)
- Browser [e.g. chrome, safari] Safari
- Version [e.g. 22]
— Reply to this email directly, view it on GitHub https://github.com/CodyBerenson/PGMA-Modernized/issues/245, or unsubscribe https://github.com/notifications/unsubscribe-auth/AKI3AKN36JSKSWEK2YZG3DTXHX257ANCNFSM6AAAAAAYNI4PHU . You are receiving this because you are subscribed to this thread.Message ID: @.***>
@j-ktz Hi, hope this finds you well. This doesn't have anything to do with the quotes. You haven't set your "Genre" parameter. When you open up the parameters, it may appear that "Purple" is selected, but it is just one of the list of values, and you actually have to make a selection.
In the log it says "None", and the agent fails with an error: SEARCH:: Error: <QUIT: Scraping Parameters Not Set Up!
Thanks @CodyBerenson! I tweaked the setting and got fagalicious to match a title successfully without quotes, but titles with "" still won't match. See new log.
com.plexapp.agents.Fagalicious.log
@j-ktz Sorry, matches perfectly on Windows. I can see where your log treats the quotes differently than mine.
@JPH71 Can you take a look at his log? Thanks!
Yeah, I'm on a Mac that remotes into Synology with Finder.
It also matches for me... I will look into this more carefully and find a way to sort this heifer out!!! once and for all....
Dear @JPH71 Please read and follow the rules about posting naughty pictures or you will get many, many whacks on the arse.
You have been warned
The management.
Oh shit... I forgot!!!!
Since you are a newbie, the management has redacted your naughty vial photo and reposted it for you.
Sorry man... I shouldn't post when I get back from the Pub..... Just quickly ran a text and did a snip..... Doh!!!!
On Sat, 27 May 2023 at 00:44, Cody Berenson @.***> wrote:
Since you are a newbie, the management has redacted your naughty vial photo and reposted it for you.
— Reply to this email directly, view it on GitHub https://github.com/CodyBerenson/PGMA-Modernized/issues/245#issuecomment-1565074131, or unsubscribe https://github.com/notifications/unsubscribe-auth/AKI3AKOXE355BZQTRF4VPTDXIE57FANCNFSM6AAAAAAYNI4PHU . You are receiving this because you were mentioned.Message ID: @.***>
ran a test!!!! I am stopping emailing now!
On Sat, 27 May 2023 at 00:46, Jason Hudson @.***> wrote:
Sorry man... I shouldn't post when I get back from the Pub..... Just quickly ran a text and did a snip..... Doh!!!!
On Sat, 27 May 2023 at 00:44, Cody Berenson @.***> wrote:
Since you are a newbie, the management has redacted your naughty vial photo and reposted it for you.
— Reply to this email directly, view it on GitHub https://github.com/CodyBerenson/PGMA-Modernized/issues/245#issuecomment-1565074131, or unsubscribe https://github.com/notifications/unsubscribe-auth/AKI3AKOXE355BZQTRF4VPTDXIE57FANCNFSM6AAAAAAYNI4PHU . You are receiving this because you were mentioned.Message ID: @.***>
No worries. I think you'd approve of the family friendly image substitution.
LOL
On Sat, 27 May 2023 at 00:48, Cody Berenson @.***> wrote:
No worries. I think you'd approve of the family friendly image substitution.
— Reply to this email directly, view it on GitHub https://github.com/CodyBerenson/PGMA-Modernized/issues/245#issuecomment-1565076046, or unsubscribe https://github.com/notifications/unsubscribe-auth/AKI3AKK36X3BLD2YJYAD2F3XIE6NJANCNFSM6AAAAAAYNI4PHU . You are receiving this because you were mentioned.Message ID: @.***>
It's an image to frighten the dead....
On Sat, 27 May 2023 at 00:59, Jason Hudson @.***> wrote:
LOL
On Sat, 27 May 2023 at 00:48, Cody Berenson @.***> wrote:
No worries. I think you'd approve of the family friendly image substitution.
— Reply to this email directly, view it on GitHub https://github.com/CodyBerenson/PGMA-Modernized/issues/245#issuecomment-1565076046, or unsubscribe https://github.com/notifications/unsubscribe-auth/AKI3AKK36X3BLD2YJYAD2F3XIE6NJANCNFSM6AAAAAAYNI4PHU . You are receiving this because you were mentioned.Message ID: @.***>
night night....
On Sat, 27 May 2023 at 01:00, Jason Hudson @.***> wrote:
It's an image to frighten the dead....
On Sat, 27 May 2023 at 00:59, Jason Hudson @.***> wrote:
LOL
On Sat, 27 May 2023 at 00:48, Cody Berenson @.***> wrote:
No worries. I think you'd approve of the family friendly image substitution.
— Reply to this email directly, view it on GitHub https://github.com/CodyBerenson/PGMA-Modernized/issues/245#issuecomment-1565076046, or unsubscribe https://github.com/notifications/unsubscribe-auth/AKI3AKK36X3BLD2YJYAD2F3XIE6NJANCNFSM6AAAAAAYNI4PHU . You are receiving this because you were mentioned.Message ID: @.***>
LOL. I've missed you boys.
😉
On Sat, 27 May 2023, 14:30 j-ktz, @.***> wrote:
LOL. I've missed you boys.
— Reply to this email directly, view it on GitHub https://github.com/CodyBerenson/PGMA-Modernized/issues/245#issuecomment-1565392632, or unsubscribe https://github.com/notifications/unsubscribe-auth/AKI3AKIC56IECUE7QNPCKCTXIHXU3ANCNFSM6AAAAAAYNI4PHU . You are receiving this because you were mentioned.Message ID: @.***>
Hey all. I'm seeing this issue as well on Windows - just thought I'd add my 2 pence. I don't think it happens ALL the time, either. I've matched a bunch of "double-quote" titles before. Here's an example of one that isn't matching as of this morning:
Filename: (Men.com) - Colton Reece raw fucks the cum out of Joey Mills in “Accidental Pornstar” (2023).mp4 URL: https://fagalicious.com/men-com-colton-reece-fucks-joey-mills-accidental-pornstar/
Portion of log that shows error: (can give whole log if needed) - maybe cuz there are several titles that start with "Colton Reece raw" ?? I dunno - just thinking out loud.
2023-06-06 09:29:03,073 (2724) : INFO (logkit:16) - Fagalicious - AGENT :: Original Search Query Colton Reece raw fucks the cum out of Joey Mills in "Accidental Pornstar" 2023-06-06 09:29:03,073 (2724) : INFO (logkit:16) - Fagalicious - AGENT :: Search Query colton reece raw fucks the cum out of joey mills in "accidental pornstar" 2023-06-06 09:29:03,073 (2724) : INFO (logkit:16) - Fagalicious - AGENT :: Search Query Search Query Length: "16 <= 20" 2023-06-06 09:29:03,073 (2724) : INFO (logkit:16) - Fagalicious - AGENT :: Search Query Shorten Search Query: "colton reece raw" 2023-06-06 09:29:03,073 (2724) : INFO (logkit:16) - Fagalicious - AGENT :: Returned Search Query colton+reece+raw 2023-06-06 09:29:03,073 (2724) : INFO (logkit:16) - Fagalicious - -------------------------------------------------------------------------------------------------------------------------------------------- 2023-06-06 09:29:03,089 (2724) : INFO (logkit:16) - Fagalicious - SEARCH:: Search Query https://fagalicious.com/search/colton+reece+raw 2023-06-06 09:29:03,105 (2724) : DEBUG (networking:143) - Requesting 'https://fagalicious.com/search/colton+reece+raw' 2023-06-06 09:29:03,214 (2724) : ERROR (networking:196) - Error opening URL 'https://fagalicious.com/search/colton+reece+raw' 2023-06-06 09:29:03,214 (2724) : ERROR (logkit:22) - Fagalicious - SEARCH:: Error: Search Query did not pull any results: HTTP Error 403: Forbidden 2023-06-06 09:29:03,214 (2724) : INFO (logkit:16) - Fagalicious - **** 2023-06-06 09:29:03,214 (2724) : INFO (logkit:16) - Fagalicious - SEARCH:: **** >> Fagalicious: Finished Search Routine << ***** 2023-06-06 09:29:03,214 (2724) : INFO (logkit:16) - Fagalicious - SEARCH:: **** >> (Men.com) - Colton Reece raw fucks the cum out of Joey Mills in "Accidental Pornstar" (2023) << * 2023-06-06 09:29:03,214 (2724) : INFO (logkit:16) - Fagalicious - SEARCH:: ** >> << ** 2023-06-06 09:29:03,230 (2724) : INFO (logkit:16) - Fagalicious - SEARCH:: >> Failed << ** 2023-06-06 09:29:03,230 (2724) : INFO (logkit:16) - Fagalicious - ****
I will look at this... Currently working on a new scraper to scrape GEVI scenes...
I will add code to utils.py to sort this error out...
Cheers for this
Jason xxx
On Tue, 6 Jun 2023, 18:34 ScoBoXxX, @.***> wrote:
Hey all. I'm seeing this issue as well on Windows - just thought I'd add my 2 pence. I don't think it happens ALL the time, either. I've matched a bunch of "double-quote" titles before. Here's an example of one that isn't matching as of this morning:
Filename: (Men.com) - Colton Reece raw fucks the cum out of Joey Mills in “Accidental Pornstar” (2023).mp4 URL: https://fagalicious.com/men-com-colton-reece-fucks-joey-mills-accidental-pornstar/
Portion of log that shows error: (can give whole log if needed) - maybe cuz there are several titles that start with "Colton Reece raw" ?? I dunno
- just thinking out loud.
2023-06-06 09:29:03,073 (2724) : INFO (logkit:16) - Fagalicious - AGENT :: Original Search Query Colton Reece raw fucks the cum out of Joey Mills in "Accidental Pornstar" 2023-06-06 09:29:03,073 (2724) : INFO (logkit:16) - Fagalicious - AGENT :: Search Query colton reece raw fucks the cum out of joey mills in "accidental pornstar" 2023-06-06 09:29:03,073 (2724) : INFO (logkit:16) - Fagalicious - AGENT :: Search Query Search Query Length: "16 <= 20" 2023-06-06 09:29:03,073 (2724) : INFO (logkit:16) - Fagalicious - AGENT :: Search Query Shorten Search Query: "colton reece raw" 2023-06-06 09:29:03,073 (2724) : INFO (logkit:16) - Fagalicious - AGENT :: Returned Search Query colton+reece+raw 2023-06-06 09:29:03,073 (2724) : INFO (logkit:16) - Fagalicious -
2023-06-06 09:29:03,089 (2724) : INFO (logkit:16) - Fagalicious - SEARCH:: Search Query https://fagalicious.com/search/colton+reece+raw 2023-06-06 09:29:03,105 (2724) : DEBUG (networking:143) - Requesting ' https://fagalicious.com/search/colton+reece+raw' 2023-06-06 09:29:03,214 (2724) : ERROR (networking:196) - Error opening URL 'https://fagalicious.com/search/colton+reece+raw' 2023-06-06 09:29:03,214 (2724) : ERROR (logkit:22) - Fagalicious - SEARCH:: Error: Search Query did not pull any results: HTTP Error 403: Forbidden 2023-06-06 09:29:03,214 (2724) : INFO (logkit:16) - Fagalicious -
2023-06-06 09:29:03,214 (2724) : INFO (logkit:16) - Fagalicious - SEARCH:: **** >> Fagalicious: Finished Search Routine << ***** 2023-06-06 09:29:03,214 (2724) : INFO (logkit:16) - Fagalicious - SEARCH:: **** >> (Men.com) - Colton Reece raw fucks the cum out of Joey Mills in "Accidental Pornstar" (2023) << * 2023-06-06 09:29:03,214 (2724) : INFO (logkit:16) - Fagalicious - SEARCH:: ** >> <<
2023-06-06 09:29:03,230 (2724) : INFO (logkit:16) - Fagalicious - SEARCH:: *** >> Failed <<
2023-06-06 09:29:03,230 (2724) : INFO (logkit:16) - Fagalicious -
— Reply to this email directly, view it on GitHub https://github.com/CodyBerenson/PGMA-Modernized/issues/245#issuecomment-1579098556, or unsubscribe https://github.com/notifications/unsubscribe-auth/AKI3AKO6QCYTVXPC7V47UZDXJ5L2LANCNFSM6AAAAAAYNI4PHU . You are receiving this because you were mentioned.Message ID: @.***>
Just looked at this: It scraped with no issue..... You are getting a 403 Error - this is the website actively refusing you access, it detects that the request is not from a human and tries to prevent scraping.... This is currently what happens with IAFD...... What I generally do when I start getting 403 is reboot my computer and wait 10 or so minutes before I start scraping again.....
attached find log.... com.plexapp.agents.Fagalicious.log
hey sorry for delay. I tried this. Left my PC off overnight and this morning the same results. I thought one matched this morning but it was actually matched via Waybig - for once they had the same title on both sites. So yeah these are still failing.
Any other ideas to help that 403 error?
Here's a new log if you want to look through it. com.plexapp.agents.Fagalicious.log
Just tested it on my machine and it scraped OK... the usual IAFD 403 errors. so di not get the actor pictures. Could you try using a VPN to see if it would be scraped... You are getting a 403 error from Fagalicious.com, so its not even starting to scrape..... There is a possibility that you are been detected as a scraper and you are been politely been told to eff off: https://blog.apify.com/web-scraping-how-to-solve-403-errors/
In the meantime - I have just changed the User-Agent, just as I was writing this email and recrapped it and it has now successfully got the actors pictures off IAFD instead of a 403 error.
So later today I will send the new agents to Cody to test and upload to PGMA
in the meantime if you are online now - go itno your init.py and set HTTP.Headers['User-Agent'] to 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/104.0.5112.102 Safari/537.36 Edg/104.0.1293.63'
Let me know how it goes!
Here is my log: com.plexapp.agents.Fagalicious.log
Thanks @JPH71! Happy test new scrapers on a Mac.
I downloaded and tested with a VPN with same 403 results. No luck there. Also tried to edit that .py file and also no change. Guess I've been targeted by Fagalicious lol. I'm able to lead the website when pasting in the search URL from the log. So they must know that I am scraping or whatever. https://fagalicious.com/search/pledge+recruitment%22 loads fine in a browser. Maybe this is my sign that I need to stop spending so much time on this hobby :)
Just giving an update,
I have sent Cody a new scraper for getting the videos of GEVI.
In this scraper I have put in code to rotate the User-Agent that the code uses to query IAFD...
Marginally better, so hopefully this will work when I adapt the other agents to use the new utils.py...
Just need to sort out the icons when a 403 occurs when an actor is been IAFD queried.....
Please close this issue.....
Hey friends, I still have the double quote issue to this day. Did we ever get this figured out?
@j-ktz since Windows has no issues with double quotes, you'll need to provide @JPH71 your log. Thanks.
I have been living with this for a while and I have to replace the curly quotes with straight quotes manually. It happens only on Macs I think. On 10 Sep 2024, at 5:15 AM, j-ktz @.***> wrote: Hey friends, I still have the double quote issue to this day. Did we ever get this figured out?
—Reply to this email directly, view it on GitHub, or unsubscribe.You are receiving this because you are subscribed to this thread.Message ID: @.***>
Wouldn't it just be easier to replace your curly quotes with straight ones? If they work? Send me a log file and I will lo9k at this over the next few days... I am rather swamped at work with people off on their summer holidays and those coming back with Dehli Belly!
On Tue, 10 Sept 2024, 02:38 Xtian Hog, @.***> wrote:
I have been living with this for a while and I have to replace the curly quotes with straight quotes manually. It happens only on Macs I think. On 10 Sep 2024, at 5:15 AM, j-ktz @.***> wrote: Hey friends, I still have the double quote issue to this day. Did we ever get this figured out?
—Reply to this email directly, view it on GitHub, or unsubscribe.You are receiving this because you are subscribed to this thread.Message ID: @.***>
— Reply to this email directly, view it on GitHub https://github.com/CodyBerenson/PGMA-Modernized/issues/245#issuecomment-2339389896, or unsubscribe https://github.com/notifications/unsubscribe-auth/AKI3AKJP3F4EUQXWYNTUDWDZVY5P7AVCNFSM6AAAAABN5MFLKKVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDGMZZGM4DSOBZGY . You are receiving this because you were mentioned.Message ID: @.***>
I have been living with this for a while and I have to replace the curly quotes with straight quotes manually. It happens only on Macs I think. On 10 Sep 2024, at 5:15 AM, j-ktz @.> wrote: Hey friends, I still have the double quote issue to this day. Did we ever get this figured out? —Reply to this email directly, view it on GitHub, or unsubscribe.You are receiving this because you are subscribed to this thread.Message ID: @.>
What do you use to replace the quotes? I've tried replacing them but it always still fails. Do you use a website or just do it in Finder?
Wouldn't it just be easier to replace your curly quotes with straight ones? If they work? Send me a log file and I will lo9k at this over the next few days... I am rather swamped at work with people off on their summer holidays and those coming back with Dehli Belly! … On Tue, 10 Sept 2024, 02:38 Xtian Hog, @.> wrote: I have been living with this for a while and I have to replace the curly quotes with straight quotes manually. It happens only on Macs I think. On 10 Sep 2024, at 5:15 AM, j-ktz @.> wrote: Hey friends, I still have the double quote issue to this day. Did we ever get this figured out? —Reply to this email directly, view it on GitHub, or unsubscribe.You are receiving this because you are subscribed to this thread.Message ID: @.> — Reply to this email directly, view it on GitHub <#245 (comment)>, or unsubscribe https://github.com/notifications/unsubscribe-auth/AKI3AKJP3F4EUQXWYNTUDWDZVY5P7AVCNFSM6AAAAABN5MFLKKVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDGMZZGM4DSOBZGY . You are receiving this because you were mentioned.Message ID: @.>
com.plexapp.agents.Fagalicious.log
I've tried replacing quotes but no dice. No rush but here's the log!
Mandatory
Describe the bug:
This issue seems to keep popping up for me. I copy/paste the title directly from browser, add ( ) around studio and year, but if there are " " in the title, it doesn't match.
Specific Agent(s) Causing the Issue:
Fagalicious
Index Site URL(s) Attempting to Match:
https://fagalicious.com/cutlersden-drew-sebastian-boy-david-where-he-belongs/
Log Attached:
com.plexapp.agents.Fagalicious.log
Optional
Screenshot(s) with all nudity redacted:
Additional Context:
Desktop (please complete the following information):