CodyBerenson / PGMA-Modernized

An updated approach for Plex Gay Media Adult Agents for both Full Feature Films and Scenes
MIT License

[BUG]: Fagalicious won't match with double quotes #245

Closed j-ktz closed 1 year ago

j-ktz commented 1 year ago

Mandatory

Describe the bug:

This issue seems to keep popping up for me. I copy/paste the title directly from browser, add ( ) around studio and year, but if there are " " in the title, it doesn't match.

Specific Agent(s) Causing the Issue:

Fagalicious

Index Site URL(s) Attempting to Match:

https://fagalicious.com/cutlersden-drew-sebastian-boy-david-where-he-belongs/

Log Attached:

com.plexapp.agents.Fagalicious.log

Optional

Screenshot(s) with all nudity redacted:

Additional Context:

Desktop (please complete the following information):

JPH71 commented 1 year ago

I will check it out soon.. Thanks for finding this

On Wed, 24 May 2023, 14:09 j-ktz, @.***> wrote:

Mandatory

Describe the bug:

This issue seems to keep popping up for me. I copy/paste the title directly from browser, add ( ) around studio and year, but if there are " " in the title, it doesn't match.

Specific Agent(s) Causing the Issue:

Fagalicious

Index Site URL(s) Attempting to Match:

https://fagalicious.com/cutlersden-drew-sebastian-boy-david-where-he-belongs/

Log Attached:

com.plexapp.agents.Fagalicious.log https://github.com/CodyBerenson/PGMA-Modernized/files/11554356/com.plexapp.agents.Fagalicious.log

Optional

Screenshot(s) with all nudity redacted:

Additional Context:

Desktop (please complete the following information):

  • OS: [e.g. iOS] Mac OS 12.6.3 (21G419)
  • Browser [e.g. chrome, safari] Safari
  • Version [e.g. 22]


CodyBerenson commented 1 year ago

@j-ktz Hi, hope this finds you well. This doesn't have anything to do with the quotes. You haven't set your "Genre" parameter. When you open up the parameters, it may appear that "Purple" is selected, but it is just one of the list of values, and you actually have to make a selection.

In the log it says "None", and the agent fails with an error: SEARCH:: Error: <QUIT: Scraping Parameters Not Set Up!

image

j-ktz commented 1 year ago

Thanks @CodyBerenson! I tweaked the setting and got fagalicious to match a title successfully without quotes, but titles with "" still won't match. See new log.
com.plexapp.agents.Fagalicious.log

CodyBerenson commented 1 year ago

@j-ktz Sorry, matches perfectly on Windows. I can see where your log treats the quotes differently than mine.

@JPH71 Can you take a look at his log? Thanks!

j-ktz commented 1 year ago

Yeah, I'm on a Mac that remotes into Synology with Finder.

JPH71 commented 1 year ago

It also matches for me... I will look into this more carefully and find a way to sort this heifer out!!! once and for all....

image

com.plexapp.agents.Fagalicious.log

CodyBerenson commented 1 year ago

Dear @JPH71 Please read and follow the rules about posting naughty pictures or you will get many, many whacks on the arse.

You have been warned

The management.

JPH71 commented 1 year ago

Oh shit... I forgot!!!!

CodyBerenson commented 1 year ago

Since you are a newbie, the management has redacted your naughty vile photo and reposted it for you.

JPH71 commented 1 year ago

Sorry man... I shouldn't post when I get back from the Pub..... Just quickly ran a text and did a snip..... Doh!!!!


JPH71 commented 1 year ago

ran a test!!!! I am stopping emailing now!


CodyBerenson commented 1 year ago

No worries. I think you'd approve of the family friendly image substitution.

JPH71 commented 1 year ago

LOL


JPH71 commented 1 year ago

It's an image to frighten the dead....


JPH71 commented 1 year ago

night night....


j-ktz commented 1 year ago

LOL. I've missed you boys.

JPH71 commented 1 year ago

😉


ScoBoXxX commented 1 year ago

Hey all. I'm seeing this issue as well on Windows - just thought I'd add my 2 pence. I don't think it happens ALL the time, either. I've matched a bunch of "double-quote" titles before. Here's an example of one that isn't matching as of this morning:

Filename: (Men.com) - Colton Reece raw fucks the cum out of Joey Mills in “Accidental Pornstar” (2023).mp4

URL: https://fagalicious.com/men-com-colton-reece-fucks-joey-mills-accidental-pornstar/

Portion of log that shows error: (can give whole log if needed) - maybe cuz there are several titles that start with "Colton Reece raw" ?? I dunno - just thinking out loud.

2023-06-06 09:29:03,073 (2724) : INFO (logkit:16) - Fagalicious - AGENT :: Original Search Query Colton Reece raw fucks the cum out of Joey Mills in "Accidental Pornstar"
2023-06-06 09:29:03,073 (2724) : INFO (logkit:16) - Fagalicious - AGENT :: Search Query colton reece raw fucks the cum out of joey mills in "accidental pornstar"
2023-06-06 09:29:03,073 (2724) : INFO (logkit:16) - Fagalicious - AGENT :: Search Query Search Query Length: "16 <= 20"
2023-06-06 09:29:03,073 (2724) : INFO (logkit:16) - Fagalicious - AGENT :: Search Query Shorten Search Query: "colton reece raw"
2023-06-06 09:29:03,073 (2724) : INFO (logkit:16) - Fagalicious - AGENT :: Returned Search Query colton+reece+raw
2023-06-06 09:29:03,073 (2724) : INFO (logkit:16) - Fagalicious - --------------------------------------------------------------------------------------------------------------------------------------------
2023-06-06 09:29:03,089 (2724) : INFO (logkit:16) - Fagalicious - SEARCH:: Search Query https://fagalicious.com/search/colton+reece+raw
2023-06-06 09:29:03,105 (2724) : DEBUG (networking:143) - Requesting 'https://fagalicious.com/search/colton+reece+raw'
2023-06-06 09:29:03,214 (2724) : ERROR (networking:196) - Error opening URL 'https://fagalicious.com/search/colton+reece+raw'
2023-06-06 09:29:03,214 (2724) : ERROR (logkit:22) - Fagalicious - SEARCH:: Error: Search Query did not pull any results: HTTP Error 403: Forbidden
2023-06-06 09:29:03,214 (2724) : INFO (logkit:16) - Fagalicious - ****
2023-06-06 09:29:03,214 (2724) : INFO (logkit:16) - Fagalicious - SEARCH:: **** >> Fagalicious: Finished Search Routine << *****
2023-06-06 09:29:03,214 (2724) : INFO (logkit:16) - Fagalicious - SEARCH:: **** >> (Men.com) - Colton Reece raw fucks the cum out of Joey Mills in "Accidental Pornstar" (2023) << *
2023-06-06 09:29:03,214 (2724) : INFO (logkit:16) - Fagalicious - SEARCH:: ** >> << **
2023-06-06 09:29:03,230 (2724) : INFO (logkit:16) - Fagalicious - SEARCH:: >> Failed << **
2023-06-06 09:29:03,230 (2724) : INFO (logkit:16) - Fagalicious - ****
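
Reading the log above, the agent appears to lowercase the title, keep only as many whole words as fit within 20 characters (the `16 <= 20` line matches the 16 characters of `colton reece raw`), and join them with `+` to build the search URL. Here is a minimal standalone Python 3 sketch of that reading. The names `shorten_query` and `max_len` are hypothetical, and the real logic in the agent's utils.py may well differ.

```python
def shorten_query(title, max_len=20):
    """Keep leading whole words of the lowercased title that fit in max_len characters.

    Hypothetical reconstruction from the log output, not the agent's actual code.
    """
    kept = []
    for word in title.lower().split():
        candidate = " ".join(kept + [word])
        if len(candidate) > max_len:
            break  # adding this word would exceed the limit
        kept.append(word)
    # The search URL in the log uses '+' between words.
    return "+".join(kept)


if __name__ == "__main__":
    print("https://fagalicious.com/search/" + shorten_query("Alpha Bravo Charlie Delta Echo"))
```

Note that the failing request in the log is a 403 on a query that contains no quotes at all, so the shortening step by itself does not explain the failure.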

JPH71 commented 1 year ago

I will look at this... Currently working on a new scraper to scrape GEVI scenes...

I will add code to utils.py to sort this error out...

Cheers for this

Jason xxx

On Tue, 6 Jun 2023, 18:34 ScoBoXxX, @.***> wrote:

Hey all. I'm seeing this issue as well on Windows - just thought I'd add my 2 pence. I don't think it happens ALL the time, either. I've matched a bunch of "double-quote" titles before. Here's an example of one that isn't matching as of this morning:

Filename: (Men.com) - Colton Reece raw fucks the cum out of Joey Mills in “Accidental Pornstar” (2023).mp4 URL: https://fagalicious.com/men-com-colton-reece-fucks-joey-mills-accidental-pornstar/

Portion of log that shows error: (can give whole log if needed) - maybe cuz there are several titles that start with "Colton Reece raw" ?? I dunno

  • just thinking out loud.

2023-06-06 09:29:03,073 (2724) : INFO (logkit:16) - Fagalicious - AGENT :: Original Search Query Colton Reece raw fucks the cum out of Joey Mills in "Accidental Pornstar" 2023-06-06 09:29:03,073 (2724) : INFO (logkit:16) - Fagalicious - AGENT :: Search Query colton reece raw fucks the cum out of joey mills in "accidental pornstar" 2023-06-06 09:29:03,073 (2724) : INFO (logkit:16) - Fagalicious - AGENT :: Search Query Search Query Length: "16 <= 20" 2023-06-06 09:29:03,073 (2724) : INFO (logkit:16) - Fagalicious - AGENT :: Search Query Shorten Search Query: "colton reece raw" 2023-06-06 09:29:03,073 (2724) : INFO (logkit:16) - Fagalicious - AGENT :: Returned Search Query colton+reece+raw 2023-06-06 09:29:03,073 (2724) : INFO (logkit:16) - Fagalicious -

2023-06-06 09:29:03,089 (2724) : INFO (logkit:16) - Fagalicious - SEARCH:: Search Query https://fagalicious.com/search/colton+reece+raw 2023-06-06 09:29:03,105 (2724) : DEBUG (networking:143) - Requesting ' https://fagalicious.com/search/colton+reece+raw' 2023-06-06 09:29:03,214 (2724) : ERROR (networking:196) - Error opening URL 'https://fagalicious.com/search/colton+reece+raw' 2023-06-06 09:29:03,214 (2724) : ERROR (logkit:22) - Fagalicious - SEARCH:: Error: Search Query did not pull any results: HTTP Error 403: Forbidden 2023-06-06 09:29:03,214 (2724) : INFO (logkit:16) - Fagalicious -


2023-06-06 09:29:03,214 (2724) : INFO (logkit:16) - Fagalicious - SEARCH:: **** >> Fagalicious: Finished Search Routine << ***** 2023-06-06 09:29:03,214 (2724) : INFO (logkit:16) - Fagalicious - SEARCH:: **** >> (Men.com) - Colton Reece raw fucks the cum out of Joey Mills in "Accidental Pornstar" (2023) << * 2023-06-06 09:29:03,214 (2724) : INFO (logkit:16) - Fagalicious - SEARCH:: ** >> <<


2023-06-06 09:29:03,230 (2724) : INFO (logkit:16) - Fagalicious - SEARCH:: *** >> Failed <<


2023-06-06 09:29:03,230 (2724) : INFO (logkit:16) - Fagalicious -


— Reply to this email directly, view it on GitHub https://github.com/CodyBerenson/PGMA-Modernized/issues/245#issuecomment-1579098556, or unsubscribe https://github.com/notifications/unsubscribe-auth/AKI3AKO6QCYTVXPC7V47UZDXJ5L2LANCNFSM6AAAAAAYNI4PHU . You are receiving this because you were mentioned.Message ID: @.***>

JPH71 commented 1 year ago

Just looked at this: It scraped with no issue..... You are getting a 403 Error - this is the website actively refusing you access, it detects that the request is not from a human and tries to prevent scraping.... This is currently what happens with IAFD...... What I generally do when I start getting 403 is reboot my computer and wait 10 or so minutes before I start scraping again.....

attached find log.... com.plexapp.agents.Fagalicious.log image
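
The "wait a while and try again" advice above can be sketched as a small retry loop. This is standalone Python 3 using only the standard library, not the Plex framework's own HTTP helper. `fetch_with_retry`, `opener`, and `sleep` are hypothetical names, and the 600-second default simply mirrors the "10 or so minutes" suggestion. It only helps when the block is temporary.

```python
import time
import urllib.error
import urllib.request


def fetch_with_retry(url, opener=urllib.request.urlopen, retries=3,
                     wait_seconds=600, sleep=time.sleep):
    """Call opener(url); on HTTP 403, wait and retry up to `retries` attempts."""
    for attempt in range(1, retries + 1):
        try:
            return opener(url)
        except urllib.error.HTTPError as err:
            # Re-raise anything that is not a 403, or a 403 on the last attempt.
            if err.code != 403 or attempt == retries:
                raise
            sleep(wait_seconds)
```

`opener` and `sleep` are parameters so the loop can be exercised without touching the network or actually waiting.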

ScoBoXxX commented 1 year ago

hey sorry for delay. I tried this. Left my PC off overnight and this morning the same results. I thought one matched this morning but it was actually matched via Waybig - for once they had the same title on both sites. So yeah these are still failing.

Any other ideas to help that 403 error?

Here's a new log if you want to look through it. com.plexapp.agents.Fagalicious.log

JPH71 commented 1 year ago

Just tested it on my machine and it scraped OK... the usual IAFD 403 errors, so it did not get the actor pictures. Could you try using a VPN to see if it will scrape... You are getting a 403 error from Fagalicious.com, so it's not even starting to scrape..... There is a possibility that you are being detected as a scraper and are being politely told to eff off: https://blog.apify.com/web-scraping-how-to-solve-403-errors/

In the meantime, I have just changed the User-Agent as I was writing this email and rescraped it, and it has now successfully got the actors' pictures off IAFD instead of a 403 error.

So later today I will send the new agents to Cody to test and upload to PGMA.

In the meantime, if you are online now, go into your __init__.py and set HTTP.Headers['User-Agent'] to 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/104.0.5112.102 Safari/537.36 Edg/104.0.1293.63'

Let me know how it goes!

Here is my log: com.plexapp.agents.Fagalicious.log
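
For anyone who wants to check outside Plex whether the User-Agent change above makes a difference, here is a minimal standalone Python 3 sketch. It builds a request with the same header using the standard library. `build_request` is a hypothetical helper name. Inside the agent the equivalent setting is the `HTTP.Headers['User-Agent']` line quoted above.

```python
import urllib.request

# User-Agent string taken from the comment above.
USER_AGENT = (
    "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 "
    "(KHTML, like Gecko) Chrome/104.0.5112.102 Safari/537.36 Edg/104.0.1293.63"
)


def build_request(url, user_agent=USER_AGENT):
    """Return a urllib Request that carries a browser-like User-Agent header."""
    return urllib.request.Request(url, headers={"User-Agent": user_agent})


if __name__ == "__main__":
    req = build_request("https://fagalicious.com/search/colton+reece+raw")
    # urllib stores header names capitalized as 'User-agent'.
    print(req.get_header("User-agent"))
```

Passing the request to `urllib.request.urlopen(req)` would send it. A 403 at that point would suggest the block is not based on the User-Agent alone.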

j-ktz commented 1 year ago

Thanks @JPH71! Happy to test new scrapers on a Mac.

ScoBoXxX commented 1 year ago

I downloaded and tested with a VPN with the same 403 results. No luck there. Also tried to edit that .py file and also no change. Guess I've been targeted by Fagalicious lol. I'm able to load the website when pasting in the search URL from the log. So they must know that I am scraping or whatever. https://fagalicious.com/search/pledge+recruitment%22 loads fine in a browser. Maybe this is my sign that I need to stop spending so much time on this hobby :)

JPH71 commented 1 year ago

Just giving an update,

I have sent Cody a new scraper for getting the videos off GEVI.

In this scraper I have put in code to rotate the User-Agent that the code uses to query IAFD...

Marginally better, so hopefully this will work when I adapt the other agents to use the new utils.py...

Just need to sort out the icons when a 403 occurs while an actor is being queried on IAFD.....
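
The User-Agent rotation described above could look roughly like the following standalone Python 3 sketch. It is not the code sent to Cody: `make_rotator` is a hypothetical name, and apart from the first string (quoted earlier in this thread) the User-Agent values are just illustrative browser-style strings.

```python
import itertools

# Illustrative pool; only the first entry comes from this thread.
USER_AGENTS = [
    "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 "
    "(KHTML, like Gecko) Chrome/104.0.5112.102 Safari/537.36 Edg/104.0.1293.63",
    "Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:109.0) Gecko/20100101 Firefox/115.0",
    "Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/605.1.15 "
    "(KHTML, like Gecko) Version/16.5 Safari/605.1.15",
]


def make_rotator(agents):
    """Return a function that yields the next User-Agent in round-robin order."""
    pool = itertools.cycle(agents)
    return lambda: next(pool)


if __name__ == "__main__":
    next_user_agent = make_rotator(USER_AGENTS)
    for _ in range(4):
        # Each outgoing query would use the next header value in turn.
        print({"User-Agent": next_user_agent()})
```

Round-robin is the simplest choice. Picking at random with `random.choice` would work equally well for this purpose.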

JPH71 commented 1 year ago

Please close this issue.....

j-ktz commented 1 month ago

Hey friends, I still have the double quote issue to this day. Did we ever get this figured out?

CodyBerenson commented 1 month ago

@j-ktz since Windows has no issues with double quotes, you'll need to provide @JPH71 your log. Thanks.

vampirelayer commented 1 month ago

I have been living with this for a while and I have to replace the curly quotes with straight quotes manually. It happens only on Macs I think.

JPH71 commented 1 month ago

Wouldn't it just be easier to replace your curly quotes with straight ones? If they work? Send me a log file and I will look at this over the next few days... I am rather swamped at work with people off on their summer holidays and those coming back with Delhi belly!


j-ktz commented 1 month ago

I have been living with this for a while and I have to replace the curly quotes with straight quotes manually. It happens only on Macs I think.

What do you use to replace the quotes? I've tried replacing them but it always still fails. Do you use a website or just do it in Finder?
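
One scripted way to do the curly-to-straight replacement, as a standalone Python 3 sketch. `straighten_quotes` and `rename_with_straight_quotes` are hypothetical helper names and are not part of PGMA. Since the comment above reports that replacing the quotes still fails, this may not be the whole story.

```python
from pathlib import Path

# Map common curly quote characters to their straight ASCII equivalents.
QUOTE_MAP = str.maketrans({
    "\u201c": '"',  # left double quotation mark
    "\u201d": '"',  # right double quotation mark
    "\u201e": '"',  # double low-9 quotation mark
    "\u2018": "'",  # left single quotation mark
    "\u2019": "'",  # right single quotation mark
})


def straighten_quotes(text):
    """Return text with curly quotes replaced by straight ones."""
    return text.translate(QUOTE_MAP)


def rename_with_straight_quotes(path):
    """Rename one file so its name uses straight quotes; return the resulting path."""
    path = Path(path)
    new_name = straighten_quotes(path.name)
    if new_name == path.name:
        return path  # nothing to change
    return path.rename(path.with_name(new_name))


if __name__ == "__main__":
    print(straighten_quotes("(Studio) - Title in \u201cWhere He Belongs\u201d (2023).mp4"))
```

Some file systems (Windows in particular) do not allow a straight `"` in file names, so the rename helper is only meaningful where that character is permitted.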

j-ktz commented 1 month ago

Wouldn't it just be easier to replace your curly quotes with straight ones? If they work? Send me a log file and I will look at this over the next few days... I am rather swamped at work with people off on their summer holidays and those coming back with Delhi belly!

com.plexapp.agents.Fagalicious.log

I've tried replacing quotes but no dice. No rush but here's the log!