jpd236 / CrosswordScraper

Browser extension which downloads crosswords from crossword applets for offline solving.
Apache License 2.0

Stopped working on The Week crossword puzzles #39

Closed lobert closed 8 months ago

lobert commented 8 months ago

Stopped working on The Week crossword puzzles

arelkin commented 8 months ago

The Week seems to have changed their website design as well as something about their puzzle management.

lobert commented 8 months ago

Thanks for getting back to me.

jpd236 commented 8 months ago

There seem to be two different types of crossword at https://theweek.com/puzzles. One, labeled "Crossword", is a daily crossword that seems to be syndicated from puzzler.com. These aren't and have never been supported. They also don't strike me as particularly interesting: UK-style grid with completely straight cluing, very possibly just computer-generated. I don't think I would put the effort in to support these without more explicit demand.

There is also the "Magazine interactive crossword", which appears to be the regular Matt Gaffney puzzle. For whatever reason these don't appear in chronological order, but there is still one available for today on the second page:

https://theweek.com/puzzles/magazine-interactive-crossword-october-20-2023

These should work, but the applet moved from hosting the puzzles on the same domain to another domain, which the scraper isn't handling correctly. Should be a straightforward fix.
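As a rough illustration of the kind of fix this implies (not the scraper's actual code), here is a minimal sketch that resolves a puzzle reference against the applet's URL. It assumes the applet page exposes the puzzle location as a string that may be relative or absolute. The function name `resolvePuzzleUrl` and the applet URL in the comments are hypothetical; only the CDN URL comes from this thread.

```typescript
// Hypothetical helper, not taken from CrosswordScraper's source.
// Resolves a puzzle reference found in the applet page against the applet's own URL.
function resolvePuzzleUrl(appletUrl: string, puzzleRef: string): string {
  // The WHATWG URL constructor keeps an absolute puzzleRef unchanged and only
  // uses the base (appletUrl) when puzzleRef is relative, so a puzzle hosted
  // on another domain is not forced back onto the applet's domain.
  return new URL(puzzleRef, appletUrl).href;
}

// Illustrative applet URL (an assumption, not the real one).
const appletUrl = "https://theweek.com/applet/index.html";

// Same-domain, relative reference: resolved against the applet URL.
const sameDomain = resolvePuzzleUrl(appletUrl, "puzzles/Week715.puz");

// Cross-domain, absolute reference: returned as-is.
const crossDomain = resolvePuzzleUrl(
  appletUrl,
  "https://cdn.mos.cms.futurecdn.net/6wmV9SDcJNiqPiSeRAyGeU/Week715.puz"
);
```

Building the URL by concatenating the applet's origin with the reference would only work in the same-domain case, which may be the sort of assumption that broke here.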

arelkin commented 8 months ago

I look here: https://theweek.com/tag/magazine-puzzles and look for puzzles labeled "interactive"

I also look here: https://theweek.com/archive and search for "interactive" there too. Sometimes this "archive" has the latest interactive puzzle before the previous link does (don't know why).

Also, if you are just desperate to get the file, look in the page source for an IFRAME with class "crossword", then look at its SRC. Then just copy the puzzle URL from the query string of that SRC, for example:

https://cdn.mos.cms.futurecdn.net/6wmV9SDcJNiqPiSeRAyGeU/Week715.puz
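The manual step above (pulling the .puz URL out of the iframe's SRC) could be scripted roughly as follows. This is only a sketch: the iframe host and the `puzzle` parameter name in the example are assumptions, since the thread does not show the real SRC, so the helper simply returns the first query-string value that ends in `.puz`.

```typescript
// Hypothetical helper: returns the first query-string value in an iframe SRC
// that ends in ".puz", or null if there is none.
function puzUrlFromIframeSrc(iframeSrc: string): string | null {
  let found: string | null = null;
  // searchParams decodes percent-encoded values for us.
  new URL(iframeSrc).searchParams.forEach((value) => {
    if (found === null && /\.puz$/i.test(value)) {
      found = value;
    }
  });
  return found;
}

// Made-up SRC for illustration; only the CDN URL comes from this thread.
const exampleSrc =
  "https://example.com/crossword.html?puzzle=" +
  encodeURIComponent(
    "https://cdn.mos.cms.futurecdn.net/6wmV9SDcJNiqPiSeRAyGeU/Week715.puz"
  );

const puzUrl = puzUrlFromIframeSrc(exampleSrc);
const noPuz = puzUrlFromIframeSrc("https://example.com/crossword.html?theme=dark");
```

In a browser console, the SRC itself could be read with something like `document.querySelector("iframe.crossword")`, assuming the class name mentioned above is still in use.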

lobert commented 8 months ago

Thanks.

jpd236 commented 8 months ago

Thanks again for the report. Will be fixed in the next release.