Open systwi-again opened 2 years ago
I can't test this (busy), sorry.
To potentially narrow down when this bug happens, could you go to http://thetechrobo.ca:1111, verify it says that the cookie isn't set, then go to http://thetechrobo.ca:1111/set to set the cookie? Then export the cookies, run wpull on http://thetechrobo.ca:1111 with the cookies, and see if it says the cookies are set.
Hmm, oddly enough, your site did work with --load-cookies. Can't explain why it's an outlier...
Again, I've had --load-cookies work before, like with Planet French, but not with Infos-Ados.
Are you sure there aren't any #HttpOnly lines in the cookies.txt...?
Okay, I thought maybe it was an issue with my particular setup for some reason.
Regarding #HttpOnly lines, there is but one instance. It's my school's proprietary web portal that I'm trying to save. I can save it only using the aforementioned workaround, which doesn't send any #HttpOnly cookies anyway, so I take it that that cookie is not as important. ¯\_(ツ)_/¯ I don't know.
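For anyone wanting to check their own export for such lines, here is a minimal sketch, assuming Python 3 and a Netscape-format cookies.txt in which HttpOnly cookies are written with a #HttpOnly_ prefix before the domain field. The helper name split_cookie_lines is made up for illustration and is not part of wpull or grab-site.

```python
from pathlib import Path

# Prefix that some exporters put in front of the domain field for HttpOnly cookies.
HTTPONLY_PREFIX = "#HttpOnly_"


def split_cookie_lines(path):
    """Return (regular, http_only) cookie lines from a Netscape-format cookies.txt.

    Hypothetical helper for illustration only: it does a plain text scan and
    does not validate the individual fields.
    """
    regular, http_only = [], []
    for line in Path(path).read_text(encoding="utf-8").splitlines():
        stripped = line.strip()
        if stripped.startswith(HTTPONLY_PREFIX):
            # Looks like a comment, but actually carries a cookie.
            http_only.append(stripped)
        elif stripped and not stripped.startswith("#"):
            # Ordinary cookie line (blank lines and real comments are skipped).
            regular.append(stripped)
    return regular, http_only
```

Calling split_cookie_lines("/absolute/path/to/cookies.txt") and printing the length of each list shows how many cookies a parser would miss if it treated every line starting with # as a comment.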
What was that workaround? I can't find it.
Never mind, found it.
What I wanted/expected: Cookies, read from the provided cookies.txt, to be used during crawls with wpull.
What happened: wpull ignores the provided cookies.txt file and crawls without it.
The command or website causes the problem: --load-cookies=/absolute/path/to/cookies.txt
Operating system: Debian GNU/Linux 11 (x86_64)
Python version: 3.8.13
Wpull version: 3.0.9
Options used with wpull (obtained using grab-site's --which-wpull-args-partial):
Further details and temporary workaround here.
Even after giving cookies.txt 777 permissions, wpull still refuses to use the cookies in cookies.txt during crawls.
The filesystem used for everything is ext4, has no I/O errors, has ample free space, and passes fsck.ext4, and the absolute path contains no spaces or special characters of any kind (just lowercase a-z).
cookies.txt was exported using version 0.3 of this Firefox extension under Firefox 78.15.0esr on the same OS, and was not modified after exporting.
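As a sanity check on the exported file itself, independent of wpull, the sketch below loads cookies.txt with Python's standard-library http.cookiejar and reports which Cookie header it would attach to a given URL. This only shows that the file parses and contains cookies matching the host; it says nothing about how wpull reads the file. The function name cookie_header_for is hypothetical.

```python
import http.cookiejar
import urllib.request


def cookie_header_for(cookies_path, url):
    """Return the Cookie header the standard library would send to `url`, or None.

    Hypothetical helper for illustration; it makes no network request.
    """
    jar = http.cookiejar.MozillaCookieJar()
    # load() raises http.cookiejar.LoadError if the file is not a valid
    # Netscape-format cookie file. ignore_discard=True keeps session cookies
    # (those written without an expiry time).
    jar.load(cookies_path, ignore_discard=True, ignore_expires=False)
    request = urllib.request.Request(url)
    # Attach whichever stored cookies match the URL's host, path and scheme.
    jar.add_cookie_header(request)
    return request.get_header("Cookie")
```

For example, cookie_header_for("/absolute/path/to/cookies.txt", "http://thetechrobo.ca:1111/") returns None when no cookie in the file matches that host. Depending on the Python version, the standard library may skip #HttpOnly_ lines as comments, so a None result for a site whose cookies are all HttpOnly is not conclusive.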