IgnoredAmbience / yahoo-group-archiver

Scrapes and archives a Yahoo groups email archives, photo galleries and file contents using the non-public API
MIT License
93 stars 46 forks source link

My browser session's T and Y cookie have a different structure/seem to be invalid #106

Closed SanneSideUp closed 4 years ago

SanneSideUp commented 4 years ago

First, thanks for creating the script, I have some 100 groups I would need to archive for the OTW until the deadline and I have a basic understanding of commandlines, so I thought I give this a try.

Not sure what I'm doing wrong, though. I looked up my browser session's Y and T cookie. Manual says "The cookies should look like this, where someLongText and someShortText are arbitrary strings:"

For example, my Y cookie start with a '=', not with a 'v=' and instead of &1= after the first long text mine has a &iz=

The only other cookie that starts with a v= is the AS one, but this one seem too short, it basically only, "v=1&s=someLongText" Subsequently, the script doesn't know what to with the "iz", "r" and "intl" commands it gets from my Y cookie and I get a "yahoo.py: error too few arguments" error.

Also, there is no E cookie in my browser session, despite me being in the EU (might be not an issue, though)...

I'm grateful for any help.

Pablo2m commented 4 years ago

Could you tell us what browser you are using?

SanneSideUp commented 4 years ago

Chrome Version 78.0.3904.87 (official Build) (64-Bit) on Windows 10 Build 1903

ugcheleuce commented 4 years ago

Yes, different browsers or extensions export the cookies in different formats. I have not been able to get the scripts to work with a cookie.txt file at all, so I have to specify the T and Y cookies specifically.

SanneSideUp commented 4 years ago

In Chrome I managed to find Cookie values for Y and T that look more like the ones in manual via the Dev Tools (F12) (and then via Application->Cookies. Still no E cookies, though. Script throws a lot of UnicodeDecodeError "' but seems to now at least do something (and according to this Issue here I just need to upgrade by Python version to 3 to fix that) :-)

IgnoredAmbience commented 4 years ago

In chrome the window noted in the readme file doesn't make it clear that there's more text in the textbox than is shown on the screen, using the cursor or double click to select everything should get the correct cookie values. I should update the readme to make this clear...