Mincka / DMArchiver

A tool to archive the direct messages, images and videos from your private conversations on Twitter
GNU General Public License v3.0
222 stars 25 forks source link

Error when using the lastest release - not all messages downloaded #38

Closed sussron closed 6 years ago

sussron commented 6 years ago

Hi there. I downloaded the latest Mac release. When I ran the archiver, I noticed it only went back to July 2017 in some of the threads and not all the way to the beginning. One of my largest DM messages is about 22MB txt file once downloaded and this time it was only 1.5MB. I did one screen shot of what seems to be different than what I normally see. Hope this helps in the explanation As a reminder I'm using the Mac version.

Thanks Ronnie

screenshot 2017-10-06 19 25 34 screenshot 2017-10-06 19 56 01 screenshot 2017-10-06 19 55 37
sussron commented 6 years ago

The main one that i am concerned about, conversation 629006352329760768 had This month: Starting crawl of '629006352329760768' Previous conversation not found. Creating a new one with incremental support. Begin of thread reached Total processed tweets: 13259 Writing conversation to /Users/ronniesussman/Downloads/629006352329760768.txt

But in the prior one I ran in February it had 166,858 tweets.

image

Mincka commented 6 years ago

Hello Ronnie,

I tried to run the script on my conversations without issues, even if my biggest conversations has "only" 22144 tweets. Sometimes, due to a connection error, the script may think it has reached the end of the thread and will continue with the next one. The only thing you can try here is to run the script again.

However, I think you miss something important. The script now allows you to perform incremental updates. That means only the new messages are downloaded and added to your archive. You just have to run the script where your log files are.

To know if a conversation file can be automatically completed, you just have to look if the last line contains [LatestTweetID] XXXXXXXXXXXX. It is an indication for the script to know how many new tweets have been sent since the last run.

If you did not keep a complete version of 629006352329760768, I suggest that you delete the incomplete file you just downloaded and start again. If this time, all the messages are downloaded, continue the next time to run the script with the complete file, in the same folder. The script will just add the new tweets and your backup will be done a lot faster. 😃

sussron commented 6 years ago

Oh i had missed that Julien, what a great new feature. I have kept my updates in separate folders so that is probably why it may have known it was run but i didn't get all the information in the download folder where the downloads automatically go.

screenshot 2017-10-07 09 11 13
sussron commented 6 years ago

Now this is strange so i ran the command but got this note: Ronnies-MacBook-Pro:downloads ronniesussman$ ./dmarchiver -id "629006352329760768" -di -dg DMArchiver 0.2.0 Running on Python 3.4.4 (v3.4.4:737efcadf5a6, Dec 19 2015, 20:38:52) [GCC 4.2.1 (Apple Inc. build 5666) (dot 3)]

Enter your username or email: beckybulldognj Enter your password (characters will not be displayed): Authentication succeedeed.

Press Ctrl+C at anytime to write the current conversation and skip to the next one. Keep it pressed to exit the script.

Conversation ID specified (629006352329760768). Retrieving only one thread. Starting crawl of '629006352329760768' Previous conversation not found. Creating a new one with incremental support. Begin of thread reached Total processed tweets: 0 Writing conversation to /Users/ronniesussman/Downloads/629006352329760768.txt Ronnies-MacBook-Pro:downloads ronniesussman$

sussron commented 6 years ago

I do see that in the file I downloaded in June that the last tweet was 6/27/17 and in this one i ran yesterday it started incrementally with 7/27/17 so it missed a month of messages but created a txt with everything since 7/27/17. I attempted this morning to run a clean one of that particular conversation and moved all the prior txt log files into a separate folder from downloads so that it would be fresh but it downloaded nothing. I'm afraid to delete the txt i have moved to this alternate location, but is the script somehow finding it on my computer and therefore deeming that zero new messages have been found?

Ronnie

Thanks and sorry for my ignorance. I love that you have created this amazing tool.

sussron commented 6 years ago

I will try running again as you suggested and let you know. Thanks again.

sussron commented 6 years ago

Hi Julien, Here's an update. i ran it and this time it came up with the 220,000+ tweets and created folders for the images, videos etc.

Thank you!

Mincka commented 6 years ago

Hi Ronnie,

Good to know it finally worked properly. 220,000 is insane. 😄

Keep it up with the incremental updates. 😉

Julien