Closed ckshitij closed 7 years ago
Hi there! Unfortunately, the data you sent me in the zip file isn't very helpful; it's just a subset of the source code of this project.
As to your issue, it appears from the exception that there is at least one message in your messages.htm file that's missing timestamp data. Unfortunately, I cannot really diagnose what's wrong with it without looking in the file myself. Obviously that's private and you shouldn't post it ;)
To help diagnose the issue, you could add the following print statement after this line:

    ...
    elif tag == "p" and pos == "end":
        print(self.current_timestamp, self.current_sender, e.text.strip() if e.text else "")
    ...
That will at least tell you how many messages the parser gets through before crashing.
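If editing the installed parser is awkward, the same idea can be tried in a standalone script. The following is only a rough sketch, not the project's actual code: the markup in `SAMPLE`, the `message`/`user`/`meta` class names, and the function `find_first_missing_timestamp` are all assumptions made for illustration, and a real messages.htm may not be well-formed enough for `xml.etree.ElementTree` to read directly.

```python
import io
import xml.etree.ElementTree as ET

# Hypothetical sample markup; the class names are assumptions, not a
# confirmed description of Facebook's archive format.
SAMPLE = """<div class="thread">
<div class="message"><div class="message_header"><span class="user">Alice</span><span class="meta">Monday, January 1, 2018 at 10:00am UTC</span></div></div>
<p>hello</p>
<div class="message"><div class="message_header"><span class="user">Bob</span></div></div>
<p>no timestamp here</p>
</div>"""


def find_first_missing_timestamp(markup):
    """Return (messages_parsed_ok, sender, text) for the first message
    that has no timestamp, or None if every message has one."""
    current_sender = None
    current_timestamp = None
    parsed_ok = 0
    source = io.BytesIO(markup.encode("utf-8"))
    for pos, e in ET.iterparse(source, events=("start", "end")):
        tag, cls = e.tag, e.get("class")
        if pos == "start" and tag == "div" and cls == "message":
            # A new message header begins: forget the previous message's state.
            current_sender = None
            current_timestamp = None
        elif pos == "end" and tag == "span" and cls == "user":
            current_sender = e.text
        elif pos == "end" and tag == "span" and cls == "meta":
            current_timestamp = e.text
        elif pos == "end" and tag == "p":
            text = e.text.strip() if e.text else ""
            if current_timestamp is None:
                return parsed_ok, current_sender, text
            parsed_ok += 1
    return None


if __name__ == "__main__":
    print(find_first_missing_timestamp(SAMPLE))
```

The count in the result tells you how far into the file the problem message sits, so you can inspect that spot locally without sharing the file.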
Same problem here! It seems that Facebook changed the archive structure recently.
Now the messages.htm file is only a few KB, and all the threads are in separate .html files in a /messages/ folder! The whole parser is broken now.
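For anyone wanting to check which layout their own archive uses, here is a small sketch. It assumes only what is described above (per-thread .html files inside a messages/ folder under the archive root); the function name `list_thread_files` is made up for this example and is not part of fbcap.

```python
from pathlib import Path


def list_thread_files(archive_root):
    """Return the sorted per-thread .html files under <archive_root>/messages,
    or an empty list if that folder does not exist (older layout)."""
    messages_dir = Path(archive_root) / "messages"
    if not messages_dir.is_dir():
        return []
    return sorted(p for p in messages_dir.glob("*.html") if p.is_file())


if __name__ == "__main__":
    threads = list_thread_files(".")
    if threads:
        print(f"Newer layout: {len(threads)} thread file(s) in messages/")
    else:
        print("No messages/ folder with .html files found")
```

An empty result suggests the older single-file layout, where everything lives in messages.htm.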
Thanks for letting me know. I'll take a look soon.
@arnaudsm @ckshitij okay, should be fixed now. Please try the latest version: 1.1
@ownaginatious Thank you so much, it's working now. :)
I'm getting an error when parsing the messages.htm file. Related files are attached: parse_file.zip
fbcap ./messages.htm > fbMessages.txt
Traceback (most recent call last):
"An unrecoverable parsing error has occurred (missing timestamp data)"
fbchat_archive_parser.parser.FacebookDataError: An unrecoverable parsing error has occurred (missing timestamp data)