Open grasshoppermouse opened 1 week ago
I confirm the first problem, the text in the archive contains all the references to the accounts that are cited in the tweet and should be removed
The second problem is very strange, because the text in the encoded archive is explicitly decoded through an explicit function already present that I have just verified to work. Have you noticed if this problem is only in this tweet or in others? If so, maybe open another issue so we can separate the two problems
Thanks
I assume this is the reason for the truncation:
Original tweet:
Imported:
As an aside, it would be nice to unescape the html, e.g., \& -> &