For some reason, lines containing media files start with the unicode left-to-right character in chat exports for WhatsApp. Before this fix, those lines where simply appended to the previous messages. This also caused the wordcloud viz to show the authors names prominently.
This fix detects U-200E characters, strips them and reconstructs the message accordingly.
For some reason, lines containing media files start with the unicode left-to-right character in chat exports for WhatsApp. Before this fix, those lines where simply appended to the previous messages. This also caused the wordcloud viz to show the authors names prominently. This fix detects U-200E characters, strips them and reconstructs the message accordingly.