Closed armelsoubeiga closed 2 years ago
Thanks for bringing this up! I never heard back from @msulyok, so I don't know what caused the other bug. I just fixed your problem in 85563a4 though.
This chat completely broke my assumption on how people use WhatsApp! I had a rule in the code to re-evaluate the timestamps if more than 50% of the lines do not contain a valid time. However, more than half the messages in this chat span over multiple lines, which means they have no timestamp! I increased the threshold to 90%. Hope this doesn't cause trouble elsewhere.
You can install the development version and read in that chat without issues.
Hi @JBGruber
I tested with the development version and it's works fine now. I'm not having any problems elsewhere either. Thanks
Same error showing now:
Error in split.default(lookup$emoji, lookup$id) :
primeiro argumento deve ser um vetor
Além disso: Warning messages:
1: Unknown or uninitialised column: emoji
.
2: Unknown or uninitialised column: emoji
.
Happening to those two chats Arena Divulgacao.txt Mentoria Imperium.txt
Hmm. I can't reproduce your issue.
curl::curl_download("https://github.com/JBGruber/rwhatsapp/files/13863974/Arena.Divulgacao.txt", "hist.txt")
df <- rwhatsapp::rwa_read(x = "hist.txt")
df
#> # A tibble: 1,235 × 6
#> time author text source emoji emoji_name
#> <dttm> <fct> <chr> <chr> <list> <list>
#> 1 2023-09-09 13:08:12 📣 Arena Divulgação - Imp… "‎As … hist.… <NULL> <NULL>
#> 2 2023-09-09 13:08:12 ~ Matheus Mendonça "‎~ M… hist.… <NULL> <NULL>
#> 3 2023-10-24 20:08:07 📣 Arena Divulgação - Imp… "‎Voc… hist.… <NULL> <NULL>
#> 4 2023-10-25 10:27:19 ~ Aguinaldo Santos "‎~ E… hist.… <NULL> <NULL>
#> 5 2023-10-25 10:32:52 ~ Priscilla Oliveira "‎~ E… hist.… <NULL> <NULL>
#> 6 2023-10-25 12:17:52 ~ Morena "Cur… hist.… <chr> <chr [1]>
#> 7 2023-10-25 12:18:19 ~ Morena "Que… hist.… <chr> <chr [1]>
#> 8 2023-10-25 12:18:37 ~ Morena "‎Men… hist.… <NULL> <NULL>
#> 9 2023-10-25 12:28:54 ~ Caio Teixeira "‎<an… hist.… <NULL> <NULL>
#> 10 2023-10-25 12:29:27 ~ Caio Teixeira "Fei… hist.… <chr> <chr [4]>
#> # ℹ 1,225 more rows
curl::curl_download("https://github.com/JBGruber/rwhatsapp/files/13863975/Mentoria.Imperium.txt", "hist2.txt")
df2 <- rwhatsapp::rwa_read(x = "hist2.txt")
df2
#> # A tibble: 5,805 × 6
#> time author text source emoji emoji_name
#> <dttm> <fct> <chr> <chr> <list> <list>
#> 1 2022-08-24 11:35:46 Mentoria Imperium "‎As men… hist2… <NULL> <NULL>
#> 2 2022-08-24 11:35:46 ~ Murilo V Marques "‎~ Muri… hist2… <NULL> <NULL>
#> 3 2023-09-11 19:26:33 ~ Matheus Mendonça "‎~ Math… hist2… <NULL> <NULL>
#> 4 2023-09-11 19:36:00 ~ Dra. Luciane Sippert "‎figuri… hist2… <NULL> <NULL>
#> 5 2023-09-11 19:36:01 ~ Dra. Luciane Sippert "Olá, … hist2… <NULL> <NULL>
#> 6 2023-09-11 19:38:01 ~ Renata Calil "Bemmmm… hist2… <NULL> <NULL>
#> 7 2023-09-11 20:21:18 ~ Eliana Coco Psicóloga "‎imagem… hist2… <NULL> <NULL>
#> 8 2023-09-11 20:41:06 ~ Joyce Scoto Advogada "Hj não… hist2… <chr> <chr [2]>
#> 9 2023-09-11 21:39:42 ~ Clarice "Obriga… hist2… <chr> <chr [2]>
#> 10 2023-09-11 21:50:09 ~ Carmen Lydia de Marco "Parabé… hist2… <NULL> <NULL>
#> # ℹ 5,795 more rows
Created on 2024-01-08 with reprex v2.0.2
Can you maybe use reprex to show the error and session_info()
(see below).
Sorry, but Idk how to comment using those chunks like you do.
I managed to use reprex and get the preview in my clippboard but I couldn't bring it in that format, I also cound't find how to in the internet.
So I just pasted here:
curl::curl_download("https://github.com/JBGruber/rwhatsapp/files/13863974/Arena.Divulgacao.txt", "hist.txt") df <- rwhatsapp::rwa_read(x = "hist.txt")
emoji
.emoji
.Created on 2024-01-09 with reprex v2.0.2
Session info sessioninfo::session_info()
Hello @JBGruber
I have a problem when I try to read my whatsapp data. The problem has already been asked in issue #29 but without solutions.
This is the data : Chat - Stat-Inf_ Job&Scholarship.txt
Thanks