KTH-Library / kthcorpus

R package to support workflows related to the corpus of publications from KTH
https://kth-library.github.io/kthcorpus
GNU Affero General Public License v3.0

Double use of tidy_html cleaning of Notes can cause error in `read_xml.raw()` #169

Closed: mskyttner closed this issue 2 months ago

mskyttner commented 2 months ago

Backtrace:

  1. kthcorpus::kth_diva_checks()
 13. purrr::map_chr(Notes, tidy_html)
 14. purrr:::map_("character", .x, .f, ..., .progress = .progress)
 18. kthcorpus (local) .f(.x[[i]], ...)
 22. xml2:::read_html.default(charToRaw(x))
 26. xml2:::read_xml.raw(...)
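
The backtrace shows the error surfacing when `tidy_html` is mapped over `Notes` and hands each value to `xml2::read_html()`. The issue does not say which input triggers the failure. A minimal sketch of one possible guard, assuming the second cleaning pass receives empty or missing values left behind by the first: `safe_clean()` and `stand_in_cleaner()` are hypothetical names used only for illustration, and `stand_in_cleaner()` is a base R placeholder that fails on empty input, not the package's actual `tidy_html`.

```r
# Hypothetical guard (not part of kthcorpus): skip values that cannot be
# parsed, and fall back to the unchanged input if the cleaner still errors.
safe_clean <- function(x, cleaner) {
  if (is.na(x) || !nzchar(trimws(x))) return(NA_character_)
  tryCatch(cleaner(x), error = function(e) x)
}

# Placeholder for the real cleaner: strips tags and errors on empty input,
# imitating a parser that rejects an empty document.
stand_in_cleaner <- function(x) {
  if (!nzchar(x)) stop("empty input")
  gsub("<[^>]+>", "", x)
}

notes <- c("<p>First note</p>", "", NA)

# First pass, then a second pass over the already cleaned values.
once <- vapply(notes, safe_clean, character(1),
               cleaner = stand_in_cleaner, USE.NAMES = FALSE)
twice <- vapply(once, safe_clean, character(1),
                cleaner = stand_in_cleaner, USE.NAMES = FALSE)

# With the guard in place, repeating the cleaning changes nothing.
stopifnot(identical(once, twice))
```

With a guard of this kind, applying the cleaning step twice is harmless because empty and missing notes never reach the parser. Whether that matches the actual cause would need to be confirmed against the data that produced the backtrace.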