Open lkuiucsb opened 5 years ago
I knew about this bug, and thought it has something to do with the EML::set_TextType
function that we use under the hood. But that's not true according to this MRE:
library(EML)
person <- list(individualName = list(givenName = "Blizzard", surName = "Frosty"))
my_eml <- list(packageId = "id", system = "system",
dataset = list(
title = "A Mimimal Valid EML Dataset",
creator = person,
contact = person,
abstract = set_TextType("hyperlink_bug.docx")
))
eml_validate(my_eml)
using this minimal word file hyperlink_bug.docx
Just tried again, I can't reproduce this.
The problem seems not about invalid XML but that the hyperlinked content gets put into its own paragraph tags.
If the word document (e.g. abstract or method) has a hyperlink, the EML will be invalid. The code could either remove the hyperlink from the word document or change into other format that EML accepts.