From those two, we can see that the published time in JSON+LD and meta tags are actually the same except the former is in UTC+2 while the latter is in UTC+0.
So, for extraction result I think we should use 2020-12-24 since it's use UTC time instead of local time.
Hi @adbar, thanks for this awesome library.
While porting this library to Go, I noticed there are two Mediacloud tests that might be wrong:
For
baltimoresun
, its JSON+LD contains following snippet:From that snippet we can see its creation date is
2020-12-22
. Since we want the original date, I think we should use that one instead of2020-12-23
?For
elbalad.tv
, its JSON+LD contains following snippet:It also contains following meta tag:
From those two, we can see that the published time in JSON+LD and meta tags are actually the same except the former is in UTC+2 while the latter is in UTC+0.
So, for extraction result I think we should use
2020-12-24
since it's use UTC time instead of local time.