Open zackarno opened 2 months ago
So you mean search through the URL for dates in different formats? I think that is a nice idea to supplement the name. It might be worth exploring how often dates appear in the link, and whether they are ever already included in the name that we use from the JRC (so we don't duplicate them). I don't think it would be too difficult to scale.
I wonder if there is something in {lubridate} or related packages to extract dates of any format from within a string? Basically we would want to extract any potential dates, check that they are within a reasonable time frame of the date of the event, and if so, use them.
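As a rough sketch of the extract-then-check idea above: I'm not aware of a single {lubridate} function that pulls dates of any format out of free text, so this uses a base R regex to find candidates and `lubridate::ymd()` / `lubridate::dmy()` to parse them. The helper name `extract_candidate_dates()`, the 30-day window, and the day-first reading of dates like 07/9/2024 are all assumptions for illustration, not something we have agreed on.

```r
library(lubridate)

# Hypothetical helper: find numeric dates in a string (link text or URL) and
# keep only those within `window_days` of the event date.
extract_candidate_dates <- function(x, event_date, window_days = 30) {
  # Two shapes only: year-first (2024-09-07) and year-last (07/9/2024).
  pattern <- paste0(
    "\\b\\d{4}[-/.]\\d{1,2}[-/.]\\d{1,2}\\b",
    "|\\b\\d{1,2}[-/.]\\d{1,2}[-/.]\\d{4}\\b"
  )
  hits <- regmatches(x, gregexpr(pattern, x, perl = TRUE))[[1]]
  if (length(hits) == 0) return(as.Date(character(0)))

  year_first <- grepl("^\\d{4}", hits)
  parsed <- rep(as.Date(NA), length(hits))
  if (any(year_first)) {
    parsed[year_first] <- ymd(hits[year_first], quiet = TRUE)
  }
  if (any(!year_first)) {
    # Assumption: year-last dates are day/month/year, not month/day/year.
    parsed[!year_first] <- dmy(hits[!year_first], quiet = TRUE)
  }

  # Drop anything that failed to parse, then apply the plausibility window.
  parsed <- parsed[!is.na(parsed)]
  days_off <- abs(as.numeric(parsed - as.Date(event_date)))
  unique(parsed[days_off <= window_days])
}

# Made-up input: two spellings of the same date plus one implausible date.
extract_candidate_dates(
  "https://example.org/reports/2024-09-07/update (07/9/2024, archived 01/01/1999)",
  event_date = "2024-09-08"
)
```

Dates written with month names, or without separators, would need extra patterns, which is probably where most of the exploration effort would go.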
Not sure how easy or hard this would be, but the screenshot below shows the "Further Information" section from a recent Viet Nam email:
In this example it would be useful (and easyish?) to add the date of the report to the link text (i.e. Viet Nam Disaster Management (VDMA): 07/9/2024). However, I'm not sure whether it would be straightforward to implement in a generalizable/scalable fashion?
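For the link-text part on its own, a minimal sketch might look like the following. `add_date_to_label()` is a hypothetical helper, the `dd/mm/yyyy` output format is an assumption, and the report date is assumed to be known already (for example from the email or the URL). It leaves the label untouched when it already contains something that looks like a numeric date, to avoid duplicating it.

```r
# Hypothetical helper: append a report date to link text unless the text
# already contains a numeric date.
add_date_to_label <- function(label, report_date) {
  stamp <- format(as.Date(report_date), "%d/%m/%Y")
  # Loose check for an existing date such as 07/9/2024 or 2024-09-07.
  has_date <- grepl("\\d{1,4}[-/.]\\d{1,2}[-/.]\\d{1,4}", label)
  ifelse(has_date, label, paste0(label, ": ", stamp))
}

# Illustrative date only, reading 07/9/2024 as day-first.
add_date_to_label("Viet Nam Disaster Management (VDMA)", "2024-09-07")
```

Because `grepl()`, `paste0()` and `ifelse()` are vectorised, the same call should work on a whole column of link names.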