cnorthwood / ternip

Temporal Expression Recognition and Normalisation in Python

TIMEX3 strings need hyphens #19

Open leondz opened 12 years ago

leondz commented 12 years ago

Dates should be formatted with hyphens separating years and months/weeks from their sub-parts.

e.g.:

199               =>  199
1993              =>  1993
199307            =>  1993-07
BC0045            =>  BC0045
BC004508          =>  BC0045-08
BC00450829T16:00  =>  BC0045-08-29T16:00
200401            =>  2004-01
20040101          =>  2004-01-01
20040101TNI       =>  2004-01-01TNI
20040101T1802     =>  2004-01-01T1802
200X04            =>  200X-04
2003W32           =>  2003-W32
2007W325          =>  2007-W32-5
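
The mapping in the examples above could be sketched roughly as follows. This is only an illustration, not TERNIP's code: the function name `hyphenate_timex_value` and the regular expression are hypothetical, and it assumes a value is an optional `BC` prefix, four year characters (digits or `X`), then either month and day or `W` plus week and weekday, then an optional `T...` time part that is left untouched. Anything that does not fit that shape is returned unchanged.

```python
import re

# Hypothetical pattern, inferred only from the examples in this issue.
_COMPACT_DATE = re.compile(
    r"(?P<year>(?:BC)?[0-9X]{4})"                     # e.g. 1993, 200X, BC0045
    r"(?:(?P<month>[0-9X]{2})(?P<day>[0-9X]{2})?"     # MM, optionally DD
    r"|W(?P<week>[0-9X]{2})(?P<wday>[0-9X])?)?"       # or Www, optionally D
    r"(?P<rest>T.*)?"                                 # time part, kept as-is
)


def hyphenate_timex_value(value):
    """Insert hyphens between the date components of a compact value.

    Values that do not match the assumed shape (for example "199" or an
    already hyphenated "1993-07") are returned unchanged.
    """
    m = _COMPACT_DATE.fullmatch(value)
    if m is None:
        return value

    parts = [m.group("year")]
    if m.group("month"):
        parts.append(m.group("month"))
        if m.group("day"):
            parts.append(m.group("day"))
    elif m.group("week"):
        parts.append("W" + m.group("week"))
        if m.group("wday"):
            parts.append(m.group("wday"))

    return "-".join(parts) + (m.group("rest") or "")


if __name__ == "__main__":
    for raw in ("199307", "BC00450829T16:00", "20040101TNI", "2007W325"):
        print(raw, "=>", hyphenate_timex_value(raw))
```

Because unmatched input falls through untouched, running the function on an already hyphenated value is a no-op, so it would be safe to apply as a final pass over generated values.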

cnorthwood commented 12 years ago

According to ISO 8601, both formats are valid, unless TimeML has since been updated to restrict itself to a subset of ISO 8601 :) (which would make things better)