Closed ronaldtse closed 5 years ago
This also applies to ITU, geolexica.com has:
itu_r_rec.tf.686_1.xml
and itu_r_rec.tf.686_1_.xml
@ronaldtse I'll fix the issue with the underscores.
Concerning using a document identifier. We can't use a document identifier as a filename. The problem is we don't have a document identifier when fetching a document. We have a reference only. Before we retrieve the document from the internet we should check if there is a file with a name matched to the reference.
One document could be fetched by many different references. For example "ISO 123 2014" returns a document and we save it to iso_123_2014
. If we search by "ISO 124" reference we will get the same document if it is the lastest with the number. But we don't know it before we fetch the document.
There are other cases like all parts
, attempts ISO/IEC and stages, when one document could be fetched by different references. If we save these documents using their document identifiers, how do we find the files by references?
This issue is related to relaton
gem. Move it there.
This is now superseded by https://github.com/metanorma/relaton/issues/53
In geolexica.org , both of these files are created in the cache:
iso_12620_1999.xml
iso_12620_1999_.xml
Both files have the same content. Obviously when Relaton creates the cache file, it must be using the normalized document identifier, not what was directly given. e.g. given "ISO 12620:1999 - blah", the normalized filename should still be
iso_12620_1999.xml
.There are 7 of these in geolexica.org:
iso_12620_1999.xml
andiso_12620_1999_.xml
iso_31_2_1992.xml
andiso_31_2_1992_.xml
iso_8601_2000.xml
andiso_8601_2000_.xml
iso_iec_11179_3_2003.xml
andiso_iec_11179_3_2003_.xml
iso_22028_1.xml
andiso_22028_1_.xml
iso_iec_19501_2005.xml
andiso_iec_19501_2005_.xml
iso_iec_tr_14252.xml
andiso_iec_tr_14252_.xml