relaton / relaton-iso

RelatonIso: ISO Standards metadata using the BibliographicItem model
BSD 2-Clause "Simplified" License
2 stars 1 forks source link

Fetching ISO vs ISO-joint-published standards #37

Closed ronaldtse closed 5 years ago

ronaldtse commented 5 years ago

Fetching ISO 2382 vs ISO/IEC 2382 results in a difference on what is returned.

From @opoudjis :

These are joint ISO/IEC standards. Referencing ISO 2382 instead of ISO/IEC 2382 is going to result in a mismatch with what has been fetched, and Relaton (correctly) is refusing to process it further.

@andrew2net could you help fix this (and spec it)? Thanks!

opoudjis commented 5 years ago

What is there to fix though? It is correct that ISO/IEC 2382 works and ISO 2382 does not. You should be providing the exact document identifier for fetching that ISO itself uses, not a truncation of it.

ronaldtse commented 5 years ago

Well, what you said is true but can often cause confusion. For example, people often refer to "ISO/IEC 27001" as "ISO 27001", and we should not bail on such since we have the information to recover from this. At most we should return a "warning". Right?

opoudjis commented 5 years ago

That's assuming you can trust the user to mean "ISO/IEC 27001" when they say "ISO 27001". Is it guaranteed that there will never be an ISO/X nnnn standard and an ISO/X nnnn standard with the same number? If so, as long as ISO is the first named body in the list of standards bodies.... I guess it's OK.

ronaldtse commented 5 years ago

Yes, the ISO project number for joint publishing will not be reused.

If there is an ISO/IEC 27000, then:

andrew2net commented 5 years ago

@ronaldtse @opoudjis Could you explain how we can get what user means ISO/IEC 2382 when he refers to ISO 2382?

ronaldtse commented 5 years ago

@andrew2net at ISO, a project number is only assigned once. For example, 2382 can be assigned to ISO 2382, or ISO/IEC 2382 or ISO/IEEE 2382 or ISO/IEC/IEEE 2382.

In this case, 2382 is assigned to ISO/IEC 2382. This means that ISO 2382, ISO/IEEE 2382 and ISO/IEC/IEEE 2382 all don't exist.

So if user searches for ISO 2382 and the result gives ISO/IEC 2382, it means the correct ID is ISO/IEC 2382.

andrew2net commented 5 years ago

@opoudjis so we should try ISO 2382 first. If no documents except ISO 2382-* found then we should try ISO/IEC 2382. Right?

opoudjis commented 5 years ago

If the prefix is ISO/IEC 2382, 2382-1 will have same prefix. But yes, if a search for ISO 2382 fails, try ISO/IEC 2382. I guess the hope is that a search for ISO 2382 will retrieve ISO/IEC 2382, and that we will do a prefix match so that that results will not be rejected.

andrew2net commented 5 years ago

@opoudjis I'm trying to understand when the search fails. If I search for ISO 2382 it returns me documents:

ISO 2382-7:1977 Data processing -- Vocabulary -- Section 07 : Digital computer programming [Withdrawn]
ISO 2382-5:1974 Data processing -- Vocabulary -- Section 05 : Representation of data [Withdrawn]
...

How can I determine that the search failed?

opoudjis commented 5 years ago

ISO 2382 should not match part documents. Moreover, without a year indication, it should not match withdrawn documents: the latest available document should always be active.

ronaldtse commented 5 years ago

Agree with @opoudjis, the search should not match part documents (ISO 2382-X).

andrew2net commented 5 years ago

@opoudjis done it. Please check if all ok.

opoudjis commented 5 years ago

A-OK