Closed ronaldtse closed 2 years ago
Here's the mapping. There are entries that are marked "UNKNOWN" or "NOT AVAILABLE IN DATASET".
Warning that some of the "NOT AVAILABLE IN DATASET" entries are due to https://github.com/relaton/relaton-ieee/issues/16, so they may become available once we are able to parse all IEEE entries.
This is the new set of mappings from bibxml6 filenames to IEEE dataset's title
attribute. Only the following items are missing.
These are the document identifiers that we cannot find in the IEEE dataset: not available on ieee.org and cannot be found anywhere (even when relaton/relaton-ieee#16 is complete). We need to find a way to resolve these. The last two possibly point to ISO co-published copies, but IEEE's dataset doesn't contain them.
reference.IEEE.802-1D.1991.xml: UNKNOWN; ONLY INTERNET SOURCE IS bibxml2
reference.IEEE.802-1D.1993.xml: UNKNOWN; ONLY INTERNET SOURCE IS bibxml2
reference.IEEE.802-1Y.1990.xml: UNKNOWN; there is a 802.1Y but is a completely different document.
reference.IEEE.P802-1A.1989.xml: UNKNOWN, ONLY INTERNET SOURCE IS bibxml2
reference.IEEE.P8021A.1989.xml: UNKNOWN, ONLY INTERNET SOURCE IS bibxml2
reference.IEEE.802-3.1990.xml: DOES NOT EXIST IN IEEE DATASET - ISO/IEC 8802-3:1990
reference.IEEE.802-3.1996.xml: DOES NOT EXIST IN IEEE DATASET - ISO/IEC 8802-3:1996
I've investigated these last 7 entries:
reference.IEEE.802-1D.1991.xml
: Possibly IEEE Std 802.1D-1990
?reference.IEEE.802-1D.1993.xml
: There is IEEE Std 802.1D-1990
, ANSI/IEEE Std 802.1D, 1998 Edition
, and IEEE Std 802.1D-2004 (Revision of IEEE Std 802.1D-1998)
but no 1993.reference.IEEE.802-1Y.1990.xml
: There is a textual reference to P802.1y
in the dataset from the entry of IEEE Std 802.1D-2004 (Revision of IEEE Std 802.1D-1998)
, but it is not available as an entry in the IEEE dataset or on the IEEE website. The only reference to this is the bibxml2/bibxml6 datasets.reference.IEEE.P802-1A.1989.xml
: This reference cannot be found in the IEEE dataset or the IEEE website. The only reference to this is the bibxml2/bibxml6 datasets.reference.IEEE.P8021A.1989.xml
: possibly a variation of the above entry.reference.IEEE.802-3.1990.xml
: ISO/IEC 8802-3:1990 - ANSI/IEEE Std 802.3-1990 Edition
existed but not in the IEEE dataset, and cannot be found on the IEEE website. There are textual references to this document in the IEEE dataset but it does not exist as an entry.reference.IEEE.802-3.1996.xml
: This document only exists as ISO/IEC 8802-3:1996
on ISO's site. In the IEEE dataset, there is only IEEE Std 802.3, 1998 Edition
.I have also verified that all bibxml2/reference.IEEE.*
entries exist in bibxml6/
.
The recommendations are:
reference.IEEE.802-1D.1991.xml
, reference.IEEE.802-1D.1993.xml
correct to 1990.reference.IEEE.802-3.1990.xml
, reference.IEEE.802-3.1996.xml
change to the corresponding IEEE published entry, not the ISO version.bibxml-misc
@rjsparks we will need guidance on next steps here. Thanks!
(Once this task is done we can remove all bibxml2/reference.IEEE.*
files from the bibxml-data-archive repo.)
@TonyLHansen Comments on the choice in the last bullet above?
I do not think we should deviate away from the IEEE dataset. This will keep it simple when IEEE releases updates.
Any supplemental entries can be put in bibxml-misc.
Here's the mapping. There are entries that are marked "UNKNOWN" or "NOT AVAILABLE IN DATASET".
These mappings seem to reference many document identifiers not present in IEEE bibliography source🤔
and others.
I’m not sure whether it has to do with wrong identifier format, or documents are missing altogether.
Answered my own question.
Mappings for bibxml6
were added to bibxml-data-archive in https://github.com/ietf-ribose/bibxml-data-archive/commit/c6b1281a13add3bc32ed229847f2162b737af3ef. Most reference nonexistent docids and so will fall back to xml2rfc archive data.
These mappings seem to reference many document identifiers not present in IEEE bibliography source🤔
- IEEE Std 1062, 1998 Edition
- IEEE Std 1061-1998
- IEEE Std 1012-2012 (Revision of IEEE Std 1012-2004) - Redline
- IEEE Std 1003.1, 2013 Edition (incorporates IEEE Std 1003.1-2008, and IEEE Std 1003.1-2008/Cor 1-2013)
and others.
I’m not sure whether it has to do with wrong identifier format, or documents are missing altogether.
We actually need to fix the relaton-data-ieee document identifiers to show the correct format. It is time to integrate relaton-ieee with pubid-ieee.
I’m happy if we do that, but I’m not sure if this will address the invalid mappings in short term…
This task will fix https://github.com/ietf-ribose/bibxml-service/issues/31