google / emoji4unicode

Automatically exported from code.google.com/p/emoji4unicode
Apache License 2.0

Update ARIB cross-reference per changes from April 2009 WG2 meeting #165

Closed GoogleCodeExporter closed 9 years ago

GoogleCodeExporter commented 9 years ago
At the April 2009 WG2 meeting, code points for several ARIB symbols were changed:
26CE --> 2757 HEAVY EXCLAMATION MARK SYMBOL
26E2 --> 2B55 HEAVY LARGE CIRCLE
26E4 --> 2B56 HEAVY OVAL WITH OVAL INSIDE
26E5 --> 2B57 HEAVY CIRCLE WITH CIRCLE INSIDE
26E6 --> 2B58 HEAVY CIRCLE
26E7 --> 2B59 HEAVY CIRCLED SALTIRE

The emoji cross-reference to ARIB characters needs to be updated to reflect these changes. (Perhaps all the ARIB cross-references need to be checked to make sure they reflect the latest ARIB code points per WG2/N3625, L2/09-172.)
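
A minimal sketch of how such a check could be automated, assuming the cross-references are available as (emoji ID, code point) pairs. The table is copied from the list above; the function name and the input shape are hypothetical and not taken from the emoji4unicode code.

```python
# Old -> new code points, copied from the list of April 2009 WG2 changes above.
ARIB_MOVES = {
    0x26CE: 0x2757,  # HEAVY EXCLAMATION MARK SYMBOL
    0x26E2: 0x2B55,  # HEAVY LARGE CIRCLE
    0x26E4: 0x2B56,  # HEAVY OVAL WITH OVAL INSIDE
    0x26E5: 0x2B57,  # HEAVY CIRCLE WITH CIRCLE INSIDE
    0x26E6: 0x2B58,  # HEAVY CIRCLE
    0x26E7: 0x2B59,  # HEAVY CIRCLED SALTIRE
}


def stale_xrefs(xrefs):
    """Return (emoji_id, old_cp, new_cp) for each cross-reference
    that still points at a code point that has since moved.

    `xrefs` is an iterable of (emoji_id, code_point) pairs; this input
    shape is an assumption made for illustration.
    """
    return [(eid, cp, ARIB_MOVES[cp]) for eid, cp in xrefs if cp in ARIB_MOVES]


def format_cp(cp):
    """Format an integer code point in the usual U+XXXX notation."""
    return "U+%04X" % cp
```

Any pair reported by `stale_xrefs` would need its cross-reference changed to the new code point.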

Original issue reported on code.google.com by pedb...@apple.com on 15 May 2009 at 9:55

GoogleCodeExporter commented 9 years ago
Actually there are several other changes from April 2009 WG2:

2. 32FF --> 1F213 SQUARED KATAKANA DE

3. Reordering of ARIB broadcasting and baseball symbols, 1F210..1F231 and 1F240..1F248. I think this affects a few cross-references.

Original comment by pedb...@apple.com on 15 May 2009 at 10:02

GoogleCodeExporter commented 9 years ago
The summary of changes above is basically from L2/09-177 <http://www.unicode.org/L2/L2009/09177-wg2consent.txt>. Also see the disposition of comments on Amd. 6 FPDAM, WG2 N3631 <http://std.dkuug.dk/jtc1/sc2/wg2/docs/n3631.pdf>. I will try to add the specifics for each affected character.

Original comment by pedb...@apple.com on 29 Jul 2009 at 8:41

GoogleCodeExporter commented 9 years ago
I made the changes as best I could, in r170. Peter, please review with a fine-tooth comb and check whether I got all of the ARIB code point changes, and whether I got them right.

Original comment by markus.icu on 29 Jul 2009 at 9:20

GoogleCodeExporter commented 9 years ago
Well, I had gone through the 45 ARIB cross-references in <http://www.unicode.org/~scherer/emoji4unicode/snapshot/full.html> and these are the ones that needed updating:

e-4B7, noted as not unified with ARIB-9129 = U+1F157 (new in 5.2): the name of U+1F157 has been updated to NEGATIVE CIRCLED LATIN CAPITAL LETTER H.

e-7F6, noted as unified with ARIB-9016 = U+1F17F (new in 5.2): the name of U+1F17F has been updated to NEGATIVE SQUARED LATIN CAPITAL LETTER P.

e-B3A, noted as unified with ARIB-9069 = U+1F22A (new in 5.2), SQUARED CJK UNIFIED IDEOGRAPH-7121: the code point for this has been updated to U+1F21A.

e-B40, noted as unified with ARIB-9336 = U+1F224 (new in 5.2), SQUARED CJK UNIFIED IDEOGRAPH-6307: the code point for this has been updated to U+1F22F.

e-B44, noted as unified with ARIB-9032 = U+26E2 (new in 5.2), HEAVY LARGE CIRCLE: the code point for this has been updated to U+2B55.

e-B63, noted as not unified with ARIB-9104 = U+26E6 (new in 5.2), HEAVY CIRCLE: the code point for HEAVY CIRCLE has been updated to U+2B58.

I will check these against r170.

Original comment by pedb...@apple.com on 29 Jul 2009 at 10:18

GoogleCodeExporter commented 9 years ago
OK, looking at r170:

1. arib.ucm is missing one difference: 1F14F -> 1F190, SQUARE DJ 

2. In emoji4unicode.xml I did not see the changes for e-4B7 and e-B63 noted above (perhaps I missed them). It also had changes for e-B2E (1F231->1F2FF) and e-B2F (add ->1F232) that are not part of the ARIB changes per se.
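
As a rough illustration of how a missed difference like the one in item 1 could be caught mechanically, here is a sketch that scans mapping-file text for code points that have moved. It assumes arib.ucm uses the usual ICU .ucm mapping-line shape (`<UXXXX> \xHH... |n`); the function name and the `moved` table are hypothetical.

```python
import re

# Matches the start of an ICU-style .ucm mapping line, e.g. "<U1F14F> \x7A\x21 |0".
# That arib.ucm follows this shape is an assumption.
UCM_MAPPING = re.compile(r"^<U([0-9A-Fa-f]{4,6})>\s+(\S+)")


def stale_ucm_lines(ucm_text, moved):
    """Return (line_number, old_cp, new_cp) for each mapping line whose
    Unicode code point appears as a key in `moved` (old -> new)."""
    hits = []
    for lineno, line in enumerate(ucm_text.splitlines(), start=1):
        match = UCM_MAPPING.match(line.strip())
        if not match:
            continue  # header, comment, or other non-mapping line
        cp = int(match.group(1), 16)
        if cp in moved:
            hits.append((lineno, cp, moved[cp]))
    return hits
```

Called with `moved = {0x1F14F: 0x1F190}`, it would report any line that still maps the old SQUARE DJ code point.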

I did not see any other issues, but I am not familiar with the usage of all of these files, and UnicodeData.txt was too large for diffs to be shown ("Files are too large to display a diff.").
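
When the web viewer cannot show a diff, two versions of UnicodeData.txt can be compared locally by code point. The sketch below relies only on the file's semicolon-separated layout (code point in the first field, name in the second); the function names are hypothetical, and range entries (`<..., First>` / `<..., Last>`) are treated like ordinary lines.

```python
def parse_unicode_data(text):
    """Map code point -> character name from UnicodeData.txt-style text."""
    names = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and any header comment
        fields = line.split(";")
        if len(fields) < 2:
            continue  # not a data line
        names[int(fields[0], 16)] = fields[1]
    return names


def diff_unicode_data(old_text, new_text):
    """Return (added, removed, renamed) as sorted lists of code points."""
    old = parse_unicode_data(old_text)
    new = parse_unicode_data(new_text)
    added = sorted(set(new) - set(old))
    removed = sorted(set(old) - set(new))
    renamed = sorted(cp for cp in set(old) & set(new) if old[cp] != new[cp])
    return added, removed, renamed
```

A character that moved shows up once under `removed` (old code point) and once under `added` (new code point), which makes it easy to compare against the list of WG2 changes.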

Original comment by pedb...@apple.com on 29 Jul 2009 at 10:52

GoogleCodeExporter commented 9 years ago
> 1. arib.ucm is missing one difference: 1F14F -> 1F190, SQUARE DJ 

Fixed in r172 and r173. Thanks!

> 2. In emoji4unicode.xml I did not see the changes for
> e-4B7 and e-B63 noted above (perhaps I missed them).

First, I had not yet updated the files posted on unicode.org. Those were all from April. (Sorry!) They are up to date now, with all changes for this issue.

e-4B7: I don't see the name of U+1F157 in the data.
e-B63: Done in r172 and r173. Also updated xref on e-B44.

> It also had changes for e-B2E (1F231->1F2FF)
> and e-B2F (add ->1F232) that are not part of the ARIB 
> changes per se.

Right. These are just proposed code points. I had to change them because of a conflict with new Unicode 5.2 data. They will change again once I manage to suck in the new code points and name changes from Michel's AMD8 NamesList. Actually, e-B2F did not effectively change because my code iterates through proposed code points if an <e> does not have a unicode attribute.
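
For readers unfamiliar with that behavior, here is a rough sketch of the idea, not the actual emoji4unicode code: any <e> element without a unicode attribute gets the next free proposed code point. The `id` attribute, the starting code point, and the `taken` set are assumptions made for illustration.

```python
import xml.etree.ElementTree as ET


def assign_proposed(xml_text, first_proposed, taken=()):
    """Return {element id: code point string} for every <e> element.

    Elements with a `unicode` attribute keep it as-is. Elements without
    one receive the next proposed code point, counting up from
    `first_proposed` and skipping anything listed in `taken`.
    """
    root = ET.fromstring(xml_text)
    used = set(taken)
    next_cp = first_proposed
    result = {}
    for e in root.iter("e"):
        explicit = e.get("unicode")
        if explicit is not None:
            result[e.get("id")] = explicit
            continue
        while next_cp in used:
            next_cp += 1  # skip code points that are already occupied
        result[e.get("id")] = "%04X" % next_cp
        used.add(next_cp)
        next_cp += 1
    return result
```

Under this scheme, an element with no explicit attribute ends up with the same code point whether or not one is written out, as long as the iteration order and the set of occupied code points stay the same.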

> ... UnicodeData.txt was too large for diffs to be shown

Right. The old version was handmade, with all of the AMD6 characters pasted at the end of a Unicode 5.2 file that didn't already have AMD6 data. The new version is the latest Unicode 5.2 data file, unchanged except for the header comment.

Original comment by markus.icu on 30 Jul 2009 at 9:20

GoogleCodeExporter commented 9 years ago
> e-4B7: I don't see the name of U+1F157 in the data.

Uh, right. I guess my comment about e-4B7 should have been more like "if you were to add the name for the xref, the name has been changed and is now XXX".

Original comment by pedb...@apple.com on 30 Jul 2009 at 9:54

GoogleCodeExporter commented 9 years ago

Original comment by pedb...@apple.com on 31 Jul 2009 at 6:38