As originally defined, the line break class AIcontainedallcharacters with East_Asian_Width value A (ambiguous width) that would otherwise beALin this classification. For more information on East_Asian_Width and how to resolve it, see Unicode Standard Annex #11,East Asian Width[UAX11].
The original definition included many Latin, Greek, and Cyrillic characters. These characters are now classified by default asALbecause use of theALline breaking class better corresponds to modern practice. Where strict compatibility with older legacy implementations is desired, some of these characters need to be treated asIDin certain contexts. This can be done by always tailoring them toIDor by continuing to classify them asAIand resolving them toIDwhere required.
As part of the same revision, the set of ambiguous characters has been extended to completely encompass the enclosed alphanumeric characters used for numbering of bullets.
As updated, theAIline breaking class includes all characters with East Asian Width A that are outside the range U+0000..U+1FFF, plus the following characters:
24EA
CIRCLED DIGIT ZERO
2780..2793
DINGBAT CIRCLED SANS-SERIF DIGIT ONE..DINGBAT NEGATIVE CIRCLED SANS-SERIF NUMBER TEN
UAX 14: