Composition dictionary should be changed [Bug 22059]

Comment #1 from https://www.w3.org/Bugs/Public/show_bug.cgi?id=22059 @TakayoshiKochi

Takayoshi Kochi 2013-07-04 03:35:40 EDT

As suggested by James Su, I'd like to incorporate composition dictionary within InputMethodContext.

It would look like:

    interface InputMethodContext {
      ...
      readonly attribute DOMString text;
      readonly attribute long selectionStart;
      readonly attribute long selectionEnd;
      readonly attribute Uint32Array segments;
      ....
    }

where selectionStart/End means identical to that for <input>/<textarea>, and added segments information for dividing the text into clauses.

AFBarstow commented 9 years ago

Comment #2 https://www.w3.org/Bugs/Public/show_bug.cgi?id=22059#c2 @travisleithead

Travis Leithead [MSFT] 2013-08-20 14:21:57 EDT

(In reply to comment #1)

> As suggested by James Su, I'd like to incorporate composition dictionary within InputMethodContext.
>
> It would look like: interface InputMethodContext { ... readonly attribute DOMString text;

The interface is labelled "InputMethodContext" and so "text" is a little ambiguous in my opinion. I liked "compositionText" better, but I could be OK with this.

> readonly attribute long selectionStart;
> readonly attribute long selectionEnd;

Selection & composition are two completely different underlying concepts that shouldn’t be combined. I think calling these "selection.." is confusing with normal text selection. The currently selected text will already be available via the input and textarea's selection properties--no need to duplicate the functionality. Offset (in the MS proposal) makes it clear that it’s character positions and not DOM nodes. These offset character positions mark the actual "active" composition range (which may be different from what is currently selected). Maybe for brevity: "startOffset"/ "endOffset"? or "textContentStart"/"textContentEnd"?

> readonly attribute Uint32Array segments;

OK. This is not relevant to all IMEs though. I suppose we could implement this for other IMEs by always returning only 1 segment.

> where selectionStart/End means identical to that for <input>/<textarea>, and added segments information for dividing the text into clauses.

No need for the redundancy. What we found is that we actually needed the "active" composition offsets, not the selected text which varies depending on the state of the IME. See above.

AFBarstow commented 9 years ago

Comment 3 https://www.w3.org/Bugs/Public/show_bug.cgi?id=22059#c3 @TakayoshiKochi

Takayoshi Kochi 2013-10-02 01:42:00 EDT

Sorry for my belated response.

(In reply to Travis Leithead [MSFT] from comment #2)

> (In reply to comment #1)
>
> > readonly attribute long selectionStart;
> > readonly attribute long selectionEnd;
>
> Selection & composition are two completely different underlying concepts that shouldn’t be combined. I think calling these "selection.." is confusing with normal text selection. The currently selected text will already be available via the input and textarea's selection properties--no need to duplicate the functionality. Offset (in the MS proposal) makes it clear that it’s character positions and not DOM nodes. These offset character positions mark the actual "active" composition range (which may be different from what is currently selected). Maybe for brevity: "startOffset"/ "endOffset"? or "textContentStart"/"textContentEnd"?

I agree this is a fair argument.

I don't have strong preference of any of these,
1 startOffset / endOffset
2 textContentStart / textContentEnd
3 activeSegmentStart / activeSegmentEnd
4 activeSegmentStartOffset / activeSegmentEndOffset
5 etc. etc.

but 1 is too simple and maybe confusing, 2 may be also confusing against DOM node's textContent. How about 3?

> > readonly attribute Uint32Array segments;
>
> OK. This is not relevant to all IMEs though. I suppose we could implement this for other IMEs by always returning only 1 segment.

(FYI now it's spec'ed as "sequence<unsigned long> getSegments();" https://dvcs.w3.org/hg/ime-api/raw-file/default/Overview.html#widl-Composition-getSegments-sequence-unsigned-long )

For non-segmenting IMEs (most non-Japanese IMEs) return just one '0' element.

> > where selectionStart/End means identical to that for <input>/<textarea>, and added segments information for dividing the text into clauses.
>
> No need for the redundancy. What we found is that we actually needed the "active" composition offsets, not the selected text which varies depending on the state of the IME. See above.

See above, too ;)

AFBarstow commented 9 years ago

Comments 4 through 16

= Comment 16 Takayoshi Kochi 2014-04-08 01:22:20 EDT

Reopening this.

= Comment 15 Takayoshi Kochi 2014-01-27 00:24:27 EST

Okay, thanks for the comment. I'll work on updating the spec accordingly.

= Comment 14 Jianfeng Lin 2014-01-21 20:15:52 EST

We use offset because the key scenario we were trying to tackle is the search suggestion in <input type="text">, in which case it has to be an offset within the element's textContent. For contentEditable a range object could be more useful and the API could support both offset and range there.

= Comment 13 Takayoshi Kochi 2013-12-13 03:06:08 EST

I would like to make clarification - The original proposal[1] says:

> on an element with the contentEditable flag set, then this is the starting offset relative to the target's textContent property (textContent is a linear view of all the text under an element)

But the current MSDN document[2](as of today, Dec. 13, 2013) doesn't mention about behavior when compositionStartOffset/End used in contenteditable.

The way that a browser generates textContent from DOM tree and that a browser holds where an IME composition are not usually compatible - is there really a use case to get offsets within contenteditable?

I personally suppose for contenteditable it is reasonable to return Range's before and after IME composition within contenteditable (to different attributes, of course) - but am not sure yet.

What do you think?

[1] https://dvcs.w3.org/hg/ime-api/raw-file/tip/proposals/IMEProposal.html#widl-InputMethodContext-compositionStartOffset
[2] http://msdn.microsoft.com/en-us/library/ie/dn433247(v=vs.85).aspx

= Comment 12 Jianfeng Lin 2013-12-02 21:40:10 EST

Closing the bug as we agree with having composition{Start,End}Offset directly under InputMethodContext interface and moving active segment to a separate document.

= Comment 11 Takayoshi Kochi 2013-12-02 21:09:17 EST

The document has been moved: https://dvcs.w3.org/hg/ime-api/raw-file/default/Annex.html

See example 2.

= Comment 10 Takayoshi Kochi 2013-11-07 04:49:41 EST

(In reply to Takayoshi Kochi from comment #9)

> For active segments, it will be used for rendering composition by webapps, not browsers.

See example 2 of the spec. https://dvcs.w3.org/hg/ime-api/raw-file/default/Overview.html

= Comment 9 Takayoshi Kochi 2013-11-06 22:38:51 EST

It is because composition{Start,End}Offset are relative to its parent's "value" and external to the composition itself.

For active segments, it will be used for rendering composition by webapps, not browsers.

= Comment 8 Jianfeng Lin 2013-11-06 20:03:53 EST

Thanks for accepting the proposal, Takayoshi. I saw that you put it right under InputMethodContext interface. Why not under the "composition" attribute of that interface? Since this is information about the composition, it makes more sense to be inside the composition attribute, and there you could simplify the name to be "startOffset/endOffset", so developers can reference them by element.inputMethodContext.composition.startOffset.

I'm still curious about the use cases for active segments.

= Comment 7 Takayoshi Kochi 2013-11-05 23:52:29 EST

As composition{Start,End}Offset added in the spec, closing this.

For hasComposition()/compositionText, see https://www.w3.org/Bugs/Public/show_bug.cgi?id=22028

= Comment 6 Takayoshi Kochi 2013-11-05 23:29:42 EST

Thanks Jianfeng for clarification.

Added compositionStartOffset/compositionEndOffset. https://dvcs.w3.org/hg/ime-api/raw-file/8c061ee19f99/Overview.html#widl-InputMethodContext-compositionStartOffset

= Comment 5 Jianfeng Lin 2013-10-21 21:31:17 EDT

Takayoshi, the compositionStartOffset / compositionEndOffset we proposed is different from activeSegmentStartOffset / activeSegmentEndOffset you suggested, so please don't replace them. For example when the user types "honnwoyomu" in Japanese IME and hits space, the whole sentence will be in composition while only the first part "本を" will be the active segment you mentioned. So the text in between compositionStartOffset and compositionEndOffset should be "本を読む" while the text in between activeSegmentStartOffset and activeSegmentEndOffset should be "本を". We are not against exposing the information about where the active segment is, but exposing the position of the composition is more important.

= Comment 4 Takayoshi Kochi 2013-10-10 04:11:13 EDT

changed to activeSegmentStart/End https://dvcs.w3.org/hg/ime-api/rev/10a3d6ec9336
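
Editor's illustration: the distinction Comment 5 draws between the composition range and the active segment can be sketched in TypeScript. This is only a sketch under stated assumptions, not the API from the IME draft: the `CompositionSnapshot` type and both helper functions are hypothetical, all four offsets are assumed to be UTF-16 code unit positions within the element's value, and the committed prefix "私は" is made up so that the offsets do not start at zero.

```typescript
// Hypothetical plain-data snapshot; NOT the real InputMethodContext interface.
// Assumption: every offset is a UTF-16 code unit index into `value`.
interface CompositionSnapshot {
  value: string;
  compositionStartOffset: number;
  compositionEndOffset: number;
  activeSegmentStartOffset: number;
  activeSegmentEndOffset: number;
}

// Text currently under composition (the whole uncommitted sentence).
function compositionText(s: CompositionSnapshot): string {
  return s.value.slice(s.compositionStartOffset, s.compositionEndOffset);
}

// Text of the clause the IME is currently converting.
function activeSegmentText(s: CompositionSnapshot): string {
  return s.value.slice(s.activeSegmentStartOffset, s.activeSegmentEndOffset);
}

// Assumed state after typing "honnwoyomu" and pressing space, following
// the committed prefix "私は" (the prefix is invented for this example).
const snapshot: CompositionSnapshot = {
  value: "私は本を読む",
  compositionStartOffset: 2,
  compositionEndOffset: 6,
  activeSegmentStartOffset: 2,
  activeSegmentEndOffset: 4,
};

console.log(compositionText(snapshot));   // 本を読む
console.log(activeSegmentText(snapshot)); // 本を
```

The two ranges are kept as separate fields because, as Comment 5 notes, the composition range covers the whole uncommitted sentence while the active segment covers only the clause being converted; for a non-segmenting IME the two ranges would presumably coincide.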

w3c / ime-api

Composition dictionary should be changed [Bug 22059] #4