Open aarppe opened 6 years ago
@aarppe Please provide an explicit list of requirements for the "click-in-text" functionality.
This format may be useful:
- [ ] requirement 1 title
- [x] requirement 2 title ([x] means it's already working)
- [ ] requirement 3 title
...
- [ ] requirement N title
## requirement 1 title
requirement 1 description
### Acceptance criteria
brief acceptance criteria
## requirement 2 title
requirement 2 description
### Acceptance criteria
brief acceptance criteria
...
Work in progress - needs proofreading.
Generally expected behavior for crk -> eng:
4b. N.B. In contrast to the desktop version of itwêwina, no partial matches of lexical entries should be presented.
Generally expected behavior for eng -> crk
All the above applies for selection of orthographical representation of crk lemmas / lexical entries.
Click on 'read with itwêwina'
Potentially select language and orthography by clicking on [Ā] tab to the left. For crk, this means crk x eng with the three different crk orthographies
Test cases in: http://sapir.artsrn.ualberta.ca/itwewina-click-in-text.html First invoke click-in-text from bookmarks toolbar.
[ ] SRO/circumfl: ispîhk
[ ] SRO/macron: ispîhk
[ ] SRO: ispihk
[ ] SRO/circumfl: kâ-kî-awâsisîwiyân
[ ] SRO/macron: kā-kī-awāsisīwiyān
[ ] SRO: ka-ki-awasisiwiyan
[ ] SRO/hyph: kakihawasisiwiyan
[ ] SRO/circumfl: nikî-nitawi-kiskinwahamâkosin
[ ] SRO/macron: nikī-nitawi-kiskinwahamākosin
[ ] SRO: niki-nitawi-kiskinwahamakosin
[ ] SRO/non-hyph: nikinitawikiskinwahamakosin
One should not see partial matches of the string.
One should not see partial matches of the string.
One should not see partial matches of the string.
Test cases in: http://sapir.artsrn.ualberta.ca/itwewina-click-in-text.html
😻 Excellent!
Syllabic cases to be added later. Needs thinking as to prefixes being separated from stem by space (not hyphen as in SRO).
In my discussion with Arden and Arok, it seems that
Yes, I agree. For syllabic script with the click-in-text functionality:
For words with multiple parts separated by spaces, we can leave those cases until later (likely involving either a) the possibility of combining alt-click and painting the string sequence in question; b) or itwêwina attempting to look at the context and find what might be an appropriate form.
For single-part words, the click-in-text functionality should work with syllabics exactly the same as with SRO.
For multipart (preverbed/reduplicated) words where the component strings are not separated by a space, the click-in-text functionality should work with syllabics exactly the same as with SRO.
A. Examples with syllabic script for crk input as well as output script for lemma, with translations into English:
NOTE: When word parts are written together, a joiner -h- is required if a preverb ends in a vowel and the subsequent stem begins with a vowel, e.g. ᑳᑮᐦᐊᐚᓯᓰᐏᔮᐣ <-> kâkîhawâsisîwiyân. The joiner can be added to the end of the preverb even when it is kept separate with a space from the following stem (examples: ê <-> eh; wî <-> wîh).
Settings:
Input: ᐃᐢᐲᕽ Output (correct: 1 match presented in syllabics, including lemma): Input: ᑳ Output: 2 correct matches presented in syllabics (including preverb)
Input: ᑮ Output: 1 correct match of preverb
Input: ᐊᐚᓯᓰᐏᔮᐣ Output: 1 correct match of bare conjunct (normally a conjunct would require some conjunct preverb, but that can be omitted in fast speech, so we've allowed for preverbless conjunct forms, which is useful here in click-in-text):
Input: ᓂᑮ Output: no match, as we don't (yet) present an analysis prefixal fragments for independent verb forms (like nikî- / ᓂᑮ here).
Input: ᓂᑕᐏ Output: 1 sort-of correct match, as nitawi- is both an independent particle and a non-independent preverb (probably does not match with the preverb in the XML source, as the preverb meanings have the hyphen in their lemma):
Input: ᑭᐢᑭᓌᐦᐊᒫᑯᓯᐣ Output: Correct with no match, since stem+suffix section of an independent verb cannot occur by itself.
Input: ᐅᐦᐱᒣ Output: 1 correct match (ᐅᐦᐱᒣ), 1 unnecessary (ᐅᐦᐱᒣᐦᐱᓱᐤ) match due to partial matching:
Input: ᓂᑎᐢᑯᓂᑲᓂᕽ Output: 1 correct match
Input: ᐅᐦᒋ Output: 3 correct matches (2 x ᐅᐦᒋ - particle + 1 x ᐅᐦᒌᐤ - verb), potentially other incorrect match due to partial matching:
Input: ᐃᐢᐲᕽ Output: Correct, 1 single match
Input: ᑳᑮᐦᐊᐚᓯᓰᐏᔮᐣ Output: Correct, single match: ᐊᐚᓯᓰᐏᐤ (verb)
Input: ᓂᑮᓂᑕᐏᑭᐢᑭᓌᐦᐊᒫᑯᓯᐣ Output: 1 correct match: ᑭᐢᑭᓌᐦᐊᒫᑯᓯᐤ
Input: ᐅᐦᐱᒣ Output: 1 correct matches: ᐅᐦᐱᒣ (particle) and 1 incorrect match: ᐅᐦᐱᒣᐦᐱᓱᐤ (verb), due to partial matching:
Input: ᓂᑎᐢᑯᓂᑲᓂᕽ Output: 1 correct match: ᐃᐢᑯᓂᑲᐣ
Input: ᐅᐦᒋ Output: 3 correct matches: 2 particles (ᐅᐦᒋ) and 1 verb (ᐅᐦᒌᐤ)
Results should be almost exactly the same is in section 2 above.
B. crk input in syllabics, output of lemma in SRO/circumflex or SRO/macron, translation into English
Click-in-text should work the same, if the output of the lemma is set as SRO but the input is in syllabics
Settings:
Input: ᐃᐢᐲᕽ Output: No match - should present lemma ispîhk (seems to be the case with short syllabic words)
Input: ᑳᑮᐦᐊᐚᓯᓰᐏᔮᐣ Output: awasîsiwiw (verb)
Input: ᓂᑮᓂᑕᐏᑭᐢᑭᓌᐦᐊᒫᑯᓯᐣ Output: kiskinwahamâkosiw
Input: ᐅᐦᐱᒣ Output: No match, though should present: ohpimê
Input: ᓂᑎᐢᑯᓂᑲᓂᕽ Output: 1 correct match: iskonikan (noun)
Input: ᐅᐦᒋ Output: 1 correct match: ohcîw, 1/2 missing matches; ohci-
When selecting SRO/macron as output format for the lemma, the results should be exactly the as for the SRO/circumflex cases above:
For instance, for the following case: Input: ᑲᑭᐦᐊᐘᓯᓯᐏᔭᐣ Output: 1 correct match: awāsisīwiw
However, the shorter syllabic words get no matches when they surely should, e.g.
This now works in general. But we should restrict the behavior of the click-in-text functionality so that partial matches won't be presented (or sent over Internet) when itwêwina is accessed via the click-in-text functionality.