Closed LinguList closed 5 years ago
Given that there are 260 of those, it is definitely worth doing this.
It is systematic, so I can fix many things without going manual, will send an update soon.
So, here are the ones really missing, they should be added (should be quick, by comparing concepts):
number | short ID (autoconvert) | id-in-stedt | concept-in-sttedt |
---|---|---|---|
1 | 13a.5 | XIIIA5 | cause (smn to do sthg) |
2 | 12c. | XIIC | under (below) |
3 | 12c. | XIIC | under |
4 | 12d. | XIID | old (of clothing) |
5 | 12c. | XIIC | where (w. to) |
6 | 12c. | XIIC | behind (not visible) |
7 | 13a.3 | XIIIA3 | BENEFACTIVE |
8 | 12d. | XIID | never |
9 | I.9 | I.9 | all |
10 | Hale 73 c.Sd. | Hale 73 CSD | jackal |
11 | 12d. | XIID | unmixed |
12 | 12c. | XIIC | hither and thither |
13 | 13a.1 | XIIIA1 | can / be able to do something or know something |
14 | 12d. | XIID | unmixed / pure |
15 | 12d. | XIID | least |
16 | 12c. | XIIC | through |
17 | 13a.4 | XIIIA4 | let (smn do sthg) / permit |
18 | 12d. | XIID | real |
19 | 12c. | XIIC | around |
20 | 12d. | XIID | new one; one which is new |
21 | 0 2b1.59 | 0 2b1.59 | get up |
22 | 12d. | XIID | new |
23 | 12c. | XIIC | over (above) |
24 | 12c. | XIIC | toward |
25 | 66 | 66 | armpit |
26 | 01.009;12d | 01.009;12d | all (for things) |
27 | 12c. | XIIC | in / inside of |
28 | 12d. | XIID | mixed |
29 | 12c. | XIIC | behind (visible) |
30 | 12d. | XIID | before |
31 | 12c. | XIIC | away from |
32 | 12c. | XIIC | under (beneath) |
33 | 93 | 93 | wing |
34 | 12c. | XIIC | out of |
35 | srcid | srcid | gloss |
36 | 12d. | XIID | early |
37 | 12c. | XIIC | above (directly) |
38 | 12d. | XIID | partial |
39 | 12c. | XIIC | between |
40 | 12c. | XIIC | down |
41 | 12d. | XIID | frequently |
42 | 12c. | XIIC | beneath |
43 | 01.009,12d | 01.009,12d | all |
44 | 12d. | XIID | until / as long as |
45 | 12c. | XIIC | across |
46 | 12d. | XIID | less |
47 | 12d. | XIID | during / in the midle |
48 | 12d. | XIID | old (of objects) |
49 | 12d. | XIID | all |
50 | 12c. | XIIC | beyond |
51 | 12c. | XIIC | where (w. at) |
52 | 12c. | XIIC | beside |
53 | 12d. | XIID | during |
54 | 12d. | XIID | old |
55 | hand | ||
56 | 13a.4 | XIIIA4 | let (smn do sthg) |
57 | 01.009,12d | 01.009,12d | all (for things) |
58 | 12d.7 | XIID7 | more |
59 | 12d. | XIID | most |
60 | 12c. | XIIC | up (up country) |
61 | 12d. | XIID | more |
62 | 12d. | XIID | frequently / sometimes |
63 | 13a.2 | XIIIA2 | know (fact / person) |
64 | 12c. | XIIC | where |
65 | 12d. | XIID | when |
66 | 12c. | XIIC | behind |
67 | 66 | 66 | palm of hand |
68 | 12d. | XIID | after |
69 | 12c. | XIIC | up |
70 | 12d. | XIID | unmixed / without condiment |
71 | 12d. | XIID | infrequent |
72 | 12d. | XIID | frequent |
73 | 06a.0406a. | 06a.0406a. | place |
74 | 158 | 158 | tickle |
75 | 12c.1 | XIIC1 | far |
76 | 12d. | XIID | until |
77 | 12d. | XIID | late |
78 | 12d. | XIID | daily |
79 | 12c. | XIIC | up (straight up) |
80 | 13a.1 | XIIIA1 | can / be able to do something |
81 | 66 | 66 | arm |
82 | 13a.2 | XIIIA2 | know (sthg) |
83 | 12c. | XIIC | above |
84 | 01.009,12. | 01.009,12. | all |
85 | 12d. | XIID | whole |
86 | 12d. | XIID | after / at last |
87 | 12c. | XIIC | over |
Could you look into this, @chrzyki ?
I'll soon push the orthography profile (yes, this is actually working!)
Cool that the orthography profile is working! Will have a look at the SRCIDS.
Thanks @natalia-morozova & see here @LinguList:
https://github.com/lexibank/halenepal/blob/master/raw/srcids_corrected.csv
Four are still hleft, if you re-run the code, otherwise it's fine.
The reason why we have only some 7000 instead of 10000 forms int eh data now is that there are sourcids that are in STEDT but not in hale:
If those are identified (and ideally corrected in some json or whatever), we should have the full account of the data.