Closed funderburkjim closed 9 years ago
24 tasdI OLD: don't edit
<H1><h><hc3>000</hc3><key1>tasdI</key1><hc1>1</hc1><key2>tasdI</key2></h><body>
<c>in_<ab>astron.</ab>_=_$_,_hexagon.</c> </body><tail><mul/> <MW>054217</MW>
<pc>441,2</pc> <L>83680</L></tail></H1>
24 tasdI NEW: edit
<H1><h><hc3>000</hc3><key1>tasdI</key1><hc1>1</hc1><key2>tasdI</key2></h><body>
<c>in_<ab>astron.</ab>_=_تسْديس_,_hexagon.</c> </body><tail><mul/> <MW>054217</MW>
<pc>441,2</pc> <L>83680</L></tail></H1>
25 tIra OLD: don't edit
<H1B><h><hc3>110</hc3><key1>tIra</key1><hc1>1</hc1><key2>tIra</key2><hom>1</hom></h>
<body> <lex>n.</lex> <c>a_kind_of_arrow</c> <p><ab>cf.</ab>~<c><ab>Pers.</ab>_$</c></p>
<ls>Pan5cad._ii_,_76</ls> </body><tail><pc>449,1</pc> <L>85530</L></tail></H1B>
25 tIra NEW: edit
<H1B><h><hc3>110</hc3><key1>tIra</key1><hc1>1</hc1><key2>tIra</key2><hom>1</hom></h>
<body> <lex>n.</lex> <c>a_kind_of_arrow</c> <p><ab>cf.</ab>~<c><ab>Pers.</ab>_تير</c></p>
<ls>Pan5cad._ii_,_76</ls> </body><tail><pc>449,1</pc> <L>85530</L></tail></H1B>
26 tUda OLD: don't edit
<H1A><h><hc3>100</hc3><key1>tUda</key1><hc1>1</hc1><key2>tUda</key2></h><body>
<lex type="inh">m.</lex> <eq/> <s>tUta</s> <p>$</p> <ls>Npr.</ls> </body><tail><pc>452,3</pc>
<L>86395</L></tail></H1A>
26 tUda NEW: edit
<H1A><h><hc3>100</hc3><key1>tUda</key1><hc1>1</hc1><key2>tUda</key2></h><body>
<lex type="inh">m.</lex> <eq/> <s>tUta</s> <p>توت</p> <ls>Npr.</ls> </body><tail><pc>452,3</pc>
<L>86395</L></tail></H1A>
27 dArvawa OLD: don't edit
<H1><h><hc3>110</hc3><key1>dArvawa</key1><hc1>1</hc1><key2>dArvawa</key2></h><body>
<lex>n.</lex> <p>fr._<ab>Pers.</ab>_$</p> <c>_a_court_or_council-house</c> <ls>L.</ls> <p>
<ab>cf.</ab>~<s>darBawa</s></p>. </body><tail><mul/> <MW>059213</MW> <pc>476,2</pc>
<L>91800</L></tail></H1>
27 dArvawa NEW: edit
<H1><h><hc3>110</hc3><key1>dArvawa</key1><hc1>1</hc1><key2>dArvawa</key2></h><body>
<lex>n.</lex> <p>fr._<ab>Pers.</ab>_دربار</p> <c>_a_court_or_council-house</c> <ls>L.</ls> <p>
<ab>cf.</ab>~<s>darBawa</s></p>. </body><tail><mul/> <MW>059213</MW> <pc>476,2</pc>
<L>91800</L></tail></H1>
28 nakta OLD: don't edit
<H1><h><hc3>003</hc3><key1>nakta</key1><hc1>1</hc1><key2>nakta</key2><hom>2</hom>
</h><body><OR group="102809,nakta;102809.1,nakla"/> <c>or</c> <s>nakla</s> <p>?</p>
<lex>n.</lex> <p>in_<ab>astron.</ab></p>_N._of_the_fifth_<as0>Yoga</as0><as1><s>yoga</s>
</as1>_<p>=_$</p> <c>.</c> </body><tail><mul/> <MW>066383</MW> <pc>524,1</pc>
<L>102809</L></tail></H1>
28 nakta NEW: edit
<H1><h><hc3>003</hc3><key1>nakta</key1><hc1>1</hc1><key2>nakta</key2><hom>2</hom>
</h><body><OR group="102809,nakta;102809.1,nakla"/> <c>or</c> <s>nakla</s> <p>?</p>
<lex>n.</lex> <p>in_<ab>astron.</ab></p>_N._of_the_fifth_<as0>Yoga</as0><as1><s>yoga</s>
</as1>_<p>=_نقل</p> <c>.</c> </body><tail><mul/> <MW>066383</MW> <pc>524,1</pc>
<L>102809</L></tail></H1>
29 nakla OLD: don't edit
<H1><h><hc3>003</hc3><key1>nakla</key1><hc1>1</hc1><key2>nakla</key2><hom>2</hom></h>
<body><OR group="102809,nakta;102809.1,nakla"/> <c>or</c> <s>nakta</s> <p>?</p> <lex>n.</lex>
<p>in_<ab>astron.</ab></p>_N._of_the_fifth_<as0>Yoga</as0><as1><s>yoga</s></as1>_
<p>=_$</p> <c>.</c> </body><tail><mul/> <MW>066383</MW> <pc>524,1</pc> <L>102809.1</L>
</tail></H1>
SAME IMAGE AS FOR 28 nakta.
29 nakla NEW: edit
<H1><h><hc3>003</hc3><key1>nakla</key1><hc1>1</hc1><key2>nakla</key2><hom>2</hom>
</h><body><OR group="102809,nakta;102809.1,nakla"/> <c>or</c> <s>nakta</s> <p>?</p>
<lex>n.</lex> <p>in_<ab>astron.</ab></p>_N._of_the_fifth_<as0>Yoga</as0><as1><s>yoga</s>
</as1>_<p>=_نقل</p> <c>.</c> </body><tail><mul/> <MW>066383</MW> <pc>524,1</pc>
<L>102809.1</L></tail></H1>
30 niHSARa OLD: don't edit
<H1><h><hc3>110</hc3><key1>niHSARa</key1><hc1>1</hc1><key2>niH-SARa</key2></h>
<body> <lex>m.</lex> <c>or</c> <lex type="hw">n.</lex> <c>march_,_procession</c> <ls>Sa1h.</ls>
<p><ab>Pers.</ab>_$_?</p> <c>.</c> </body><tail><mul/> <MW>069398</MW> <pc>544,1</pc>
<L>107746</L></tail></H1>
30 niHSARa NEW: edit
<H1><h><hc3>110</hc3><key1>niHSARa</key1><hc1>1</hc1><key2>niH-SARa</key2></h>
<body> <lex>m.</lex> <c>or</c> <lex type="hw">n.</lex> <c>march_,_procession</c> <ls>Sa1h.</ls>
<p><ab>Pers.</ab>_نشان_?</p> <c>.</c> </body><tail><mul/> <MW>069398</MW> <pc>544,1</pc>
<L>107746</L></tail></H1>
31 pAtasAka OLD: don't edit
<H1><h><hc3>110</hc3><key1>pAtasAka</key1><hc1>1</hc1><key2>pAtasAka</key2></h><body>
<lex>m.</lex> <c>=_$_,_a_king</c> <ls>Cat.</ls> </body><tail><mul/> <MW>078101</MW>
<pc>617,1</pc> <L>121814</L></tail></H1>
31 pAtasAka NOTE to ejf: Headword correction pAtasAka -> pAtasAha
31 pAtasAka NEW: edit
<H1><h><hc3>110</hc3><key1>pAtasAka</key1><hc1>1</hc1><key2>pAtasAka</key2></h>
<body> <lex>m.</lex> <c>=_پادشاه_,_a_king</c> <ls>Cat.</ls> </body><tail><mul/> <MW>078101</MW>
<pc>617,1</pc> <L>121814</L></tail></H1>
32 pIlu OLD: don't edit
<H2A><h><hc3>100</hc3><key1>pIlu</key1><hc1>2</hc1><key2>pIlu</key2></h><body>
<lex type="inh">m.</lex> <c>an_elephant</c> <p><cf/>~<c>Aribic_$_,_Persian_$</c></p>
<ls>L.</ls> </body><tail><pc>630,2</pc> <L>124989</L></tail></H2A>
32 pIlu NEW: edit
<H2A><h><hc3>100</hc3><key1>pIlu</key1><hc1>2</hc1><key2>pIlu</key2></h><body>
<lex type="inh">m.</lex> <c>an_elephant</c> <p><cf/>~<c>Aribic_فيل_,_Persian_پيل</c></p>
<ls>L.</ls> </body><tail><pc>630,2</pc> <L>124989</L></tail></H2A>
33 puruzapura OLD: don't edit
<H3><h><hc3>110</hc3><key1>puruzapura</key1><hc1>3</hc1><key2>pu/ruza--pura</key2></h>
<body> <lex>n.</lex> <c>N._of_the_capital_of_<as0>Ga1ndha1ra</as0><as1><s>gAnDAra</s>
</as1>_,_the_modern_<as0 type="ns">Pesha1war</as0><as1>Peshawar</as1></c>
<p>$</p> <ls>L.</ls> </body><tail><MW>081125</MW> <pc>637,2</pc> <L>126512</L></tail>
</H3>
33 puruzapura NEW: edit
<H3><h><hc3>110</hc3><key1>puruzapura</key1><hc1>3</hc1><key2>pu/ruza--pura</key2></h>
<body> <lex>n.</lex> <c>N._of_the_capital_of_<as0>Ga1ndha1ra</as0><as1><s>gAnDAra</s>
</as1>_,_the_modern_<as0 type="ns">Pesha1war</as0><as1>Peshawar</as1></c>
<p>پيشاور</p> <ls>L.</ls> </body><tail><MW>081125</MW> <pc>637,2</pc> <L>126512</L></tail>
</H3>
34 peraja OLD: don't edit
<H1><h><hc3>003</hc3><key1>peraja</key1><hc1>1</hc1><key2>peraja</key2></h><body>
<OR group="128969,peraja;128969.1,peroja"/> <c>or</c> <s>peroja</s> <lex>n.</lex>
<c>a_turquoise</c> <ls>L.</ls> <p><ab>cf.</ab>~<c><ab>Pers.</ab>_$</c></p> <c>.</c> </body>
<tail><pc>Page648,3</pc> <mul/> <MW>082664</MW> <L>128969</L></tail></H1>
34 peraja NEW: edit
<H1><h><hc3>003</hc3><key1>peraja</key1><hc1>1</hc1><key2>peraja</key2></h><body>
<OR group="128969,peraja;128969.1,peroja"/> <c>or</c> <s>peroja</s> <lex>n.</lex>
<c>a_turquoise</c> <ls>L.</ls> <p><ab>cf.</ab>~<c><ab>Pers.</ab>_فيروزه</c></p> <c>.</c> </body>
<tail><pc>Page648,3</pc> <mul/> <MW>082664</MW> <L>128969</L></tail></H1>
35 peroja OLD: don't edit
<H1><h><hc3>003</hc3><key1>peroja</key1><hc1>1</hc1><key2>peroja</key2></h><body>
<OR group="128969,peraja;128969.1,peroja"/> <c>or</c> <s>peraja</s> <lex>n.</lex>
<c>a_turquoise</c> <ls>L.</ls> <p><ab>cf.</ab>~<c><ab>Pers.</ab>_$</c></p> <c>.</c> </body>
<tail><pc>Page648,3</pc> <mul/> <MW>082664</MW> <L>128969.1</L></tail></H1>
[THE IMAGE IS SAME AS for 34 peraja]
35 peroja NEW: edit
<H1><h><hc3>003</hc3><key1>peroja</key1><hc1>1</hc1><key2>peroja</key2></h><body>
<OR group="128969,peraja;128969.1,peroja"/> <c>or</c> <s>peraja</s> <lex>n.</lex>
<c>a_turquoise</c> <ls>L.</ls> <p><ab>cf.</ab>~<c><ab>Pers.</ab>_فيروزه</c></p> <c>.</c> </body>
<tail><pc>Page648,3</pc> <mul/> <MW>082664</MW> <L>128969.1</L></tail></H1>
36 pradIpasAha OLD: don't edit
<H4><h><hc3>100</hc3><key1>pradIpasAha</key1><hc1>3</hc1><key2>pra-<sr/>dIpa---
sAha</key2></h><body> <lex>m.</lex> <c>N._of_a_prince</c> <ls>Cat.</ls> <p><s>sAha</s>~
<c>=_$</c></p> </body><tail><pc>680,1</pc> <L>134738</L></tail></H4>
36 pradIpasAha NEW: edit
<H4><h><hc3>100</hc3><key1>pradIpasAha</key1><hc1>3</hc1><key2>pra-<sr/>dIpa---
sAha</key2></h><body> <lex>m.</lex> <c>N._of_a_prince</c> <ls>Cat.</ls> <p><s>sAha</s>~
<c>=_شاه</c></p> </body><tail><pc>680,1</pc> <L>134738</L></tail></H4>
37 priyadarSana OLD: don't edit
<H3B><h><hc3>110</hc3><key1>priyadarSana</key1><hc1>3</hc1><key2>priya/--
darSana</key2></h><body> <lex type="inh">m.</lex>
<c>a_plant_growing_in_wet_weather_on_trees_and_stones_<p><c1>in_<as0
type="ns">Mara1t2hi1</as0><as1>Marathi</as1>_called</c1>~<s>dagaqaPUla</s>~,~<c1>in_<as0
type="ns">Hindu1sta1ni1</as0><as1>Hindustani</as1>_$</c1></p></c> <ls>L.</ls> </body><tail>
<mat/> <pc>710,2</pc> <L>140522</L></tail></H3B>
37 priyadarSana NEW: edit
<H3B><h><hc3>110</hc3><key1>priyadarSana</key1><hc1>3</hc1><key2>priya/--
darSana</key2></h><body> <lex type="inh">m.</lex>
<c>a_plant_growing_in_wet_weather_on_trees_and_stones_<p><c1>in_<as0
type="ns">Mara1t2hi1</as0><as1>Marathi</as1>_called</c1>~<s>dagaqaPUla</s>~,~<c1>in_<as0
type="ns">Hindu1sta1ni1</as0><as1>Hindustani</as1>_پتهركي پهول</c1></p></c> <ls>L.</ls> </body><tail>
<mat/> <pc>710,2</pc> <L>140522</L></tail></H3B>
38 premasAhi OLD: don't edit
<H3><h><hc3>112</hc3><key1>premasAhi</key1><hc1>3</hc1><key2>prema--sAhi</key2></h>
<body> <p><s>sAhi</s>~<c>=_$</c></p> <lex>m.</lex> <eq/> <s>-nArAyaRa</s> <ls>Inscr.</ls>
</body><tail><MW>089503</MW> <pc>711,2</pc> <L>140817</L></tail></H3>
38 premasAhi NEW: edit
<H3><h><hc3>112</hc3><key1>premasAhi</key1><hc1>3</hc1><key2>prema--sAhi</key2></h>
<body> <p><s>sAhi</s>~<c>=_شاه</c></p> <lex>m.</lex> <eq/> <s>-nArAyaRa</s> <ls>Inscr.</ls>
</body><tail><MW>089503</MW> <pc>711,2</pc> <L>140817</L></tail></H3>
39 PatihaBUpati OLD: don't edit
<H1><h><hc3>003</hc3><key1>PatihaBUpati</key1><hc1>1</hc1><key2>Patiha-BUpati</key2>
</h><body><OR group="141699,PatihaBUpati;141699.1,PatihaSAha"/> <c>and</c>
<s>PatihaSAha</s> <lex>m.</lex> <c>N._of_a_king_of_<as0>Kas3mi1ra</as0><as1>
<s>kaSmIra</s></as1></c> <ls>Cat.</ls> <p>=_$</p> <c>.</c> </body><tail><mul/>
<MW>090086</MW> <pc>716,2</pc> <L>141699</L></tail></H1>
39 PatihaBUpati NEW: edit
<H1><h><hc3>003</hc3><key1>PatihaBUpati</key1><hc1>1</hc1><key2>Patiha-BUpati</key2>
</h><body><OR group="141699,PatihaBUpati;141699.1,PatihaSAha"/> <c>and</c>
<s>PatihaSAha</s> <lex>m.</lex> <c>N._of_a_king_of_<as0>Kas3mi1ra</as0><as1>
<s>kaSmIra</s></as1></c> <ls>Cat.</ls> <p>=_فتح شاه</p> <c>.</c> </body><tail><mul/>
<MW>090086</MW> <pc>716,2</pc> <L>141699</L></tail></H1>
40 PatihaSAha OLD: don't edit
<H1><h><hc3>003</hc3><key1>PatihaSAha</key1><hc1>1</hc1><key2>Patiha-SAha</key2></h>
<body><OR group="141699,PatihaBUpati;141699.1,PatihaSAha"/> <c>and</c> <s>PatihaBUpati</s>
<lex>m.</lex> <c>N._of_a_king_of_<as0>Kas3mi1ra</as0><as1><s>kaSmIra</s></as1></c>
<ls>Cat.</ls> <p>=_$</p> <c>.</c> </body><tail><mul/> <MW>090086</MW> <pc>716,2</pc>
<L>141699.1</L></tail></H1>
[SAME IMAGE AS 39 PatihaBUpati]
40 PatihaSAha NEW: edit
<H1><h><hc3>003</hc3><key1>PatihaSAha</key1><hc1>1</hc1><key2>Patiha-SAha</key2></h>
<body><OR group="141699,PatihaBUpati;141699.1,PatihaSAha"/> <c>and</c> <s>PatihaBUpati</s>
<lex>m.</lex> <c>N._of_a_king_of_<as0>Kas3mi1ra</as0><as1><s>kaSmIra</s></as1></c>
<ls>Cat.</ls> <p>=_فتح شاه</p> <c>.</c> </body><tail><mul/> <MW>090086</MW> <pc>716,2</pc>
<L>141699.1</L></tail></H1>
41 PARita OLD: don't edit
<H2><h><hc3>110</hc3><key1>PARita</key1><hc1>2</hc1><key2>PARita</key2></h><body>
<lex>m.</lex> <p><ls>Ni1lak.</ls></p> <lex type="hw">n.</lex> <p><c>fr._<ab>Caus.</ab>_of</c>~
<root/>~<s>PaR</s>~;~<ab>cf.</ab>~<ls>Pa1n2._7-2_,_18_<ab>Sch.</ab></ls></p>
<c>the_inspissated_juice_of_the_sugar_cane_and_other_plants</c> <ls>A1past.</ls> <ls>MBh.</ls>
<ls>Hariv._</ls> <b><c><ab>cf.</ab>_
<ab>Arab.</ab>$ </c>~~;~~<c><ab>Pers.</ab> $ ;
_medieval_<ab>Lat.</ab></c>~<etym>penidium</etym>.</b> </body><tail><MW>090404</MW>
<pc>718,2</pc> <L>142157</L></tail></H2>
41 PARita NEW: edit
<H2><h><hc3>110</hc3><key1>PARita</key1><hc1>2</hc1><key2>PARita</key2></h><body>
<lex>m.</lex> <p><ls>Ni1lak.</ls></p> <lex type="hw">n.</lex> <p><c>fr._<ab>Caus.</ab>_of</c>~
<root/>~<s>PaR</s>~;~<ab>cf.</ab>~<ls>Pa1n2._7-2_,_18_<ab>Sch.</ab></ls></p>
<c>the_inspissated_juice_of_the_sugar_cane_and_other_plants</c> <ls>A1past.</ls> <ls>MBh.</ls>
<ls>Hariv._</ls> <b><c><ab>cf.</ab>_
<ab>Arab.</ab>فانيذ </c>~~;~~<c><ab>Pers.</ab> پانيد ;
_medieval_<ab>Lat.</ab></c>~<etym>penidium</etym>.</b> </body><tail><MW>090404</MW>
<pc>718,2</pc> <L>142157</L></tail></H2>
42 bandI OLD: don't edit
<H2><h><hc3>110</hc3><key1>bandI</key1><hc1>2</hc1><key2>bandI</key2></h><body>
<lex>f.</lex> <p><ab>cf.</ab>~<c><ab>Pers.</ab>_$</c></p> <c>_a_male_or_female_prisoner</c>
<ls>Ka1lid.</ls> <ls>Bhat2t2.</ls> </body><tail><pc>720,1</pc> <L>142504</L></tail></H2>
42 bandI NEW: edit
<H2><h><hc3>110</hc3><key1>bandI</key1><hc1>2</hc1><key2>bandI</key2></h><body>
<lex>f.</lex> <p><ab>cf.</ab>~<c><ab>Pers.</ab>_بنده</c></p> <c>_a_male_or_female_prisoner</c>
<ls>Ka1lid.</ls> <ls>Bhat2t2.</ls> </body><tail><pc>720,1</pc> <L>142504</L></tail></H2>
This is a bit over half of the words. I'll put the rest in a second issue.
@funderburkjim This system works great. I have finished all the transcriptions in this issue.
Doesn't it take too much of your time, Jim?
@jlreeder It looks like the two Arabic words are in the wrong order in #39 and #40.
@funderburkjim Astute observation. Let me explain my decision. What you're noticing reflects how the original publisher handled line breaks in Arabic text, not the way the term was meant to be read. The term is definitely فتح شاه (fatah shah, matching the Sanskrit order). So the word that belongs on the left (which left-to-right readers would call "first") is شاه, while the word that belongs on the right ("second") is فتح. In most other cases in the dictionary, the ordering of multi-word terms like this one doesn't matter, since they're treated as one block of text. But when such a term falls across a line break, as in this instance, it gets complicated.

The original publisher recognized that the term consisted of two words and split them over two lines. But in dividing the words they treated it as a left-to-right (LTR) writing system, like Latin or Devanagari, instead of the right-to-left (RTL) system it really is. In other words, they put the leftmost word, شاه, first, even though it should really come second. The critical point is that the printer still intended شاه to be on the left and فتح to be on the right, which is correct. If the entry had fit on one line, this difference would not have mattered, and the term would have been printed فتح شاه, as it should be.

So I think transcribing it as I have is faithful not only to the Sanskrit/Persian (i.e. linguistically correct) but also to the original printing of the dictionary, although you're right that at first it looks backward. Does that make sense?
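For anyone checking similar cases, here is a minimal Python sketch of the point above. It assumes the scan shows the two words on separate lines in the order described in this comment; the helper name `logical_order_from_ltr_split` is made up for illustration and is not part of any project tooling.

```python
import unicodedata

# The two words of the term, written with escapes so the source is unambiguous.
FATH = "\u0641\u062a\u062d"  # فتح
SHAH = "\u0634\u0627\u0647"  # شاه


def logical_order_from_ltr_split(printed_lines):
    """Rebuild reading order for an RTL term split over lines as if it were LTR.

    Hypothetical helper: printed_lines lists the words top to bottom as they
    appear in the scan. Because the leftmost (last-read) word was placed on
    the first line, reversing the list restores the order in which the term
    is actually read, which is also the order to type into the XML.
    """
    return " ".join(reversed(printed_lines))


# The scan shows the leftmost word first, then the rightmost word.
term = logical_order_from_ltr_split([SHAH, FATH])

# The stored string starts with the word that is read first, even though a
# renderer will draw that word at the right-hand end of the line.
first_word_is_fath = term.split(" ")[0] == FATH

# The letters carry the Unicode bidi class for Arabic letters, which is what
# tells a renderer to lay the run out right to left.
bidi_classes = {unicodedata.bidirectional(ch) for ch in term if ch != " "}

print(first_word_is_fath, bidi_classes)
```

The stored character order is the reading order; where each word lands on screen is decided by the renderer, so a transcription can look reversed next to the scan and still be right.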
@jlreeder That's a terrific explanation, Jason. I think it completely justifies your original coding, and explains why my observation only occurred where there was a line break in the scan. Thank you for taking the time to explain this point about RTL languages.
I'll go ahead and finish the installation of these cases tomorrow.
@funderburkjim No trouble. I'm glad it made sense. I'm standing by for future edits.
Installation of changes finished.
You can check the MW display by entering a headword, for instance PatihaBUpati.
Closing this issue.
The approach is to devote one issue comment to each headword in the MW dictionary in which Arabic text occurs.
The digitized record for the headword appears in an XML format. Initially, the Arabic text is represented by the '$' character.
There will be an 'old' copy of the record, which should remain untouched.
And there will be a 'new' copy of the record.
Jason Reeder (or whoever is providing the Arabic text) should edit the 'new' copy, replacing the '$' symbol with the appropriate Arabic text.
I'll edit this comment field once we have worked with this a bit. For now, this is just an experiment.
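As a rough illustration of the old/new convention, the Python sketch below fills the '$' placeholders in an 'old' record and checks that a 'new' record differs from its 'old' copy only where a '$' stood. The function names and the assumption that every replacement consists of characters from the basic Arabic Unicode block (U+0600 to U+06FF) plus spaces are mine, not part of the installation scripts used for MW.

```python
import re

# Assumed shape of a replacement: Arabic-block characters and plain spaces.
ARABIC_RUN = "[\u0600-\u06FF ]+"


def fill_placeholders(old_record, arabic_terms):
    """Return old_record with each '$' replaced, in order, by a term."""
    parts = old_record.split("$")
    if len(parts) - 1 != len(arabic_terms):
        raise ValueError("number of terms does not match number of '$' signs")
    filled = parts[0]
    for term, rest in zip(arabic_terms, parts[1:]):
        filled += term + rest
    return filled


def differs_only_at_placeholders(old_record, new_record):
    """True if new_record equals old_record except where '$' was replaced."""
    # Escape the literal markup and allow an Arabic run at each '$' position.
    pattern = ARABIC_RUN.join(re.escape(part) for part in old_record.split("$"))
    return re.fullmatch(pattern, new_record) is not None


old = "<p>=_$</p> <c>.</c>"
naql = "\u0646\u0642\u0644"  # نقل

new = fill_placeholders(old, [naql])
print(differs_only_at_placeholders(old, new))

# An edit that also changes the markup around the placeholder is flagged.
stray = "<p>= " + naql + "</p> <c>.</c>"
print(differs_only_at_placeholders(old, stray))
```

A check of this kind would catch accidental changes outside the placeholder, such as an underscore turned into a space, before the 'new' records are installed.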