sanskrit-lexicon / COLOGNE

Development of http://www.sanskrit-lexicon.uni-koeln.de/
18 stars 3 forks source link

fri_01 #420

Closed funderburkjim closed 5 months ago

funderburkjim commented 5 months ago

We aim to get fri.txt into a form consistent with the cdsl conventions.

I noticed that in general the current fri_00.txt entries are in a 3-line form:

<L><K1>x</K1><K2>y</K2>
<K2>a</K2>
<p>LENG n = "RUSSIAN">b</LENG>
<p><LENG n = "ENGLISH">c</LENG>
<p><LENG n = "CZEСH">d</LENG>
<pc>page<p>
<LEND>

Note 1:  The 5 middle lines are actually in ONE line of fri_00.txt.  
Note 2:  When there are homonyms, (e.g. 'a', 'akza'), then there is one 
of these  middle sections for each homonym.

However, there are some exceptions to this general form: For instance, there are 1789 lines in fri_00.txt that do not start with the '<' character. e.g.

575@ambu@ambu n.@вода@water@31@voda

@artanat I suggest you rewrite these '@' lines in the general form.

Call the resulting version fri_01.txt.

As a point of organization, I suggest:

Let's see how this goes.

Also, do you notice any other variation from the 'general' form?

funderburkjim commented 5 months ago

Oops! I put this issue in the wrong place!

funderburkjim commented 5 months ago

Copied this issue at https://github.com/sanskrit-lexicon/Frish/issues/2

drdhaval2785 commented 5 months ago

Copying is easier, but it is also possible to transfer an issue across repositories. https://docs.github.com/en/issues/tracking-your-work-with-issues/transferring-an-issue-to-another-repository# is an easy step.