sanskrit-lexicon / hwnorm1

Headword normalization for Cologne dictionaries

sanhw1 moved #14

Open funderburkjim opened 4 years ago

funderburkjim commented 4 years ago

sanhw1.txt contains a listing of all headwords in all dictionaries with Sanskrit headwords. Example lines:

aMSa:BEN,BHS,BOP,BUR,CAE,CCS,GRA,GST,IEG,INM,MD,MW,MW72,PD,PE,PUI,PW,PWG,SCH,SHS,SKD,STC,VCP,WIL,YAT
aMSaH:AP,AP90,SKD
aMSaka:GST,MW,MW72,PD,PW,PWG,SCH,SHS,VCP,WIL,YAT
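For anyone consuming the file, the line format shown above (a headword, a colon, then a comma-separated list of dictionary codes) could be read with something like the following. This is only a minimal sketch based on the three example lines; the function names `parse_sanhw1_line` and `load_sanhw1` are hypothetical and are not part of the repository's code, and the sketch assumes a headword never contains a colon.

```python
# Hypothetical helpers, not taken from the hwnorm1 repository.
# Assumed line format (from the examples above): headword:CODE1,CODE2,...

SAMPLE = """\
aMSa:BEN,BHS,BOP,BUR,CAE,CCS,GRA,GST,IEG,INM,MD,MW,MW72,PD,PE,PUI,PW,PWG,SCH,SHS,SKD,STC,VCP,WIL,YAT
aMSaH:AP,AP90,SKD
aMSaka:GST,MW,MW72,PD,PW,PWG,SCH,SHS,VCP,WIL,YAT
"""


def parse_sanhw1_line(line):
    """Split one line into (headword, list of dictionary codes)."""
    # partition splits at the first colon only.
    headword, _, codes = line.strip().partition(":")
    return headword, (codes.split(",") if codes else [])


def load_sanhw1(lines):
    """Build a dict mapping each headword to its dictionary codes."""
    index = {}
    for line in lines:
        if not line.strip():
            continue  # skip blank lines
        headword, codes = parse_sanhw1_line(line)
        index[headword] = codes
    return index


index = load_sanhw1(SAMPLE.splitlines())

# Headwords that occur in a given dictionary, e.g. SKD.
in_skd = sorted(hw for hw, codes in index.items() if "SKD" in codes)
print(in_skd)
```

With a real copy of the file, `load_sanhw1(open("sanhw1.txt", encoding="utf-8"))` would be the equivalent call, assuming the file is UTF-8 text.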

Previously, the generation of sanhw1.txt was done in an obscure place on the Cologne server (scans/awork/sanhw1/).

The relevant code has been moved to this repository, in the sanhw1 subdirectory.

This directory also regenerates hwnorm1c.txt, which was likewise previously produced in the scans/awork/sanhw1 directory.

The regeneration step can be run locally, provided this hwnorm1 repository is a 'sibling' of the locally generated dictionaries (mw, pw, etc.), i.e. it sits in the same parent directory as they do.

gasyoun commented 4 years ago

sanhw1.txt contains a listing of all headwords in all dictionaries with Sanskrit headwords.

Are only the sopasarga dhatus in PWG left out?