Open drdhaval2785 opened 3 years ago
To check non-comma items, regex is cat v02/acc/acc.txt | grep -n '[^a-zA-Z,]\#\}'
output is as below
964:{#ajYAnaboDinI#}¦ or {#aDyAtmavidyopadeSaviDi#} or {#saMkziptavedA-#}
1763:{#aDikaraRanyAyamAlA,#}¦ also {#vedAntADikaraRamAlA, SArI-#}
7483:{#AtmopadeSaviDi#}¦ or {#AtmavidyopadeSa#} or {#AtmavidyopadeSa-#}
9155:{#AruRIyopanizad#}¦ or {#AruRikopanizad#} or {#AruReyopa-#}
11613:{#upadeSasAhasrI#}¦ or complete {#sakalavedopanizatsAropadeSasA-#}
14202:{#karaRakutUhala#}¦ or {#grahAgamakutUhala#} or {#brahmatulya#} or {#brahmatulyasi-#}
16267:{#kAmadeva mImAMsakadIkzita^0#}¦
51289:{#nArAyaRadeva (gajapativIranArAyaRadeva)#}¦ son of Padmanā-
55799:{#pativratAmAhAtmya#}¦ Oppert 7335. II, 469, and {#pativratopA-#}
56001:{#padArTaKaRqana#}¦ or {#padArTatattva#}¦ or {#padArTatattvanirUpaRa#} or {#padArTa-#}
58861:{#pASakakevalI#}¦ sometimes spelled {#pASAkevalI#} or {#pASakake-#}
58893:{#pAzaRqacapewikA#}¦ or {#pAzaRqamuKacapewikA#} or {#pAzARqAsyaca-#}
67301:{#brahmasUtra#}¦ or {#uttaramImAMsA#} or {#bAdArAyaRasUtra#} or {#brahmamI-#}
71524:{#BAskararAya#}¦ or {#BAskararAja dIkzita#} or {#BAsurAnanda#} or {#BA-#}
79243:{#mImAMsABAzya#}¦ or {#mImAMsAsUtraBAzya#}¦ or {#SabaraBAzya#}¦ or {#SA-#}
81038:<>{#mEtrAyaRyupanizad#} or {#mEtreyISAKopanizad#} or {#mEtreyopa-#}
81216:{#modamaYjarIguRaleSamAtrasUcakAzwaka#}¦ and {#modamaYjarIguRaleSa-#}
85052:{#ratnamaYjarIguRaleSamAtrasUcakAzwaka#}¦ and {#ratnamaYjarIguRaleSasU-#}
87298:{#rARaka#}¦ or {#nyAyasuDA#} or {#vArttikayojanA#} or {#sarvAnavadyakA-#}
91330:{#rAjAnaka rucaka (ruyyaka)#}¦ son of Rājānaka Tilaka, guru
97243:{#vAdanakzatramAlikA#}¦ also {#nakzatravAdamAlikA#}¦ and {#nakzatravA-#}
120565:{#sapaSukEkAhikacAturmAsyaprAyoga#}¦ and {#sapaSukEkAhikacAturmA-#}
130834:{#hayagrIva{??}#}¦ a poem, by Bhartṛmeṇṭha. Rājataraṅgiṇī
142382:<HI>{#kASIsTagOramuKavivAdADikAripraSnAnAM kampanIkASIpAWa-#}
142894:{#kfzRaBaktikalpavallI#}¦ called also {#BaktimaYjarI#} and {#hariBakti-#}
145587:{#cInAcArasAratantra#}¦ or {#AcArasAratantra#} or {#mahAcInakramA-#}
146389:{#jYAnaBAskara#}¦ or {#sUryAruRasaMvAda#} or {#sUryAruRIyakarmavipA-#}
147221:{#tAjikatantrasAra#}¦ or {#gaRakaBUzaRa#} or {#karmaprakASa#} or {#manuzya-#}
169457:<>{#manyurvyAGre me'ntarAmayaH .#} It is alluded to in Baudhā-
175734:{#daSakumAracarita#}¦ by Daṇḍin. Ulwar 922.--{#daSakumAra-#}
182064:<>fill pages {#1--261#}. It is to be regretted that no distinctive signs have been adopted in the description of the
182526:{#aDikaraRanyAyamAlA#}¦ or {#aDikaraRaratnamAlA#}¦ or {#vEyAsi-#}
185564:{#kAmanandABiDAnakAvya (kAmA ?)#}¦ by Dhananda Kavi.
188995:{#candrikAKaRqana-#}¦ directed against Rāmatīrtha's Candrikā,
189493:<>{#na punarjanma yogayuktasya jAyate ..#}
191795:{#dinakaroddyota (dAnadinakara)#}¦ commenced by Dinakara,
203633:<>{#tadyatayo viSanti .. 1 .. BAzyawIkAvivaraRaM tannibanDa-#}
203634:<>{#nasaMgrahaM . vyAKyAnavyAKyeyeBAvakleSahAnAya racyate .. 2 ..#}
204447:{#vedAntagranTa ?#}¦ (Nimbārka school). Bd. 708.
204746:{#vEzRavagItA (kfzRArjunasaMvAda)#}¦ Hpr. 1, 343.
206322:<>(printed {#pu^0#}). AK 710.
208369:{#sidDavinAyakapUjApadDati#}¦ dh. Bd. 319. Read {#sidDi^0#}.¦
210200:<P>{#aTAnuvAkAnvakzyAmi brahmaRA vihitAnpurA . SizyA-#}
210204:<P>{#barhiRAbarhApIqaH#} etc. || 1 || {#wIkAsarvasvaM daSawIkA-#}
210205:<>{#vitkarotyamarakoSe . SrImatsarvAnando vandyaGawIyArtihara-#}
210210:<>{#vidyate cATa jammani . utpadyate jAyate ca prarohatyudBava-#}
210213:<P>{#sanAtanaM rUpamihopadarSayannAnandasinDuM paritaH pravarta-#}
210214:<>{#yan . antastamaHstomaharaH sa rAjatAM cEtanyarUpo viDuraBu-#}
210215:<>{#todayaH .. 2 .. gopAlatApanIM nOmi yA kfzRaM svayamI- #}
210216:{#[Page3-159-b1+ 20]#}
210217:<>{#Svaram . karasTaratnasaMkASaM saMdarSayati sidDaye .. 3 .. Otka-#}
210219:<>{#gopAlopanizatsvapratipAdyaM pareSaM praRamati saditi .#} AK
210223:<>{#kurve 'maramAlAM karomyaham .. ahaMkAranAma .. darpo 'Bi-#}
210224:<>{#mAnAhaMkArasmayagarvamadAstaTA ..#} BC 436.
210227:<>{#mitavfttyarTasaMgraham . pariBAzApradIpArcistatropAyo nirU-#}
210228:<>{#pyate .. 3 ..#} AK p. 115.
210231:<>{#rUpyaM munIndrE raBimataPaladaM nirguRaM yadguRAQyam .#} IL.
210233:<P>{#aSvasenaM jinaM natvA gOtamAdigaRADipAn . caritra-#}
210235:<>{#dattaM BojanfpeRa tu . prabanDaM tasya vakzyAmi BavyAnAM boDa-#}
210236:<>{#hetave .. 2 ..#} Tod 147.
210241:<>{#mahAsOBAgyadAyakam . sADakAnAM ca pApaGnaM muktisarvEka-#}
210242:<>{#BAjanam .. 2 ..#} The Yantrakrama from the Śivālaya
210246:<>{#vaSe kurvanBagavanmArgadarSakaH .. 1 ..#} Ulwar Extr. 131.
210249:<P>{#vaSIkaraRamuccAwo mohanaM stamBanaM taTA . SAntikaM pO-#}
210250:<>{#zwikaM karma viviDAni maheSvari .. 1 ..#} IL.
210253:<>{#yukte na Saktitulyo BavennaraH .. 1 ..#} IL.
40:{#akzapAda#}¦ or {#akzacaraRa,#} a name of Gautama, the philo-
It can be easily replaced by the following
40:{#akzapAda#}¦ or {#akzacaraRa#}, a name of Gautama, the philo-
77:{#akzoByatIrTa,#}¦ formerly Govindaśāstrin, successor of Mā-
This is a bit tricky. I suggest
77:{#akzoByatIrTa#},¦ formerly Govindaśāstrin, successor of Mā-
I do not know what would be the effect on the code which extracts the headwords from the text file. There may need to be some tweaking. I am also OK with the following versions.
77:{#akzoByatIrTa#}¦, formerly Govindaśāstrin, successor of Mā-
Request others to suggest which one to convert to.
I do not know what would be the effect on the code which extracts the headwords
There is no current code that does this!
Such code was needed originally, when headwords were not known.
Now, however, the headwords are deemed to have been extracted and present in the metaline <k1>X<k2>Y
.
The broken bar was an attempt to identify the text from which headwords were derived, and should still be left, I think, for possible future analytical purposes.
Currently, there are no programmatic constraints on what goes before/after the broken bar.
Regex in csl-orig -
Output