sanskrit-lexicon / COLOGNE

Development of http://www.sanskrit-lexicon.uni-koeln.de/
18 stars 3 forks source link

source for Scans of mahabharata, etc #383

Open funderburkjim opened 2 years ago

funderburkjim commented 2 years ago

Sources for some of the frequently mentioned editions of Sanskrit works have been identified here.

The link shows the title pages and asserts that the works have been digitized by Google.

These have the potential to be developed into link targets for references to

This note included here so the references may be more findable later when work is done.

A first step, in developing a link target, would be to separate the pdfs into separate one-page pdf files, with 'useful' file names (i.e. file names corresponding to the page citations in dictionaries referring to these editions).

Andhrabharati commented 2 years ago

@funderburkjim

The context makes me point to another post of mine- https://github.com/sanskrit-lexicon/COLOGNE/issues/371#issuecomment-971742611

gasyoun commented 2 years ago

would be to separate the pdfs into separate one-page pdf files, with 'useful' file name

Is there a real need for that split? There are thousands of pages there. To be handled manually? What kind of automation can be thought of @Andhrabharati ?

funderburkjim commented 2 years ago

The separation into individual page pdfs can be done with Adobe Acrobat, and the renaming of the generated single-page pdfs can be done by a Python script. So, by this estimation, relatively little 'manual' work is required.

gasyoun commented 2 years ago

The separation into individual page pdfs can be done with Adobe Acrobat,

This is a 20 min task.

renaming of the generated single-page pdfs can be done by a Python script

No idea how you the woodoo.