Open funderburkjim opened 2 years ago
@funderburkjim
The context makes me point to another post of mine- https://github.com/sanskrit-lexicon/COLOGNE/issues/371#issuecomment-971742611
would be to separate the pdfs into separate one-page pdf files, with 'useful' file name
Is there a real need for that split? There are thousands of pages there. To be handled manually? What kind of automation can be thought of @Andhrabharati ?
The separation into individual page pdfs can be done with Adobe Acrobat, and the renaming of the generated single-page pdfs can be done by a Python script. So, by this estimation, relatively little 'manual' work is required.
The separation into individual page pdfs can be done with Adobe Acrobat,
This is a 20 min task.
renaming of the generated single-page pdfs can be done by a Python script
No idea how you the woodoo.
Sources for some of the frequently mentioned editions of Sanskrit works have been identified here.
The link shows the title pages and asserts that the works have been digitized by Google.
These have the potential to be developed into link targets for references to
This note included here so the references may be more findable later when work is done.
A first step, in developing a link target, would be to separate the pdfs into separate one-page pdf files, with 'useful' file names (i.e. file names corresponding to the page citations in dictionaries referring to these editions).