sanskrit-lexicon / COLOGNE

Development of http://www.sanskrit-lexicon.uni-koeln.de/
18 stars 3 forks source link

Cologne Digital Sanskrit Dictionaries Harvard Bibliography #55

Closed gasyoun closed 9 years ago

gasyoun commented 10 years ago

I guess it's the right time to add bibliography to the pages of different scans, to the web pages. Some bibliographical data is spread around the XML files, but in most cases not enough to quote. And one would not want (and know) to download a .zip archive, to find one .xml file, to get the reference out of it, right? Jim, I've collected it for you in a single place. It's from and I could even make an HTML page like:

WILSON, H. (1832). A dictionary in Sanskrit and English: Translated, amended, and enlarged from an original compilation. Calcutta, The Education Press. [WorldCat]

Not always I could locate the exact edition at https://www.worldcat.org but in most cases I did and in the Hardard-styled references the data what we actually need is there, not always identical to records from https://www.worldcat.org

Shalu411 commented 10 years ago

:) I do not know if its related, but please see here- Dsal people put it this way- apte1 apte2

funderburkjim commented 10 years ago

There is for each dictionary X a file Xheader.xml.
Here it is for Borooah (borheader.xml):

<?xml version="1.0" encoding="UTF-8"?>
<!-- %{date2) -->
<!--<?oxygen RNGSchema="file:tei_corpus.rnc" type="compact"?>-->
<?oxygen RNGSchema="http://www.tei-c.org/release/xml/tei/custom/schema/relaxng/tei_all.rnc" type="compact"?>
<TEI xmlns="http://www.tei-c.org/ns/1.0" xmlns:xi="http://www.w3.org/2001/XInclude">
  <teiHeader xml:lang="en-Latn">
    <fileDesc>
      <titleStmt>
        <title>English-Sanskrit Dictionary</title>
        <editor>Jim Funderburk</editor>
        <editor>Thomas Malten</editor>
        <principal>Thomas Malten</principal>
        <funder>DFG - Deutsche Forschungsgemeinschaft</funder>
        <!--<funder>Glasenapp Stiftung</funder>-->
        <!--<funder>ECCP, European Union</funder>-->
      </titleStmt>
      <editionStmt>
         <edition>digital edition, bor.txt <date>2014</date></edition>
        <respStmt>
          <resp>markup</resp>
          <name>Thomas Malten</name>
        </respStmt>
      </editionStmt>
      <extent>3.5 MB</extent>
      <publicationStmt>
        <publisher>University of Cologne</publisher>
        <pubPlace>Cologne, Germany</pubPlace>
        <date when="2014">2014</date>
        <availability>
          <p>All rights reserved other than those granted under the Creative Commons Attribution
            Non-Commercial Share Alike license available in full at <ptr
              target="http://creativecommons.org/licenses/by-nc-sa/3.0/legalcode"/>, and summarized
            at <ptr target="http://creativecommons.org/licenses/by-nc-sa/3.0/"/>. Permission is
            granted to build upon this work non-commercially, as long as credit is explicitly
            acknowledged exactly as described herein and derivative work is distributed under the
            same license.</p>
        </availability>
      </publicationStmt>
      <seriesStmt>
        <title level="s"></title>
        <respStmt>
          <resp></resp>
          <name></name>
        </respStmt>
        <idno></idno>
      </seriesStmt>
      <notesStmt>
        <note></note>
      </notesStmt>
      <sourceDesc>
        <biblStruct xml:id="BOR">
          <monogr>
            <author>Anundoram Borooah</author>
            <author> </author>
            <title type="main" level="m">English-Sanskrit Dictionary</title>
            <title type="subordinate" level="m"></title>
            <edition></edition>
            <imprint>
              <pubPlace>Publication Board, Assam</pubPlace>
              <publisher>TODO</publisher>
              <date>1877 , reprinted in 1971</date>
            </imprint>
            <extent>772 pages</extent>
          </monogr>
        </biblStruct>
      </sourceDesc>
    </fileDesc>
    <encodingDesc>
      <!--projectDesc (project description) describes in detail the aim or purpose for which an electronic file was encoded, together with any other relevant information concerning the process by which it was assembled or collected.-->
      <projectDesc>
        <p><quote>The Cologne Digital Sanskrit Lexicon (CDSL) project undertakes to digitize and
            merge all published Sanskrit dictionaries. Its aim is
            to provide a basic lexical corpus to provide an easy access to all available meanings of
            Sanskrit words and to allow the creation of a number of computer programs that will help
            to analyze Sanskrit texts. <bibl><author>Malten</author><date>1997</date><title>Review
                of the Cologne Digital Sanskrit Lexicon project</title></bibl></quote></p>
      </projectDesc>
      <editorialDecl>
        <p>
          <date>2014</date></p>
      </editorialDecl>
      <tagsDecl> </tagsDecl>
    </encodingDesc>
    <profileDesc>
      <creation> </creation>
      <langUsage>
        <language ident="de-Latn">English</language>
        <language ident="sa-Latn-x-AS">Sanskrit</language>
      </langUsage>
      <textClass>
        <keywords scheme="#lcsh">
          <list>
            <item>Sanskrit language -- Dictionaries -- English language</item>
          </list>
        </keywords>
        <classCode scheme="#lc"></classCode>
      </textClass>
    </profileDesc>
    <revisionDesc>
      <change>0</change>
    </revisionDesc>
  </teiHeader>
  <facsimile>
    See downloads bor_bookmark.pdf,  
   from http://www.sanskrit-lexicon.uni-koeln.de
  </facsimile>

  <text>
    <body>
      <xi:include href="bor.txt">
        <xi:fallback>
         See download bor.txt from http://www.sanskrit-lexicon.uni-koeln.de
        </xi:fallback>
      </xi:include>
    </body>
  </text>
</TEI>

This is modeled after a TEI form which Peter developed for MW.

Currently, this file is available as part of the bortxt.zip download (and similarly for other dictionaries). The file contains several types of information, including bibliographic information. The bibliographic information was inserted by me based upon the scans that appear in the title pages; these scans are available as part of the Front Matter in the documentation. For instance, for Borooah, at http://www.sanskrit-lexicon.uni-koeln.de/scans/csldoc/dictionaries/bor.html.

I think the files like borheader.xml already contain the relevant bibliographic information in a standard form (TEI based).

If displays based on these Xheader.xml files are needed, they might be based on displays of similar bibliographic data files that Peter has worked on. I think the displays on his site, http://sanskritlibrary.org/tomcat/sl/TextsList, are of this type.

gasyoun commented 10 years ago

Jim, I must say that I love the idea of TEI, but to work with it is not always such an easy way. As the reprints do not add any valuable data (like lists of corrections) - I want to quote the original source. And quote it without copypasting from an XML file. Are you sure people will look for book names, will hunt them? TEI is good for documentation and markup. It's of less use when we are on the web. "For instance, for Borooah, at http://www.sanskrit-lexicon.uni-koeln.de/scans/csldoc/dictionaries/bor.html." - I would want to see bibliographic data everywhere. On web, inside front pages /csldoc/, maybe even in download sections as a Harvard reference ready to be quotes. What you have after copy posting is not quotable. It's a LEGO, DIY.

gasyoun commented 9 years ago

Partly implemented, issue dead.