wibarab / featuredb


WIBARAB feature database

About WIBARAB

WIBARAB is a project in the field of Arabic dialectology. It consists of various regional sub-projects (four PhD projects) and a large database of Bedouin-type dialects of Arabic.

The Feature Database will be the main point of integrating the results of the sub-projects. In this repository we collect the primary data of the database in TEI/XML.

Principal Investigator: Stephan Procházka (University of Vienna)
National Cooperation Partner: Charly Mörth (Austrian Academy of Sciences)

See https://wibarab.acdh.oeaw.ac.at/ for more information.

Contact us at wibarab@oeaw.ac.at or follow us on Twitter.

Status of the data

THIS IS PRELIMINARY DATA AND COPYRIGHTED MATERIAL!

If you want to use any material in this repository, please contact us at wibarab@oeaw.ac.at.

This status will change at the end of the project.

Directory Structure

| Directory | Content | Remarks |
| --- | --- | --- |
| 001_src | Original sources | Any external source data coming to the project. |
| 082_scripts_xsl | XSLT scripts | Various XSLT scripts to convert the data. |
| 102_derived_TEI | TEI-XML documents | TEI documents derived from an automated conversion process (from 001_src or elsewhere). |
| 010_manannot | Manually annotated TEI-XML documents | TEI documents which are manually annotated / curated / edited. Automated processes are not expected to write into this directory. We want to make sure that a human curator has validated the data in this directory and that nothing manually curated is overwritten by a script. |
| 802_tei_odd | TEI customization (ODD) | This is the source of truth for the WIBARAB FeatureDB Schema and the HTML documentation generated from it. |
| 804_xsd | XML Schemas | These are derived from the ODD in 802_tei_odd. Each version of the schema should bear its number in the file name. |
| 850_docs | Documentation | Further data documentation, encoding guidelines etc. |
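
The rule that automated processes must not write into 010_manannot could be enforced by a small guard in any conversion script. The following is only a minimal sketch in Python, not part of this repository: the function name `safe_output_path` and the way the repository root is passed in are assumptions made for illustration. Only the directory name 010_manannot comes from the table above.

```python
from pathlib import Path

# Directory name taken from the table above; everything else is hypothetical.
MANUAL_DIR = "010_manannot"


def safe_output_path(repo_root: Path, relative_target: str) -> Path:
    """Resolve an output path and refuse it if it lies inside the curated directory."""
    root = repo_root.resolve()
    # Resolving the target collapses ".." segments, so a path such as
    # "102_derived_TEI/../010_manannot/x.xml" is caught as well.
    target = (root / relative_target).resolve()
    protected = root / MANUAL_DIR
    if target == protected or protected in target.parents:
        raise PermissionError(f"refusing to write into {MANUAL_DIR}: {target}")
    return target
```

A conversion script would call `safe_output_path(repo_root, "102_derived_TEI/some_file.xml")` before opening the file for writing (the file name here is a placeholder), so that an accidental target inside 010_manannot fails loudly instead of overwriting curated data.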

Schema Development

At this point, the model of the WIBARAB Feature Database schema is still evolving to a certain extent while new and existing data are being curated. To make sure that the transition from one version of the schema to the next happens in a structured manner, we have set up the following rules:

Schema release workflow

When a new version of the schema is to be released:

About this file

This README file has a long-winded and dark history of editing. If you dare, you can check it out here.