cdli-gh / Framework

CDLI General issues & CDLI Framework Update project work packages
24 stars 15 forks source link

Database design #15

Closed epageperron closed 6 years ago

epageperron commented 7 years ago

Summary

In order to make the CDLI Platform more efficient, we need to completely redesign the database, optimizing it based on the data.

Design a fully relational model to store catalogue, textual data and new data from MTAAC.

Tasks

Other links or relevant information

This task is dependant on 2 things: 1) the FM Challenge (#16) 2) The requirements of a corpus analysis tool

After this issue is done, we will have a full functional first draft of the DB but we might still have to make later adjustments to the schema and thus also to the conversion script and the model layer of Cake.

Other notes : Don't forget to take into account revisions and retired P nos. Store CDLI-CoNLL version of ATF Bool version consistent or not Bool uncertain for period and provenience

Everything that will have a uri should have a specific field Rdb2rdf for the LOD: http://www.rdb2rdf.org/

Roadmap Data

πŸ—“ Start Date: 10-25-2017

πŸ—“ Expected Date: 02-09-2018

πŸ’ͺ Label: wp

πŸ“ˆ Progress (0-1): 0.3

See Gantt: http://cdli-dev.org/gantt/Framework/

epageperron commented 6 years ago

Bibliographic data management has been added to #79, a new issue.

Most of the work in this issue has been completed, @kbabu105 will be finishing up the conversion and scripts during the summer.