Clayton-TV / claytontv

2 stars 1 forks source link

Create and populate SQL database with current data #8

Closed Fehings closed 2 weeks ago

Fehings commented 1 month ago

Assuming we are going to wait for now to request the full data from the current claytonTV providers, we need to use the master spreadsheet to populate the database with as close to the real data as possible, so we have some data to use for testing.

Requirements:

Ettie-ClaytonTV commented 1 month ago

What data cleaning will need doing?

Missive00 commented 1 month ago

What fields will need adding to the SQL database? Currently I understand that videos have a title + three tags (book of the bible, speaker, ministry area). Do we want to expand the number of tags to allow videos to be searched in more detail?

Fehings commented 2 weeks ago

NB we talked about using dummy data for this rather than faffing on extracting all the real data (as it is missing values compared to the actual clayton TV database which we will request at some point). Will change this issue accordingly.

Fehings commented 2 weeks ago

As for data cleaning, that will depend on the format we get the data in from the company behind the current site, hard to say now but I would guess it will involve reformatting into a set of csv files corresponding to tables in our database for easy import, and potentially some checks for missing data and coming up with protocols for how to deal with that.