Closed mialondon closed 1 year ago
Today's key topics:
Is anyone actively using the database in research for work to be substantially completed by mid-July? Demo of what's available What do we put on the roadmap for potential future development?
Griff's slides for today https://hackmd.io/@griff-rees/H1VN7S7r2
Access to the server requires some specific IP/SSH work.
User accounts don't have admin access by default.
Currently offers access to newspapers (metadata), not census, (newspapers) fulltext, Mitchells or gazetteer data.
'Data provider' values need to be updated for accuracy
Demo: Accessible (via auth and routing) from anywhere. Two views: django (python) and admin browse. Data (203m+ items) is from last summer. Titles are truncated to <= 100 chars. The schema is available (link pending - someone help out).
Demo. Accessible (via auth and routing) from anywhere. Two views: django (python) and admin browse. Data is from last summer. The schema is available (link pending - someone help out).
Searching slack, previous links to schemas https://dbdiagram.io/d/62bb46ba69be0b672c5d2a15
Example notebooks e.g. looking for extra long titles in newspapers metadata
DECISION: if there are research needs, focus will be on the cloud hosted deployment (for lwm people access only). Local install should still be an option. Both should be well documented and deployment as automated as possible
DECISION: For environmental scan, their work is not dependent on lwm-db
INFO: linking census and Mitchell's will take additional work (we can build on previous work)
DECISION: For environmental scan, their work is not dependent on lwm-db
Not for the paper currently being completed, but the next steps for ES work should make use of the DB
DECISION: For environmental scan, their work is not dependent on lwm-db
Not for the paper currently being completed, but the next steps for ES work should make use of the DB
Is that a roadmap thing, post-July 2023?
My suggestion based on the discussion:
Things needed for LwM:
Things needed for legacy:
Things we should leave out due to lack of time:
From @mcollardanuy's comment here: is the material for the Gazetteer not already more-or-less in place?
Seconding @thobson88's list (with Daniel's caveat acknowledged), and messaging you on slack now. Let's revisit priority list and sign off on Monday. Thank you for all you've done so far!
Thanks @thobson88 for your summary! Can I add 'review and confirm column labels' so that sources e.g. FMP, BNA, etc are accurately labelled / documented?
Closing this as it fed into the lwmdb roadmap
.
Please add notes as comments below.
See also https://github.com/alan-turing-institute/Living-with-Machines/issues/3211