FreeUKGen / FreePROBATE

For everything related to the FreePROBATE project
Apache License 2.0
0 stars 0 forks source link

FreePROBATE MEGA epic #1

Open PatReynolds opened 6 years ago

PatReynolds commented 6 years ago

Done (this sprint)

  1. Create tools/pathway for Digital Photograph ⇒ Machine learning ⇒ CSV code
  2. Document how to do this
  3. Test document to ensure we understand what we need to do

Further steps A. Create structure for the digital photographs of the books B. Place existing books in the structure C. Photograph remaining books

  1. Create a branch of Myopic Vicar
  2. Write ingestion script User can search on FreePROBATE by name, county and date
  3. Automate OCR, processing and ingestion [after 5, but before 8]. Add link to image Options (a) cropped portion (b) full page (c) hybrid, full page with cropped portion clear, but remaining part of page greyed
  4. Use geocoded linked open data and personal names from our other sites to produce master lists against which to check quality of data, marking records that are suspect.
  5. Create correction of OCR app (cf FreeREG #1362 - may be adaptation of this, or vice versa) Options (a) corrected data is on FreePROBATE as additional, rather than amended version, or, (b) corrected data is the only version (as is the case with FreeREG now.
Captainkirkdawson commented 5 years ago

Examples of Norfolk Probate Images 4033344_00459 4033344_00458 4033344_00461 4033344_00462 4033344_00460

PatReynolds commented 5 years ago

Hi @Captainkirkdawson adding such images would be quite different from the modern half FreePROBATE project (the subject of this issue). . Earlier wills (handwritten, rather than already transcribed, and printed) are being considered, but needing an entirely funded project to do so (preliminary bid sent to the Heritage Lottery Fund). Documentation is in the FreePROBATE folder - shared now. we do need people like yourself with familiarity to advise on the wider context.