SPEC: Format of the viewable/editable HTML file

Whatever the starting point, each input document (OCR of an image, new document, HTML, .docx, etc.) has a corresponding HTML file that can be consulted in UpBrowser or edited in UpEdit.
This file is "raw" HTML, i.e. it contains no upScriber-specific elements, such as the various spans that allow phrase-by-phrase navigation.
The file is the equivalent of the contents of the <body> element of a webpage.
There are no calls to external scripts or CSS files.
The file is stored in the S3 bucket.
All images in the HTML file are stored in base64 format, i.e. there are no external image files.
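
The constraints above (no external scripts, no external CSS files, images embedded as base64) could be checked mechanically. The following is a minimal sketch only, using Python's standard html.parser module; the names SelfContainedChecker and find_external_references are hypothetical and are not part of upScriber.

```python
from html.parser import HTMLParser


class SelfContainedChecker(HTMLParser):
    """Collects references that would make an HTML fragment depend on external files."""

    def __init__(self):
        super().__init__()
        self.problems = []

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names before calling this.
        attrs = dict(attrs)
        if tag == "script" and attrs.get("src"):
            self.problems.append(f"external script: {attrs['src']}")
        elif tag == "link" and "stylesheet" in (attrs.get("rel") or "").lower():
            self.problems.append(f"external stylesheet: {attrs.get('href')}")
        elif tag == "img":
            src = attrs.get("src") or ""
            # An embedded image is expected to use a base64 data URI.
            if not (src.startswith("data:") and ";base64," in src):
                self.problems.append(f"image not embedded as base64: {src}")


def find_external_references(html_text):
    """Return a list of problems; an empty list means no violations were found."""
    checker = SelfContainedChecker()
    checker.feed(html_text)
    checker.close()
    return checker.problems


if __name__ == "__main__":
    sample = '<p>Hello</p><img src="data:image/png;base64,iVBORw0KGgo=" alt="x">'
    print(find_external_references(sample))
```

This sketch only inspects script, link and img tags. It does not cover other ways of pulling in external resources (for example url(...) references inside inline styles), so it should be read as an illustration of the rules rather than a complete validator.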
