pwhipp / dsnac

This is a highly extendable open online book archive service.
0 stars 0 forks source link

Default Book Reader Start Page #19

Closed oldrepository closed 9 years ago

oldrepository commented 9 years ago

Each scanned book has a scandata.xml file which includes data regarding pages, the page that is marked as the book cover, thats the page where the book should open in the book reader. Right now its going to the very first scanned image.

pwhipp commented 9 years ago

Unfortunately not all your books have scandata.xml, some use the book_identifier followed by _scandata.xml and others have a scandata.zip compressed folder (that looks like it usually includes a scandata.xml) and in at least one case the scandata is missing completely...

I dealt with that using a tolerant importer so the books under media all have scandata.xml (except where no source at all was present - not much I can do there).

You can review the actual scandata.xml files under the books under media in admin (e.g. here).

Looking at the scandata.xml, I can see pages marked as the 'bookStart' - there is nothing in the xml about the cover. Unfortunately, the bookstart seems to be set on the first scanned page or absent or, in the case of the 'punjab' book it is set for every single page.

I've created a scanned_start_page integer field associated with the book. You can edit this field as necessary.

The field defaults to 0 (the very first scan page is zero - you cannot rely on the printed page numbers).

I've run an import process that sets the field to the page I used for the cover page.

When the scans use the bookStart tag correctly, the importer should get the value.

oldrepository commented 9 years ago

great work, thanks