Closed bduga closed 5 years ago
Possible use-case (?): free audio book sample/preview, first two "chapters" only. The 'reading order' is composed of three items:
1) first chapter (link to audio resource ch1.mp3
)
2) second chapter (link to audio resource ch2.mp3
)
3) "buy me" page (link to HTML resource gimme-more.html
)
I expect that the reading systems / specialised user agents which will implement support for the "audio profile" of web publications (packaged or not) will offer audio-centric user experiences (typical playback-oriented user interface). I suspect that some of these audiobook "players" may in fact not be capable of rendering web pages at all, either by design (e.g. mobile app with no built-in webview) or due to technical limitations (hardware device with physical buttons, basic visual display, or no screen at all).
A mobile app should be able to launch an external web "activity" (e.g. system / installed web browser) in order to present the "buy me" page to the user, assuming the gimme-more.html
document and its associated resources are available online via a public URL. However, any authentication + credentials context would be lost when transitioning from the app to a standalone web browser, so I think that this design pattern would be unrealistic in the real world. Plus, this would not work at all in the "packaged web publication" case.
So why not just state: "user agents / reading systems that do not support non-audio resources listed in the reading order should ignore/skip these resources" ?
I think you're making several very good points @danielweck but this is a very complex issue:
encodingFormat
, they won't necessarily know in advance the media type of a resourceBased on all of these points, I think we should carefully consider an audiobook profile where:
resources
and links
Such a profile would be immediately (as in not in 2020+) useful for the industry (for the supply chain as well as end users), since there's a complete lack of standard format for audiobooks.
Hadrien’s proposal seems really good. A UA which is able to switch between Audio resources and html resources will not stop when processing a « Book » publication, an audio-only UA will, as it will filter non « audiobook » publications.
This issue was discussed in a meeting.
The following is IMO a proper way to solve this issue: The spec states that an Audiobook publication SHOULD contain audio resources only in its reading order. And it is not expected from UAs supporting this profile that other types of resources will be played/read (be it in the reading order, resources or links). A non-normative note adds that other types of ressources MAY be present in an Audiobook so that advanced UAs may make use of them; and that in such case it is recommended to insert them in "resources".
The SHOULD means that the schema will not constrain the resource type for an Audiobook. We don't constrain the behavior of the UA but we state expectations (non-expectation in this case).
One of the most important thing is the ability to detect when the context should switch to "audio-only" or "mostly-audio".
With our current specification, we can't look at the readingOrder
and know what to expect since it won't necessarily contain any information concerning the media type of our resources (something that I still find problematic).
We could of course fetch every single resource to figure out what's returned, but this is not the most effective way of dealing with this issue.
Inspecting the value of type
is therefore a good fallback and it seems reasonable to expect audio in the reading order when it is set to Audiobook
.
I don't know if we'll end up with a specific profile or just a best practice document (that's up to the Audiobook TF to decide cc @wareid), but this should be within the scope of such documents.
We could of course fetch every single resource to figure out what's returned, but this is not the most effective way of dealing with this issue.
I would be fine to restrict the current approach described in the current draft so that we would have a series of common suffixes that the processor would recognize to set the media type (after all, these suffixes are part of the media type registration) and the processor would remove all the other ones. I believe that if we do recognize html, svg, jpeg, png, gif, mp3 and maybe a few more common datatypes, we would keep the goal of simplicity for the usual cases and would make the life of implementations easier.
Note, however, that in general, and unfortunately, the only secure way is to fetch the resource and see what is returned, because even if PublicationLink
objects are used instead of simply URL-s, the media type may be missing or simply wrong. The Web is messy...
I would be fine to restrict the current approach described in the current draft so that we would have a series of common suffixes that the processor would recognize to set the media type (after all, these suffixes are part of the media type registration) and the processor would remove all the other ones
I'm not a super big fan of that approach either:
This issue was discussed in a meeting.
This issue was discussed in a meeting.
PROPOSAL: Leave mixed media content out of the readingOrder
and specify that supplemental content live in resources
. This will allow it to be referenced in the TOC, but if a user agent cannot support it, it can be skipped.
@wareid: is it a SHOULD or MUST? I mean is it
If the publication is of type
audiobook
, then the reading order MUST NOT include any content other than audio.
or is it SHOULD NOT? The behaviour of the user agent is very different.
The Publishing Working Group just discussed https://github.com/w3c/wpub/issues/322
.
Resolution: Leave mixed media content out of the readingOrder and specify that supplemental content live in resources. This will allow it to be referenced in the TOC, but if a user agent cannot support it, it can be skipped.
There will be issues opened to address the TOC and rel questions.
This issue was discussed in a meeting.
RESOLVED: Leave mixed media content out of the readingOrder and specify that supplemental content live in resources. This will allow it to be referenced in the TOC, but if a user agent cannot support it, it can be skipped.
During a discussion of issue #320 the question of non-audio content (for example, HTML) in the linear reading order of an audiobook came up. Specifically, is that allowed, and if it is what is the expected behavior? Potential answers are "yes" and "we are specifying a format not behaviors". Discuss!