MPEGGroup / FileFormat

MPEG file format discussions
21 stars 0 forks source link

ISOBMFF Presentation terms #8

Open dwsinger opened 4 years ago

dwsinger commented 4 years ago

The term "presentation" is used throughout the spec (~180 times) with different meanings. This is confusing. We propose to clarify it when possible and to replace it in other cases.

When used standalone, it usually means "rendering" or "a set of related media" as in the introduction:

"The ISO Base Media File Format is designed to contain timed media information for a presentation in a flexible, extensible format that facilitates interchange, management, editing, and presentation of the media. This presentation may be ‘local’ to the system containing the presentation, or may be via a network or other stream delivery mechanism."

It is currently defined as follows:

"one or more motion sequences, possibly combined with audio"

This definition is outdated. We suggest replacing the definition with the simple:

"set of related media data "

We also suggest rephrasing the introduction which has too many 'presentation'.

There is no formal term in the definition section but the semantics of the 'tfra' box (8.8.10.3) says:

"The presentation time is the composition time of a sample, as adjusted by any edit list."

We suggest moving this text as a term definition in the definition clause.

We find also that:

"presentation times are in the movie timeline" (8.6.13.1 Segment Index Box definition)

which seems consistent with our understanding and the definition above. We suggest moving that text as a note in the definition clause. But, we find in Annex A, A.4:

" The exact presentation time (its time-stamp) of a sample is defined by summing the durations of the preceding samples."

which is confusing "time stamp" and "time" and more importantly "decoding time stamp" and "presentation time".

We suggest fixing that sentence as follows, by replacing:

"The exact presentation time (its time-stamp) of a sample is defined by summing the durations of the preceding samples."

with:

"The exact decoding time stamp of a sample is defined by summing the durations of the preceding samples."

Sometimes the term "movie presentation time" is used. We suggest removing "movie" (or always using it) as the "presentation time" is indeed in a "movie time".

The terms are "earliest presentation time" and "end presentation time" are used but don't seem ambiguous as they do consider the movie timeline (i.e. with edit list).

The different sections about RTP use the term of "presentation time stamp" with a different meaning:

This is wrong and should be fixed.

We suggest defining the term "composition order" (or "output order") as above and to use it consistently. Similarly, we suggest defining "decoding order" and using it consistently (versus "decode order").

Similarly, in 11.2, it says:

" The main file containing the metadata may use other files to contain media-data. These other files may contain header declarations from a variety of standards, including this one. If such a secondary file has a metadata declaration set in it, that metadata is not part of the overall presentation. This allows small presentation files to be aggregated into a larger overall presentation by building new metadata and referencing the media-data, rather than copying it."

But in 6.1.2, the COR mixes this term with the notion of segment:

"A presentation file logically includes all its segments."

dwsinger commented 4 years ago

We believe this should be fixed. We suggest:

In "6.1.2 Object Structure", it says:

"The sequence of objects in the file shall contain exactly one presentation metadata wrapper (the MovieBox)."

We suggest replacing it with:

"The sequence of objects in the file shall contain exactly one MovieBox."

The sentences using this term can easily be removed or the term replaced by "media information".

cconcolato commented 4 years ago

Just edited the above comments for readability.

dwsinger commented 3 years ago

This is improved in the 7th edition but more work is needed. a 'pull request' (edited file) would be appreciated.