Make MP4 Descriptor code cope with erroneous input

MP4 descriptors have there own structure with a type and a variable size field. They are typically not used much, except for conveying information about MPEG-4 audio codecs, in particular the DecoderSpecificInformation descriptor.

The specification in parts 1 and parts 3 of MPEG-4 is rather hard to read, and there exists erroneous implementation in the wild. The mp4ff implementation is also not complete.

There have been two issues reported on audio descriptors that mp4ff cannot handle:

331 is an example that lacks the SLConfigDescriptor
348 is an example where the two bytes (of three) of the SLConfigDescriptor are inside the DecoderConfigDescriptor

Since the descriptor information is typically not relevant beyond the DecoderSpecificInformation it should be possible to make the Descriptor parsing and writing code more general so that it can handle undefined field, or bad lengths and store the data in special constructs like "trailing_unknown_data".

A general approach could look something like:

Restructure the descriptor handling to be similar to the MP4 Box handling with an interface and general parsing code that parses the tag and length field and dispatches a decoder depending on the tag.
Support for unknown tags
Support for trailing unknown descriptors but also arbitrary byte data in ESDescriptor and DecoderConfigDescriptor.
It should be possible to parse (Decode) and then write (Encode) the erroneous samples in the two issues mentioned.
Add Info output of descriptors similar to Boxes (include unknown/bad data in hex format)
When generating descriptors from scratch, they should follow the specifications.

Eyevinn / mp4ff

Make MP4 Descriptor code cope with erroneous input #350

331 is an example that lacks the SLConfigDescriptor

348 is an example where the two bytes (of three) of the SLConfigDescriptor are inside the DecoderConfigDescriptor