Spatial Audio RFC lacks a canonical decoding algorithm

The Spatial Audio RFC proposes a very useful standard for encoding, but the spec makes no recommendation for decoding Spatial Audio in a consistent way (i.e. to discrete PCM channels). Without one, there is no "correct" way to play Spatial Audio on a device or to convert it to channel-mapped audio. That seems like a significant gap for an RFC.

I understand that clients may experiment with their own "secret sauce" for decoding/downmixing with proprietary psychoacoustics. This doesn't replace the need for the spec to have an opinion on the canonical decoding & downmixing algorithm, though.

Decoding to mono seems unambiguous: simply amplify the W component and drop the remaining X, Y, and Z components.
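To make the mono case concrete, here is a minimal sketch of what I mean. It assumes first-order input with one tuple per sample in ACN channel order (W, Y, Z, X); that ordering and the `gain` parameter are my assumptions for illustration, and the right gain value depends on how W is normalized.

```python
def decode_mono(frames, gain=1.0):
    """Downmix first-order ambisonic frames to mono.

    frames: iterable of (w, y, z, x) sample tuples (ACN order, assumed).
    gain:   hypothetical scale factor applied to W.
    """
    # Keep only the omnidirectional W component; Y, Z and X are dropped.
    return [gain * frame[0] for frame in frames]


if __name__ == "__main__":
    samples = [(0.5, 0.1, 0.2, 0.3), (-0.25, 0.0, 0.0, 0.4)]
    print(decode_mono(samples, gain=2.0))
```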

The other interesting cases for decoding are:

- Decoding to 2-channel stereo
  - At standard, centered angle (front-facing)
  - At a specific angle value
- Decoding to 5.1-channel surround
  - At standard, centered angle (front-facing)
  - At a specific angle value
- Decoding to 7.1-channel surround
  - At standard, centered angle (front-facing)
  - At a specific angle value
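As a strawman for the stereo cases, one simple option is to rotate the sound field by the listener's yaw and then sample it with two virtual cardioid microphones. This is only a sketch of one possible approach, not something the RFC specifies. It assumes first-order input in ACN order (W, Y, Z, X) with SN3D normalization, azimuth measured counterclockwise (positive to the left), and hypothetical parameter names `yaw_deg` and `mic_angle_deg`.

```python
import math


def decode_stereo(frames, yaw_deg=0.0, mic_angle_deg=90.0):
    """Decode first-order frames to (left, right) with virtual cardioids.

    frames:        iterable of (w, y, z, x) tuples (ACN order, SN3D assumed).
    yaw_deg:       listener heading; 0 is front-facing, positive turns left.
    mic_angle_deg: azimuth of the left virtual mic; the right mic mirrors it.
    """
    yaw = math.radians(yaw_deg)
    mic = math.radians(mic_angle_deg)
    out = []
    for w, y, z, x in frames:
        # Rotate the horizontal components so the listener's heading
        # becomes the new front. Z (height) is ignored in this sketch.
        xr = x * math.cos(yaw) + y * math.sin(yaw)
        yr = -x * math.sin(yaw) + y * math.cos(yaw)
        # Cardioid pointing at azimuth a: 0.5 * (W + X*cos(a) + Y*sin(a)).
        left = 0.5 * (w + xr * math.cos(mic) + yr * math.sin(mic))
        right = 0.5 * (w + xr * math.cos(mic) - yr * math.sin(mic))
        out.append((left, right))
    return out


if __name__ == "__main__":
    hard_left = [(1.0, 1.0, 0.0, 0.0)]  # unit source at 90 degrees left
    print(decode_stereo(hard_left))               # front-facing listener
    print(decode_stereo(hard_left, yaw_deg=90.0))  # listener facing the source
```

With `yaw_deg=0` this covers the centered, front-facing case; a non-zero yaw covers the "specific angle" case. The 5.1 and 7.1 layouts would need their own speaker angles and probably a better decoder than plain cardioids, which is exactly why a canonical recommendation would help.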

If nothing else, the first item on this list (a canonical stereo decoding/downmixing algorithm without panning) would provide tremendous value. For example, it would let clients without spatial audio support decode the same Spatial Audio track, so publishers would not need to provide a second stereo audio track.
