In those example embodiments that make use of such predictions, the EL encoder (116) generates, based at least in part on the second multiplexed 3D image frame (108-V) and the prediction reference image frame, multiplexed 3D image residuals or differences between the prediction reference image frame and the second multiplexed 3D image frame 108-V and stores the image residuals in the enhancement layer video signal to be carried in the EL FC video stream (112-3). Further, based on the prediction and coding process, the RPU (114) may generate coding information which can be transmitted to a decoder as metadata using an RPU stream (112-2).
FIG. 1B illustrates a multi-layer video decoder (150) that receives input video signals in which high spatial frequency content from an original video sequence (which may be the input video sequence as discussed in connection with FIG. 1A) in two orthogonal directions has been preserved in complementary image data carried in the enhancement layer and in the base layer, respectively, in accordance with an embodiment. In an example embodiment, the input video signals are received in multiple layers (or multiple bitstreams). As used herein, the term “multi-layer” or “multiple layers” may refer to two or more bitstreams that carries input video signals having one or more logical dependency relationships between one another (of the input video signals).