A coding tool known as inside-view motion prediction (IVMP) may operate as follows. In IVMP mode, the motion information of the co-located block (e.g. a co-located macroblock) in a texture view component, for example including mode information (e.g. mb_type and sub_mb_type), reference indices, and motion vectors, may be reused by the depth view component of the same view. Whether a block uses the IVMP mode may be signaled; for example, a flag may be signaled in each macroblock or macroblock partition to indicate whether it uses the IVMP mode. If the spatial resolution of the depth view component differs from that of the texture view component, the motion vectors of the texture view component may be scaled proportionally to the ratio between the spatial resolutions of the depth view component and the texture view component when they are used as the motion vectors of the co-located block or macroblock of the depth view component.
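As a minimal illustration of the scaling just described, the following Python sketch scales a texture-view motion vector for reuse by the co-located depth block when the two view components have different spatial resolutions; the function name, parameter layout, and integer rounding are illustrative assumptions, not a normative process.

    def scale_texture_mv_for_depth(mv_x, mv_y,
                                   texture_width, texture_height,
                                   depth_width, depth_height):
        # Hypothetical helper: scale a texture-view motion vector by the ratio
        # of the depth resolution to the texture resolution so that it can be
        # reused as the motion vector of the co-located depth block (IVMP).
        scale_x = depth_width / texture_width
        scale_y = depth_height / texture_height
        return round(mv_x * scale_x), round(mv_y * scale_y)

    # Example: half-resolution depth (960x540) relative to 1920x1080 texture;
    # a texture motion vector (16, -8) maps to (8, -4) for the depth block.
    print(scale_texture_mv_for_depth(16, -8, 1920, 1080, 960, 540))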
In the case of joint coding of texture and depth for depth-enhanced video, view synthesis can be utilized in the loop of the codec, thus providing view synthesis prediction (VSP). In VSP, a prediction signal, such as a VSP reference picture, is formed using a DIBR or view synthesis algorithm, utilizing texture and depth information. For example, a synthesized picture (i.e., a VSP reference picture) may be introduced into the reference picture list in a manner similar to inter-view reference pictures and inter-view only reference pictures. Alternatively or in addition, a specific VSP prediction mode for certain prediction blocks may be determined by the encoder, indicated in the bitstream by the encoder, and applied by the decoder as concluded from the bitstream.
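The following Python sketch illustrates, under assumed data structures, how a synthesized VSP reference picture could be appended to a reference picture list alongside inter-view reference pictures; the function name, arguments, and list ordering are assumptions for illustration and do not represent the normative reference picture list construction process.

    def build_reference_picture_list(temporal_refs, inter_view_refs, vsp_picture=None):
        # Hypothetical sketch: the synthesized (VSP) picture is introduced into
        # the reference picture list in a similar way as inter-view reference
        # pictures; the ordering shown here is an assumption.
        ref_list = list(temporal_refs) + list(inter_view_refs)
        if vsp_picture is not None:
            ref_list.append(vsp_picture)
        return ref_list

    # Example: two temporal references, one inter-view reference, one VSP picture.
    print(build_reference_picture_list(["T0", "T1"], ["IV0"], "VSP0"))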