According to a tenth embodiment there is provided a video decoder configured for encoding a bitstream comprising a base layer, a first enhancement layer and a second enhancement layer, wherein said video decoder is further configured for: interpreting, from the bitstream, an indication indicating both the base layer and the first enhancement layer used for prediction for the second enhancement layer; interpreting, from the bitstream, an indication of a first set of prediction types that is applicable from the base layer to the second enhancement layer, wherein the first set of prediction types is a subset of all prediction types available for prediction between layers; interpreting, from the bitstream, an indication of a second set of prediction types that is applicable from the first enhancement layer to the second enhancement layer, wherein the second set of prediction types is a subset of all prediction types available for prediction between layers; and decoding said second enhancement layer using only said first set of prediction types from the base layer and said second set of prediction types from the first enhancement layer.