In the examples shown in FIG. 6C and 6D, multiple assets are encoded (e.g., by encoder 530) in a single data stream. This encoding can comprise interlacing the multiple assets; concatenating the multiple assets; and/or employing any other suitable technique. In some examples, encoding multiple video assets may comprise composing a video from respective time-matched frames of two or more input video assets. For example, a first frame of the composed video can comprise video data from a first frame of a first video asset alongside video data from a first frame of a second video asset. Corresponding input frames can be scaled and positioned in the composed video, such as described further below; the composed video can be encoded (e.g., on a frame-by-frame basis) by an encoder, and the encoded data delivered to device 550 as described above. Other suitable implementations are contemplated, and specific implementations of encoding multiple assets in a single data stream can vary depending on a codec used.