According to embodiments, the video coded attribute bitstream, the video coded geometry bitstream, the video coded occupancy bitstream, and the patch sequence bitstream may be carried in the mdat box.
The structure of each file is described in detail below.
The V-PCC bitstream may be composed of a set of V-PCC units (See
Depending on the type of a V-PCC unit, the payload may include 2D video encoded information (for geometry, attribute, and occupancy map components of the encoded point cloud), non-video encoded information (for patch sequence data), or configuration and metadata information (for sequence parameter sets).
The general layout of the ISOBMFF V-PCC container may be configured as shown in the figure. The main design principle is to map V-PCC units in the V-PCC bitstream to individual tracks in the container file based on the type.
Based on this layout, the V-PCC ISOBMFF container may include the followings.
There may be a V-PCC track containing samples for carrying payloads of sequence parameter sets and a non-video encoded information V-PCC unit (e.g., a V-PCC sequence parameter set, V-PCC patch sequence data). This track may also provide a track that references other tracks containing samples for carrying the payload of a video compressed V-PCC unit.
V-PCC tracks include one or more restricted video-based tracks. Here, samples may include NAL units for video-coded elementary streams for geometry, attributes, and occupancy map data.