Step S504: The server sends a target request feedback to the client in response to the target request, and the client receives the target request feedback by which the server responses to the target request, where the target request feedback includes the information about the multiplex video stream that is obtained by performing the preset multiplexing processing on the original video stream corresponding to the target spatial object.
In the prior art, for content that a user requests to obtain, a server directly returns a corresponding video stream. Therefore, there may be a large amount of redundant video stream code, especially in some VR video scenes that have some repeated scenes. For example, in VR experience scenarios in tour and sightseeing, a color of the sky or a color and texture of a river are basically consistent. Therefore, the repeated content can be multiplexed, to reduce a bandwidth and time for transmitting video streams and improve efficiency.
In a possible implementation, the multiplexing description information further includes: the spatial location information, in the VR content component, respectively corresponding to the N multiplexed sub video streams. The multiplexing description information includes specific spatial location information of a plurality of multiplexed sub video streams. Therefore, the client may finally parse and present, based on such information in the multiplexing description information, the VR video that the user needs to watch.