In the embodiments of the present disclosure, the server multiplexes and encapsulates, based on viewport location information in request information of the client, a video stream related to the viewport location information, and transmits an encapsulated multiplex video stream to the client. The video stream related to the viewport location information is a video stream having video content that partially or entirely overlaps with content of a viewport range requested by the client. To be specific, the server performs preset multiplexing processing on a video stream that responds to the request, to respond to the request from the client. This reduces a quantity of requests from the client, and also reduces a quantity of responses from the server. In addition, this ensures simultaneous arrival of video stream information of fields of view of a same moment, thereby reducing a time of waiting for all video streams to be separately received, and reducing a presentation delay of the fields of view.
To describe the technical solutions in the embodiments of the present disclosure more clearly, the following briefly describes the accompanying drawings required for describing the embodiments or the prior art.