In one embodiment, the feature map output by the convolution layer of the recognition model closely reflects the features extracted from the corresponding input image. Therefore, the fully connected layer can perform classification according to this feature map to obtain the confidence level that the target image is a live facial image, which helps ensure the recognition accuracy of the recognition model.
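By way of illustration only, the flow from the convolution layer to the fully connected layer may be sketched as follows. This is a minimal sketch under stated assumptions and is not the recognition model of the embodiment: the function names `conv2d_valid` and `fully_connected_confidence`, the single 2x2 kernel, the ReLU activation, the sigmoid output, and all numeric values are hypothetical and chosen only to make the example small enough to follow by hand.

```python
import math


def conv2d_valid(image, kernel):
    """Slide the kernel over the image (stride 1, no padding), then apply ReLU.

    Assumption: one single-channel kernel; a real model would use many.
    """
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    feature_map = []
    for i in range(out_h):
        row = []
        for j in range(out_w):
            acc = sum(
                image[i + di][j + dj] * kernel[di][dj]
                for di in range(kh)
                for dj in range(kw)
            )
            row.append(max(0.0, acc))  # ReLU keeps only positive responses
        feature_map.append(row)
    return feature_map


def fully_connected_confidence(feature_map, weights, bias):
    """Flatten the feature map, apply one fully connected unit, and squash
    the result with a sigmoid so it can be read as a confidence in (0, 1).

    Assumption: a sigmoid output is used here for illustration only.
    """
    flat = [value for row in feature_map for value in row]
    logit = sum(w * v for w, v in zip(weights, flat)) + bias
    return 1.0 / (1.0 + math.exp(-logit))


# Hypothetical 4x4 input with a vertical edge between columns 1 and 2.
image = [
    [0.0, 0.0, 1.0, 1.0],
    [0.0, 0.0, 1.0, 1.0],
    [0.0, 0.0, 1.0, 1.0],
    [0.0, 0.0, 1.0, 1.0],
]
# Hypothetical kernel that responds to dark-to-bright vertical edges.
kernel = [
    [-1.0, 1.0],
    [-1.0, 1.0],
]

feature_map = conv2d_valid(image, kernel)

# Hypothetical weights and bias; in practice these would be learned.
weights = [0.5] * 9
bias = -2.0
confidence = fully_connected_confidence(feature_map, weights, bias)
print(feature_map)
print(confidence)
```

In this sketch the feature map is non-zero only where the edge lies, so the fully connected step is classifying on features extracted from the input image rather than on raw pixels, which is the relationship the embodiment describes.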