The person extraction unit 10 analyzes the moving image data to be analyzed, which includes the plurality of pieces of child data, for each frame, and detects persons from each frame. Then, the person extraction unit 10 groups the persons detected from the different frames such that those having outer appearance feature values (for example, face feature values) that are similar to each other by a predetermined level or more belong to the same group. As a result, in a case where the same person is detected from a plurality of frames, these can be grouped. As a result, it is possible to determine in which frame each person detected in the detected moving image data 100 to be analyzed appears.
Then, based on the relationship between each of the plurality of pieces of child data and the frames included in each piece of child data, the person extraction unit 10 determines whether or not each person detected in the moving image data 100 to be analyzed appears in each of the plurality of pieces of child data. A person is determined as appearing in first child data if the person appears in at least one of the plurality of frames included in the first child data.
As a result, a determination result shown in