The input receiving unit 30 receives an input for setting the time window from the user. For example, the input receiving unit 30 may receive a user input for setting the time width of the time window (for example, 30 seconds, one minute, 30 minutes, one hour, or one day).
In addition, the input receiving unit 30 may receive a user input for individually setting the start position and the end position of each of the plurality of time windows.
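The two ways of setting the time window described above can be illustrated with a minimal sketch. It is not the implementation of the input receiving unit 30; the names `TimeWindow`, `windows_from_width`, and `windows_from_bounds` are hypothetical, and the sketch assumes that times are expressed in seconds from the start of the moving image data and that each window is half-open (start inclusive, end exclusive).

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TimeWindow:
    """Hypothetical container for one time window, in seconds."""
    start: float  # inclusive
    end: float    # exclusive


def windows_from_width(total_duration, width):
    """Split [0, total_duration) into consecutive windows of one fixed width.

    The last window is shortened if the duration is not a multiple of width.
    """
    if width <= 0:
        raise ValueError("width must be positive")
    windows = []
    i = 0
    # Multiply by the index instead of accumulating, to avoid float drift.
    while i * width < total_duration:
        start = i * width
        windows.append(TimeWindow(start, min(start + width, total_duration)))
        i += 1
    return windows


def windows_from_bounds(bounds):
    """Build windows from individually specified (start, end) pairs.

    Windows may overlap or leave gaps; only end > start is checked.
    """
    windows = []
    for start, end in bounds:
        if end <= start:
            raise ValueError(f"end must be after start: ({start}, {end})")
        windows.append(TimeWindow(float(start), float(end)))
    return windows
```

For example, `windows_from_width(3600, 60)` would correspond to a user input that sets a one-minute time width for one hour of moving image data, while `windows_from_bounds([(0, 60), (30, 90)])` would correspond to a user input that sets each start position and end position individually.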
The person extraction unit 10 analyzes the moving image data to be analyzed in units of the set time window. Then, the person extraction unit 10 determines whether or not each person detected in the moving image data to be analyzed appears in each of the plurality of time windows, and calculates the appearance frequency based on the determination result. Other configurations of the person extraction unit 10 are the same as those in the first and second example embodiments.
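The per-window determination and the frequency calculation can be sketched as follows. This is an illustration under stated assumptions, not the processing of the person extraction unit 10 itself: the function name `appearance_frequency` is hypothetical, each person is assumed to be already identified and associated with the timestamps (in seconds) at which the person was detected, each time window is assumed to be a half-open `(start, end)` pair, and the appearance frequency is assumed to be the number of time windows in which the person appears.

```python
def appearance_frequency(detections, windows):
    """Count, for each person, the time windows in which the person appears.

    detections: dict mapping a person ID to a list of detection timestamps.
    windows:    list of (start, end) pairs, start inclusive and end exclusive.
    """
    frequency = {}
    for person_id, timestamps in detections.items():
        count = 0
        for start, end in windows:
            # The person "appears" in a window if any detection falls inside it.
            if any(start <= t < end for t in timestamps):
                count += 1
        frequency[person_id] = count
    return frequency


# Hypothetical example data: three 30-second windows and two persons.
example_windows = [(0, 30), (30, 60), (60, 90)]
example_detections = {"A": [5, 40, 70], "B": [10]}
example_result = appearance_frequency(example_detections, example_windows)
# example_result == {"A": 3, "B": 1}
```

Dividing each count by `len(windows)` would give the frequency as a ratio instead of a count; which definition applies depends on the embodiment.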
According to the present example embodiment described above, the same advantageous effect as in the first and second example embodiments can be achieved.
In addition, according to the present example embodiment, in which the user can set the time window, the user can obtain a desired output result by setting the time window as desired.