According to the present example embodiment, in which the criteria for the similarity can be set, persons extracted from a plurality of frames can be grouped with high accuracy such that the same person belongs to the same group. In a case where the criteria are too low, the possibility that different persons are erroneously determined to be the same person increases. On the other hand, in a case where the criteria are too high, the possibility that the same person is erroneously determined to be different persons increases. According to the present example embodiment, the user can adjust the criteria for the similarity to a desired state while checking the determination result, so that both kinds of erroneous determination are reduced and the grouping accuracy described above is obtained.
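The effect of the criteria on the grouping result can be illustrated with a minimal sketch. The sketch makes assumptions that are not stated in the present description: each extracted person is represented by a feature vector, the similarity is the cosine similarity, the criteria are a single threshold value, and two persons are placed in the same group when they are linked by a chain of pairs whose similarity is equal to or more than the threshold. The names `cosine_similarity` and `group_by_similarity` and the sample feature values are hypothetical and serve only as an illustration.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity of two non-zero feature vectors (assumed measure)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def group_by_similarity(features, threshold):
    """Group indices of `features` whose similarity is >= `threshold`.

    Hypothetical illustration: pairs at or above the threshold are merged
    with a union-find structure, so groups are formed by chains of pairs.
    """
    parent = list(range(len(features)))

    def find(i):
        # Follow parent links to the root, shortening the path on the way.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i in range(len(features)):
        for j in range(i + 1, len(features)):
            if cosine_similarity(features[i], features[j]) >= threshold:
                ri, rj = find(i), find(j)
                if ri != rj:
                    # Keep the smaller index as the root of the merged group.
                    parent[max(ri, rj)] = min(ri, rj)

    groups = {}
    for i in range(len(features)):
        groups.setdefault(find(i), []).append(i)
    return sorted(groups.values())


# Illustrative data only: indices 0 and 1 stand for one person seen in two
# frames, indices 2 and 3 stand for a different person seen in two frames.
features = [(1.0, 0.0), (0.9, 0.1), (0.0, 1.0), (0.1, 0.9)]

for threshold in (0.1, 0.9, 0.999):
    print(threshold, group_by_similarity(features, threshold))
```

With this sample data, the threshold of 0.1 merges all four detections into one group (different persons determined as the same person), the threshold of 0.999 leaves every detection in its own group (the same person determined as different persons), and the threshold of 0.9 yields the two intended groups. A user who checks such results can move the threshold toward the value that gives the desired grouping.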
A data processing apparatus 1 of the present example embodiment differs from those of the first and second example embodiments in that the user can set the time window described in the first example embodiment. The other configurations are the same as those in the first and second example embodiments.