What is claimed is:1. An information processing apparatus comprising:at least one memory configured to store one or more instructions; andat least one processor configured to execute the one or more instructions to:obtain a first image including a plurality of first faces captured by a first camera;store a plurality of second images captured by one or more second-image generation cameras installed in a facility;identify a second image, among the plurality of second images, which includes at least two second faces matching at least two first faces, among the plurality of first faces included in the first image;extract the stored second image including the at least two second faces matching the at least two first faces in the first image, the at least two second faces corresponding to at least two persons; andcause a display to the extracted second image.2. The information processing apparatus according to claim 1,wherein the processor is further configured to execute the one or more instructions to, in a case where the first image includes the plurality of faces each with a predetermined size or larger, extract the second image including the at least two persons included in the first image with the predetermined size or larger.3. The information processing apparatus according to claim 1,wherein the processor is further configured to execute the one or more instructions to obtain an image generated by a first-image generation camera which images a periphery of the display, as the first image.4. The information processing apparatus according to claim 3,wherein the processor is further configured to execute the one or more instructions to:cause the display to display an image captured by the first-image generation camera orcause a display apparatus including the display to function as a mirror while a predetermined condition is not satisfied, andcause the display to display the extracted second image when the predetermined condition is satisfied.5. The information processing apparatus according to claim 1,wherein the processor is further configured to execute the one or more instructions to:store the second image in association with an imaging date and time and an imaging position,obtain position information of a specific person included in the second image in the facility and time information corresponding to the position information, andexclude the second image from an object to be extracted as an image including the specific person in a case where an imaging date and time and an imaging position do not satisfy a predetermined condition in relation to the position information and the time information.6. The information processing apparatus according to claim 5,wherein the processor is further configured to execute the one or more instructions to obtain an imaging date and time and an imaging position of the second image including a person having a similarity with the specific person included in the first image equal to or higher than a predetermined level, as the position information and the time information for the specific person.7. The information processing apparatus according to claim 1,wherein the processor is further configured to execute the one or more instructions to:store one or a plurality of pieces of identification information obtained by a wireless communication apparatus which is installed in a periphery of each of the plurality of second-image generation cameras and obtain the identification information from one or a plurality of electronic tags or portable terminals of which a positional relationship satisfies a predetermined condition, in association with the second image generated within a predetermined time from a date and time when the identification information is obtained,obtain an image generated by a first-image generation camera which images a periphery of the display, as the first image,obtain the one or the plurality of pieces of identification information obtained by a wireless communication apparatus which is installed in a periphery of the first-image generation camera and obtain the identification information from the one or the plurality of electronic tags or portable terminals of which a positional relationship satisfies a predetermined condition, andextract the second image including a person included in the first image and being associated with the obtained identification information.8. The information processing apparatus according to claim 7,wherein the processor is further configured to execute the one or more instructions to, in a case where M persons (M is equal to or more than one) are included in the first image and N (N is equal to or more than one) pieces of the identification information are obtained, extract the second image including at least one person among the M persons and being associated with at least one of the N pieces of the identification information.9. The information processing apparatus according to claim 8,wherein the processor is further configured to execute the one or more instructions to, in a case where a plurality of persons having predetermined sizes or larger are included in the first image and N (N is equal to or more than one) pieces of the identification information are obtained, extract the second image including all of the plurality of persons having the predetermined sizes or larger included in the first image and being associated with at least one of the N pieces of the identification information.10. The information processing apparatus according to claim 1, further comprising:excluding the stored second image of which an imaging date and time and an imaging position do not satisfy a predetermined condition in relation to first information, from an object to be extracted as an image which includes the at least two or more persons included in the first image,wherein the first information is information indicating whether the at least two or more persons included in the first image was in the facility at each date and time.11. An information processing apparatus comprising:at least one memory configured to store one or more instructions; andat least one processor configured to execute the one or more instructions to:obtain a first image including a plurality of first faces captured by a first camera in association with one or a plurality of pieces of identification information obtained by a wireless communication apparatus which is installed in a periphery of the first camera and obtain the identification information from one or a plurality of electronic tags or portable terminals of which a positional relationship satisfies a predetermined condition;identify a second image, among a plurality of second images, which includes at least two second faces matching at least two first faces, among the plurality of first faces included in the first image, the plurality of second images being captured by one or more second cameras; andextract the stored second image including the at least two second faces matching the at least two first faces in the first image, the second image including at least two or more persons included in the first image and being associated with the obtained identification-information, from a storage unit which stores the second image in association with the one or the plurality of pieces of identification information obtained by a wireless communication apparatus which is installed in a periphery of each of the plurality of second cameras and obtain the identification information from the one or the plurality of electronic tags or portable terminals of which a positional relationship satisfies a predetermined condition.12. The information processing apparatus according to claim 11,wherein the processor is further configured to execute the one or more instructions to, in a case where M persons (M is equal to or more than one) are included in the first image and N (N is equal to or more than one) pieces of the identification information are obtained, extract the second image including at least one person among the M persons and being associated with at least one of the N pieces of the identification information.13. The information processing apparatus according to claim 12,wherein the processor is further configured to execute the one or more instructions to, in a case where a plurality of persons having predetermined sizes or larger are included the first image and N (N is equal to or more than one) pieces of the identification information are obtained, extract the second image including all of the plurality of persons having the predetermined sizes or larger included in the first image and being associated with at least one of the N pieces of the identification information.14. An information processing method executed by a computer the method comprising:obtaining a first image including a plurality of first faces captured by a first camera;storing a plurality of second images captured by one or more of second-image generation cameras installed in a facility,identifying a second image, among the plurality of second images, which includes at least two second faces matching at least two first faces, among the plurality of first faces included in the first image;extracting the stored second image including the at least two second faces matching the at least two first faces in the first image; andcausing a display to display the extracted second image.15. A non-transitory storage medium storing a program causing a computer to:obtain a first image including a plurality of first faces captured by a first camera;store a plurality of second images captured by one or more second-image generation cameras installed in a facility;identify a second image, among the plurality of second images, which includes at least two second faces matching at least two first faces, among the plurality of first faces included in the first image;extract the stored second image including the at least two second faces matching the at least two first faces in the first image; and cause a display to display the extracted second image.16. An information processing method executed by a computer, the method comprising:obtaining a first image including a plurality of first faces captured by a first camera in association with one or a plurality of pieces of identification information obtained by a wireless communication apparatus which is installed in a periphery of the first camera and obtains the identification information from one or a plurality of electronic tags or portable terminals of which a positional relationship satisfies a predetermined condition;identifying a second image, among a plurality of second images, which includes at least two second faces matching at least two first faces, among the plurality of first faces included in the first image, the plurality of second images being captured by one or more second cameras; andextracting the stored second image including the at least two second faces matching the at least two first faces in the first image, the second image including at least two or more persons included in the first image and being associated with the obtained identification information, from a storage unit which stores the second image in association with the one or the plurality of pieces of identification information obtained by a wireless communication apparatus which is installed in a periphery of each of the plurality of second cameras and obtain the identification information from the one or the plurality of electronic tags or portable terminals of which a positional relationship satisfies a predetermined condition.17. A non-transitory storage medium storing a program causing a computer to:obtain a first image including a plurality of first faces captured by a first camera in association with one or a plurality of pieces of identification information obtained by a wireless communication apparatus which is installed in a periphery of the first camera and obtain the identification information from one or a plurality of electronic tags or portable terminals of which a positional relationship satisfies a predetermined condition;identify a second image, among a plurality of second images, which includes at least two second faces matching at least two first faces, among the plurality of first faces included in the first image, the plurality of second images being captured by one or more second cameras; andextract the stored second image including the at least two second faces matching the at least two first faces in the first image, the second image including at least two or more persons included in the first image and being associated with the obtained identification-information, from a storage unit which stores the second image in association with the one or the plurality of pieces of identification information obtained by a wireless communication apparatus which is installed in a periphery of each of the plurality of second cameras and obtain the identification information from the one or the plurality of electronic tags or portable terminals of which a positional relationship satisfies a predetermined condition.