When two or more of the devices are present in a user's room, the user hesitates in determining which voice assistant of any one of the devices is used in some cases and needs to memorize a keyword for each device, which is inconvenient for the user. In the future, when the internet of things (IoT) becomes more popular, so that various kinds of devices and services cooperate with each other, the number of devices usable for the voice assistant further increases. Therefore, a situation in which a user hesitates in determining which device is needed to recognize a user's voice.
The present embodiments have been made in view of the above-described circumstances. It is an object of some embodiments of the present invention to provide a voice assistant system, a server apparatus, a device, a voice assistant method, and a program to be executed by a computer which allow a user to easily utilize the voice assistant without hesitation in determining which device usable for the voice assistant is used even when there are two or more of the devices.
In one embodiment, a voice assistant system includes a server apparatus performing voice assistant and a plurality of devices, in which the server apparatus and the devices are communicatively connected to each other. The plurality of devices each records the same user's speech through a microphone, and then transmits recorded data of the same user's speech to the server apparatus. The server apparatus receives the recorded data transmitted from each of the plurality of devices, and then voice-recognizes two or more of the received recorded data in accordance with a predetermined standard to thereby interpret the contents of the user's speech to perform the voice assistant.