When a user's predetermined verbal start command (e.g., Hello, oo) is input through the microphone 22, for example, the request module 23 may record a user's speech after the input, and then transmit the recorded data of the user's speech (recorded data) to the server apparatus 10 through the communication module 21. Thus, the use of the same keyword (verbal start command) in the plurality of devices 20 eliminates the necessity for the user to memorize keywords for the devices, which is convenient for the user.
The request module 23 may transmit the recording state information indicating the recording state in recording the user's speech to the server apparatus 10 through the communication module 21. The recording state information may include one or two or more of a recording level, a noise level, the influence of echoes, the distance from the user (sound source of the speech), and the direction of the user (sound source of the speech), for example.
The request module 23 may also transmit usage state information indicating the usage state of the devices 20 in recording the user's speech. The usage state information may include information, such as a state where the device 20 is in use or not in use, an output unit currently used in the device 20, and the distance between the device 20 and the user, for example.
In the devices 20, the request module 23 can be configured by modules of hardware or software (programs, such as OS, API, service, utility, library, driver, and application) or a combination thereof, for example. The functions of the request module 23 may be realized by the execution of the programs by a computer.