In various embodiments, the data transmitted from the first playback device 702a to the second playback device 702b may comprise, for example, raw microphone data and/or processed sound data from one, some or all of the microphones (e.g., after being processed by one or more of the first AEC 764a and the first spatial processor 766a). Processing the data to be transmitted may include compressing the data prior to transmission. In some implementations, it may be beneficial to perform acoustic echo cancellation (via the first AEC 764a) with the reference signal(s) before transmitting the detected sound to reduce bandwidth. In some embodiments, the second AEC 764b may be bypassed or omitted from the second voice processor 760b in configurations in which acoustic cancellation is applied to sound data to be transmitted from the first playback device 702a to the second playback device 702b. In additional or alternate embodiments, spatial processing may be carried out on the data to be transmitted to the second playback device 702b, in which case the second spatial processor 766b may be bypassed or omitted from the second voice processor 760b.