The method 500 may begin at 510, where the user device 502 may receive user input. By way of example, a user (not depicted), standing within a threshold distance of user device 502 may provide vocal input which may be received at a microphone of user device 502. The vocal input may include a wake word and/or phrase that is recognizable by the user device 502 as indicating additional input is about to be provided. The user device 502 may capture audio via the microphone including the wake word/phrase and any subsequent utterance. By way of example, the user may state aloud “[wake phrase] play songs by [Artist A],” where [Artist A] is the name of a particular music artist.