The remote computing devices 106a associated with the VAS 160 may process the voice input by converting the voice input to text (for example, via a speech-to-text component, discussed above with reference to FIG. 6) and analyzing the text to determine the intent of the request. In some embodiments, the remote computing devices 106a may employ NLU systems that maintain and utilize a lexicon of language, parsers, grammar and semantic rules, and associated processing algorithms to derive information related to the requested media content. For example, the VAS 160 may (i) identify derived payload 783a and/or field types 870 within the voice input that correspond to the intent of the voice input, and (ii) associate the derived payload 783a with one or more of the fields. The derived payload 783a and/or field types 870 identified by the VAS 160 and contained within the packet 783 may be derived by the VAS 160 based on a search and/or metadata provided by the MPS 100 (described in greater detail below) and/or may be stated explicitly by the user. For example, the voice input “Play the ‘In the Zone’ album” explicitly names derived payload 783a (i.e., “In the Zone”) and a field type (i.e., “album”); as such, the resulting response 783 would include {album: “In the Zone”}. In some embodiments, the response 783 contains only the fields populated with derived payload 783a. In particular embodiments, the response 783 contains all of the predefined fields, whether null or populated. In certain cases, the response 783 from the VAS 160 does not include any metadata derived from the voice input.
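The field-extraction behavior described above can be illustrated with a minimal sketch. The field types, function name, and parsing logic below are illustrative assumptions only, not the actual NLU implementation of the VAS 160; it merely shows how an explicitly stated payload and field type (e.g., “In the Zone” and “album”) might be mapped into a response containing either only the populated fields or all predefined fields, whether null or populated.

```python
import re

# Hypothetical set of predefined field types (an assumption for
# illustration; the actual fields used by the VAS may differ).
FIELD_TYPES = ("album", "artist", "track", "playlist")

def derive_fields(transcript, include_null_fields=False):
    """Extract {field_type: derived_payload} pairs from a transcript.

    A quoted name followed by an explicit field-type word, e.g.
    "Play the 'In the Zone' album", yields {"album": "In the Zone"}.
    """
    fields = {}
    # Match a quoted payload followed by a field-type word; accepts
    # straight or curly quotes.
    match = re.search(r"[‘']([^’']+)[’']\s+(\w+)", transcript)
    if match and match.group(2).lower() in FIELD_TYPES:
        fields[match.group(2).lower()] = match.group(1)
    if include_null_fields:
        # Variant in which the response carries all predefined fields,
        # whether null or populated.
        return {ft: fields.get(ft) for ft in FIELD_TYPES}
    # Variant in which only the populated fields are returned.
    return fields

print(derive_fields("Play the ‘In the Zone’ album"))
# {'album': 'In the Zone'}
```

With `include_null_fields=True`, the same call would also carry `artist`, `track`, and `playlist` keys set to `None`, corresponding to the embodiment in which all predefined fields are returned regardless of population.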