Event based audio-video sync detection

Patent number
US11659217B1
Publication date
2023-05-23
Applicant
Amazon Technologies, Inc. (US WA Seattle)
Inventors
Hooman Mahyar; Avijit Vajpayee; Abhinav Jain; Arjun Cholkar; Vimal Bhat
IPC classification
H04N21/242; H04N21/234; H04N21/233
Technical field
audio, video, frames, bins, feature, bin, frame, sets, may, component
Region: WA WA Seattle

Abstract

Techniques are described for detecting desynchronization between an audio component and a video component of a media presentation. Feature sets may be determined for portions of the audio component and portions of the video component, which may then be used to generate correlations between portions of the audio component and portions of the video component. Synchronization may then be assessed based on the correlations.

Description

As discussed further below in relation to FIGS. 5 and 6, correlations may also be determined between feature sets generated from video frames and predicted feature sets generated from an audio bin (FIG. 5), and between feature sets generated from audio bins and predicted feature sets generated from a video frame (FIG. 6). In some implementations, multiple correlations may be determined for an audio bin/video frame pair based on the various techniques discussed herein, e.g., those of FIGS. 5 and 6.
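As an illustration only, the two kinds of correlation described above for a single audio bin/video frame pair might be computed as in the following sketch. The text here does not specify the correlation measure or how the predicted feature sets are produced, so Pearson correlation is an assumption, and the function names `pearson` and `pair_correlations`, along with their parameters, are hypothetical.

```python
from math import sqrt
from typing import Dict, Sequence


def pearson(a: Sequence[float], b: Sequence[float]) -> float:
    """Pearson correlation between two equal-length feature vectors.

    Assumption: the source only says "correlation"; Pearson is used here
    purely as an example measure.
    """
    if len(a) != len(b) or len(a) < 2:
        raise ValueError("feature vectors must have equal length >= 2")
    mean_a = sum(a) / len(a)
    mean_b = sum(b) / len(b)
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        # A constant vector has no defined correlation; this sketch
        # treats it as uncorrelated (an assumption).
        return 0.0
    return cov / sqrt(var_a * var_b)


def pair_correlations(
    video_features: Sequence[float],
    video_features_predicted_from_audio: Sequence[float],
    audio_features: Sequence[float],
    audio_features_predicted_from_video: Sequence[float],
) -> Dict[str, float]:
    """Return both correlations for one audio bin / video frame pair."""
    return {
        # Actual video-frame features vs. features predicted from the audio bin.
        "video_vs_predicted": pearson(
            video_features, video_features_predicted_from_audio
        ),
        # Actual audio-bin features vs. features predicted from the video frame.
        "audio_vs_predicted": pearson(
            audio_features, audio_features_predicted_from_video
        ),
    }
```

Both values could then be passed downstream together, which is one way to read "multiple correlations" for a single pair.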

The video component and the audio component may be determined not to be synchronized based on the correlations (308). In some implementations, the correlations are provided as inputs to a model, such as a classifier. The classifier may be trained on correlations between video components and audio components of media presentations from a training data set. In some implementations, the correlations may include correlations between each audio bin of a portion of the audio component and each video frame of a portion of the video component of a media presentation. The model may analyze the correlations to determine whether the audio component and the video component are synchronized. In some implementations, the model may implement a threshold strategy, where an audio bin and a video frame sharing a media timeline are considered desynchronized if the correlation between them does not exceed a threshold value.
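The threshold strategy can be sketched as follows. This is a minimal illustration under stated assumptions: `correlations[i][j]` holds the correlation between audio bin `i` and video frame `j`, the diagonal entries are taken to be the pairs that share a position on the media timeline, and the aggregation rule in `is_desynchronized` (with its hypothetical `max_desync_fraction` parameter) is not described in the text. All function names and default values are invented for the example.

```python
from typing import List, Sequence

Matrix = Sequence[Sequence[float]]


def aligned_correlations(correlations: Matrix) -> List[float]:
    """Correlations for pairs assumed to share a media timeline position.

    Assumption: audio bin i and video frame i are time-aligned, so the
    aligned pairs sit on the diagonal of the matrix.
    """
    n = min(len(correlations), min((len(row) for row in correlations), default=0))
    return [correlations[i][i] for i in range(n)]


def desynced_pairs(correlations: Matrix, threshold: float) -> List[int]:
    """Indices of aligned pairs whose correlation does not exceed the threshold."""
    return [
        i
        for i, value in enumerate(aligned_correlations(correlations))
        if value <= threshold
    ]


def is_desynchronized(
    correlations: Matrix,
    threshold: float = 0.5,            # hypothetical default
    max_desync_fraction: float = 0.2,  # hypothetical aggregation rule
) -> bool:
    """Flag the portion when too many aligned pairs fail the threshold."""
    aligned = aligned_correlations(correlations)
    if not aligned:
        return False
    flagged = desynced_pairs(correlations, threshold)
    return len(flagged) / len(aligned) > max_desync_fraction


example = [
    [0.9, 0.1, 0.0],
    [0.2, 0.3, 0.1],
    [0.0, 0.1, 0.8],
]
flagged = desynced_pairs(example, threshold=0.5)  # pair 1 has correlation 0.3
```

A trained classifier, as mentioned above, would take the full matrix as input rather than only the diagonal; the threshold version is shown because it needs no training data.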

Claims

1