
Event based audio-video sync detection

Patent number
US11659217B1
Publication date
2023-05-23
Applicant
Amazon Technologies, Inc. (Seattle, WA, US)
Inventors
Hooman Mahyar; Avijit Vajpayee; Abhinav Jain; Arjun Cholkar; Vimal Bhat
IPC classification
H04N21/242; H04N21/234; H04N21/233
Technical field
audio, video, frames, bins, feature, bin, frame, sets, may, component
Region: Seattle, WA

Abstract

Techniques are described for detecting desynchronization between an audio component and a video component of a media presentation. Feature sets may be determined for portions of the audio component and portions of the video component, which may then be used to generate correlations between portions of the audio component and portions of the video component. Synchronization may then be assessed based on the correlations.

Description

A machine learning model may then be applied to pairs of video feature sets and audio feature sets to determine a confidence score between a frame and an audio bin. For example, the object vector and object attribute vector for frame 102b and the audio vector for audio bin 104b are provided as inputs to a machine learning model that outputs a confidence score that the frame 102b and audio bin 104b are synchronized. Confidence scores may also be determined between frame 102b and audio bin 104a and audio bin 104c, as well as between frame 102a and each of audio bins 104a-c and frame 102c and each of audio bins 104a-c.
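The pairwise scoring described above can be sketched as follows. This is a minimal illustration, not the patented model: the text does not specify the model architecture, so `placeholder_model` is a hypothetical stand-in that maps cosine similarity into the range [0, 1]. It also assumes the video feature set (object vector plus object attribute vector) and the audio vector have already been projected to vectors of equal length, which the source does not state.

```python
import math
from typing import Callable, List, Sequence

Vector = Sequence[float]


def placeholder_model(video_features: Vector, audio_features: Vector) -> float:
    """Hypothetical stand-in for the trained model.

    Returns cosine similarity rescaled from [-1, 1] to [0, 1].
    """
    if len(video_features) != len(audio_features):
        raise ValueError("feature vectors must have the same length")
    dot = sum(v * a for v, a in zip(video_features, audio_features))
    norm = math.sqrt(sum(v * v for v in video_features)) * math.sqrt(
        sum(a * a for a in audio_features)
    )
    if norm == 0.0:
        return 0.5  # an all-zero vector carries no evidence either way
    return (dot / norm + 1.0) / 2.0


def confidence_matrix(
    frame_features: Sequence[Vector],
    bin_features: Sequence[Vector],
    model: Callable[[Vector, Vector], float] = placeholder_model,
) -> List[List[float]]:
    """Score every (frame, audio bin) pair.

    Row i corresponds to frame i, column j to audio bin j.
    """
    return [[model(f, b) for b in bin_features] for f in frame_features]
```

For three frames and three audio bins this yields a 3x3 matrix, matching the nine pairings between frames 102a-c and audio bins 104a-c described above.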

The confidence scores are used to determine whether frames 102a-c are desynchronized with audio bins 104a-c. For example, the confidence score between frame 102b and audio bin 104a may be higher than the confidence score between frame 102b and audio bin 104b. Similarly, the confidence score between frame 102c and audio bin 104b may be higher than the confidence score between frame 102c and audio bin 104c. Based on this, the audio component and the video component of media presentation 100 are determined to be desynchronized.
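One way to turn such confidence scores into a decision is sketched below. The text gives only the example, not the decision rule, so the majority vote over per-frame offsets is an assumption made for illustration, and the function names and score values are hypothetical.

```python
from collections import Counter
from typing import List, Sequence


def estimate_offset(scores: Sequence[Sequence[float]]) -> int:
    """Return the most common (best bin index - frame index) offset.

    scores[i][j] is the confidence that frame i matches audio bin j.
    An offset of 0 means frames best match their own bins.
    """
    offsets: Counter = Counter()
    for i, row in enumerate(scores):
        best_bin = max(range(len(row)), key=row.__getitem__)
        offsets[best_bin - i] += 1
    offset, _count = offsets.most_common(1)[0]
    return offset


def is_desynchronized(scores: Sequence[Sequence[float]]) -> bool:
    """Assumed rule: desynchronized if the dominant offset is nonzero."""
    return estimate_offset(scores) != 0


# Illustrative values only: rows are frames 102a-c, columns are bins 104a-c.
example_scores: List[List[float]] = [
    [0.6, 0.3, 0.1],  # 102a best matches 104a (offset 0)
    [0.8, 0.4, 0.2],  # 102b best matches 104a (offset -1)
    [0.1, 0.9, 0.3],  # 102c best matches 104b (offset -1)
]
```

With these values, two of the three frames best match the preceding audio bin, so `estimate_offset(example_scores)` returns -1 and `is_desynchronized(example_scores)` returns True, mirroring the example in the paragraph above.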

Claims
