
Automated workflows from media asset differentials

Patent number
US11659214B2
Publication date
2023-05-23
Applicant
Netflix, Inc. (US CA Los Gatos)
Inventors
Yadong Wang; Chih-Wei Wu; Kyle Tacke; Shilpa Jois Rao; Boney Sekh; Andrew Swan; Raja Ranjan Senapati
IPC classification
H04N21/2343; G11B27/10; G11B27/031; H04N21/234; G06Q10/0631
Technical field
media, edits, content, temporally, data, segments, may, sequenced, workflow, object
Region: Los Gatos, CA, US

Abstract

The disclosed computer-implemented method may include (1) accessing a first media data object and a different, second media data object that, when played back, each render temporally sequenced content, (2) comparing first temporally sequenced content represented by the first media data object with second temporally sequenced content represented by the second media data object to identify a set of common temporal subsequences between the first media data object and the second media data object, (3) identifying a set of edits relative to the set of common temporal subsequences that describe a difference between the temporally sequenced content of the first media data object and the temporally sequenced content of the second media data object, and (4) executing a workflow relating to the first media data object and/or the second media data object based on the set of edits. Various other methods, systems, and computer-readable media are also disclosed.
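The four steps of the method can be illustrated with a minimal Python sketch. It assumes each media data object has already been reduced to a sequence of hashable per-segment fingerprints, and it uses the standard library's `difflib.SequenceMatcher` as a stand-in for the comparison step; the patent text here does not name a specific matching algorithm. The function names `diff_media` and `plan_workflow` and the task labels are hypothetical.

```python
from difflib import SequenceMatcher


def diff_media(first, second):
    """Return (common_subsequences, edits) for two fingerprint sequences.

    common_subsequences: (start_in_first, start_in_second, length) tuples.
    edits: (tag, i1, i2, j1, j2) tuples with tag in {"replace", "delete", "insert"},
    meaning first[i1:i2] corresponds to second[j1:j2].
    """
    matcher = SequenceMatcher(a=first, b=second, autojunk=False)
    common = [(m.a, m.b, m.size)
              for m in matcher.get_matching_blocks() if m.size > 0]
    edits = [op for op in matcher.get_opcodes() if op[0] != "equal"]
    return common, edits


def plan_workflow(edits):
    """Map edits to hypothetical downstream tasks (task names are illustrative only)."""
    tasks = []
    for tag, i1, i2, j1, j2 in edits:
        if tag in ("replace", "insert"):
            # New or changed material in the second object needs attention.
            tasks.append(("review_new_segments", j1, j2))
        elif tag == "delete":
            # Material present only in the first object was dropped.
            tasks.append(("retire_old_segments", i1, i2))
    return tasks


first_cut = ["s1", "s2", "s3", "s4", "s5"]
second_cut = ["s1", "s2", "x", "s4", "s5", "s6"]
common, edits = diff_media(first_cut, second_cut)
print(common)
print(edits)
print(plan_workflow(edits))
```

Exact equality of fingerprints is a simplification. A system comparing real audio or video features would more plausibly match segments whose feature distance falls below a threshold.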

Description

As another example of pre-processing, systems described herein may downsample audio content to a specified sampling frequency (e.g., 16000 Hz, 12000 Hz, or 8000 Hz). Downsampling may reduce computational load and improve efficiency while preserving human-salient differences. These systems may also extract, from the audio content, features useful for comparing the similarity of the content. For example, these systems may convert the content to spectrograms and, in some examples, to log-mel spectrograms. Extracting 128 mel frequencies, for instance, produces 128-dimensional log-mel features.
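The audio pre-processing described above can be sketched with NumPy alone. Several details are assumptions not stated in the text: the 48000 Hz source rate, the block-averaging downsampler (a production system would likely use a proper resampling filter), the Hann window, the FFT size of 2048, the hop of 512 samples, and the HTK-style mel formula. Only the 16000 Hz target rate and the 128 mel bands come from the description.

```python
import numpy as np


def hz_to_mel(f):
    return 2595.0 * np.log10(1.0 + np.asarray(f, dtype=float) / 700.0)


def mel_to_hz(m):
    return 700.0 * (10.0 ** (np.asarray(m, dtype=float) / 2595.0) - 1.0)


def downsample(signal, factor):
    """Crude integer-factor downsampling: average each block of `factor` samples."""
    n = (len(signal) // factor) * factor
    return signal[:n].reshape(-1, factor).mean(axis=1)


def mel_filterbank(n_mels, n_fft, sample_rate):
    """Triangular filters evenly spaced on the mel scale, shape (n_mels, n_fft // 2 + 1)."""
    fft_freqs = np.linspace(0.0, sample_rate / 2.0, n_fft // 2 + 1)
    hz_points = mel_to_hz(
        np.linspace(hz_to_mel(0.0), hz_to_mel(sample_rate / 2.0), n_mels + 2))
    bank = np.zeros((n_mels, fft_freqs.size))
    for i in range(n_mels):
        left, center, right = hz_points[i], hz_points[i + 1], hz_points[i + 2]
        rising = (fft_freqs - left) / (center - left)
        falling = (right - fft_freqs) / (right - center)
        bank[i] = np.maximum(0.0, np.minimum(rising, falling))
    return bank


def log_mel_spectrogram(signal, sample_rate, n_fft=2048, hop=512, n_mels=128):
    """Return an (n_frames, n_mels) array of log-mel features.

    Assumes len(signal) >= n_fft.
    """
    window = np.hanning(n_fft)
    n_frames = 1 + (len(signal) - n_fft) // hop
    frames = np.stack([signal[i * hop:i * hop + n_fft] * window
                       for i in range(n_frames)])
    power = np.abs(np.fft.rfft(frames, axis=1)) ** 2
    mel = power @ mel_filterbank(n_mels, n_fft, sample_rate).T
    return np.log(mel + 1e-10)  # small offset avoids log(0)


# One second of a 440 Hz tone at an assumed 48000 Hz source rate.
source_rate = 48000
t = np.arange(source_rate) / source_rate
audio = np.sin(2 * np.pi * 440.0 * t)

audio_16k = downsample(audio, 3)                # 48000 Hz -> 16000 Hz
features = log_mel_spectrogram(audio_16k, 16000)
print(features.shape)                           # (number of frames, 128)
```

Each row of `features` is one 128-dimensional log-mel vector, so two audio tracks can be compared frame by frame with any vector distance.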

Similarly, systems described herein may downsample video content to a specified resolution (e.g., 320×180). Furthermore, these systems may crop video content to achieve a consistent size and/or aspect ratio. In some examples, these systems may also apply cropping to each frame to remove potentially irrelevant content. For example, these systems may crop approximately 2% of the horizontal portion of the frame and approximately 15% of the vertical portion of the frame to remove potentially irrelevant textual content. In addition, these systems may reformat the content as a vector (e.g., converting the downsampled 320×180 frame to a 57600×1 vector).
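A rough NumPy sketch of the video pre-processing follows. The order of operations (crop, then grayscale, then resize, then flatten) is an assumption chosen so that the result is exactly 57600×1, since 320 × 180 = 57600 implies a single channel at the target resolution. Splitting the cropped margin evenly between opposite edges, averaging the color channels, and nearest-neighbor resizing are also assumptions; the helper names are hypothetical.

```python
import numpy as np


def crop_margins(frame, horiz_frac=0.02, vert_frac=0.15):
    """Trim about horiz_frac of the width and vert_frac of the height in total,
    split evenly between opposite edges (an assumption)."""
    h, w = frame.shape[:2]
    dx = int(round(w * horiz_frac / 2))
    dy = int(round(h * vert_frac / 2))
    return frame[dy:h - dy, dx:w - dx]


def resize_nearest(img, out_h, out_w):
    """Nearest-neighbor resize of a 2-D array to (out_h, out_w)."""
    rows = (np.arange(out_h) * img.shape[0] / out_h).astype(int)
    cols = (np.arange(out_w) * img.shape[1] / out_w).astype(int)
    return img[rows][:, cols]


def frame_to_vector(frame, out_w=320, out_h=180):
    """Crop, convert to one channel, downsample, and flatten to a column vector."""
    cropped = crop_margins(frame)
    if cropped.ndim == 3:
        gray = cropped.mean(axis=2)      # average the color channels
    else:
        gray = cropped.astype(float)
    small = resize_nearest(gray, out_h, out_w)
    return small.reshape(-1, 1)          # (out_h * out_w, 1)


# A synthetic 1080p RGB frame stands in for decoded video.
rng = np.random.default_rng(0)
frame = rng.integers(0, 256, size=(1080, 1920, 3), dtype=np.uint8)
vector = frame_to_vector(frame)
print(vector.shape)                      # (57600, 1)
```

Stacking these vectors over time yields one fixed-length feature per frame, which keeps the later comparison step independent of the source resolution.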

Claims

1