System and computerized method for subtitles synchronization of audiovisual content using the human voice detection for synchronization

Patent number

US11445266B2

Publication date

2022-09-13

Applicant

IChannel.IO Ltd. (Petah Tikva, IL)

Inventor

Oren Jack Maurice

IPC classification

H04N7/00; H04N21/488; H04N21/43; G10L15/26; G10L25/57

Technical field

subtitle,subtitles,voice,human,audio,correction,s430,segments,analyzer,content

Region: Petah Tikva

Abstract

Audiovisual content, whether in the form of video clip files, streamed, or broadcast, may further contain subtitles. Such subtitles are provided with timing information so that each subtitle is displayed synchronously with the spoken words. However, at times the subtitles have a timing offset relative to the audio portion of the audiovisual content, which is bothersome when it exceeds a predetermined threshold. The system and method determine the time spans in which a human speaks and attempt to synchronize those time spans with the subtitle content. An indication is provided when an incurable synchronization error exists, as well as when the subtitles and audio are well synchronized. When an offset exists, the system and method are further able to determine the type of offset (constant or dynamic) and to provide the necessary adjustment information, so that the timing provided with the subtitles may be corrected and the synchronization deficiency resolved.
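The classification outlined in the abstract (well synchronized, constant offset, dynamic offset, or incurable) can be illustrated with a minimal Python sketch. Everything specific here is an assumption made for illustration and is not taken from the patent: the function name `classify_offset`, the pairing of voice spans and subtitle cues by index, the `threshold` and `tolerance` values, and the modeling of a "dynamic" offset as a linear drift fitted by least squares.

```python
from statistics import median


def classify_offset(voice_starts, subtitle_starts, threshold=0.5, tolerance=0.1):
    """Classify the timing offset between detected voice spans and subtitle cues.

    All times are in seconds. Hypothetical simplification: the i-th voice
    span is assumed to correspond to the i-th subtitle cue.
    Returns (label, adjustment), where adjustment is None or a dict.
    """
    if len(voice_starts) != len(subtitle_starts) or len(voice_starts) < 2:
        return ("incurable", None)

    # Positive offset: the voice starts later than the subtitle cue.
    offsets = [v - s for v, s in zip(voice_starts, subtitle_starts)]

    # Every cue is already within the acceptable threshold.
    if all(abs(o) <= threshold for o in offsets):
        return ("synchronized", None)

    # Offsets are nearly identical: one fixed shift corrects all cues.
    if max(offsets) - min(offsets) <= tolerance:
        return ("constant", {"shift": median(offsets)})

    # Assumed model for a dynamic offset: offset = rate * t + shift.
    n = len(offsets)
    mean_t = sum(subtitle_starts) / n
    mean_o = sum(offsets) / n
    var_t = sum((t - mean_t) ** 2 for t in subtitle_starts)
    if var_t == 0:
        return ("incurable", None)
    rate = sum(
        (t - mean_t) * (o - mean_o) for t, o in zip(subtitle_starts, offsets)
    ) / var_t
    shift = mean_o - rate * mean_t

    # Accept the linear model only if it explains every cue closely enough.
    worst = max(
        abs(o - (rate * t + shift)) for t, o in zip(subtitle_starts, offsets)
    )
    if worst <= tolerance:
        return ("dynamic", {"rate": rate, "shift": shift})
    return ("incurable", None)
```

Under these assumptions, a corrected cue time would be `t + shift` for a constant offset and `t + rate * t + shift` for a dynamic one. A practical system would also need a voice activity detector to produce `voice_starts` and a more robust way to match voice spans to cues than index pairing.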

Description


One prior art solution, described for example by Tiedemann in “Synchronizing Translated Movie Subtitles”, suggests synchronization on the basis of aligned anchor points, a dictionary-based approach that uses automatic word alignment. Other approaches suggest various kinds of filters to identify such potential anchor points. Guimarães et al., in “A Lightweight and Efficient Mechanism for Fixing the Synchronization of Misaligned Subtitle Documents”, propose a two-phase subtitle synchronization framework. This approach uses both anchors and audio fingerprint annotation. It requires enriching the subtitle file with additional audio fingerprints and analysis at the first level, and then adding the anchors, as shown in other prior art approaches, so as to fix the presentation of misaligned subtitle entries. In such cases it is necessary to prepare the files, automatically or manually as the case may be, and to perform complex operations so as to reach the required alignment between the subtitles and the actual audio content. Yet another prior art solution performs time alignment speech recognition, as suggested by Kirby et al. in U.S. Pat. No. 7,191,117. It is therefore desirable to provide a solution that will allow for affordable, simple and real-time lip sync to support the ever-increasing demand to resolve the lip sync error problem.

It is therefore desirable to provide a solution that will allow for affordable, simple synchronization between the audio of audiovisual content and its respective subtitles, so as to support the ever-increasing demand to resolve the subtitle misalignment problem.

SUMMARY
