白丝美女被狂躁免费视频网站,500av导航大全精品,yw.193.cnc爆乳尤物未满,97se亚洲综合色区,аⅴ天堂中文在线网官网

System and computerized method for subtitles synchronization of audiovisual content using the human voice detection for synchronization

專利號
US11445266B2
公開日期
2022-09-13
申請人
IChannel.IO Ltd.(IL Petah Tikva)
發(fā)明人
Oren Jack Maurice
IPC分類
H04N7/00; H04N21/488; H04N21/43; G10L15/26; G10L25/57
技術(shù)領(lǐng)域
subtitle,subtitles,voice,human,audio,correction,s430,segments,analyzer,content
地域: Petah Tikva

摘要

Audiovisual content in the form of video clip files, streamed or broadcasted may further contain subtitles. Such subtitles are provided with timing information so that each subtitle should be displayed synchronously with the spoken words. However, at times such synchronization with the audio portion of the audiovisual content has a timing offset which when above a predetermined threshold is bothersome. The system and method determine time spans in which a human speaks and attempts to synchronize those time spans with the subtitle content. Indication is provided when an incurable synchronization exists as well as the case where the subtitles and audio are well synchronized. It further is able to determine, when an offset exists, the type of offset (constant or dynamic) and providing the necessary adjustment information so that the timing used in conjunction with the subtitles timing provided may be corrected and synchronization deficiency resolved.

說明書

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15

Reference is now made to FIG. 3 where there is shown an exemplary and non-limiting schematic block diagram of a system 300 for synchronization between human voice segments and subtitles according to an embodiment. An audiovisual content 302 provides video content 302 to a potential video analyzer 310 (not discussed within the scope of the invention), an audio content 304 provided to a human voice analyzer 320, and subtitle content 308 provided to a subtitle analyzer 330. As noted above the video analyzer operation 310 is not discussed herein but could be used in accordance with the principles disclosed in a co-pending patent application titled “A System and a Computerized Method for Audio Lip Synchronization of Video Content”, filed on the same day and date, assigned to common assignee and hereby incorporated by reference. One of ordinary skill in the art would be readily able to include the operation of the video analyzer 310 for additional synchronization and therefore such explanation is not further elaborated herein. The audio content 302 is provided to a human voice analyzer 320. The function of the human voice analyzer 320 is to analyze the audio content and determine the beginning and end of human voice segments, i.e., segments of the audio content that are identified as containing a human voice. There are known techniques in the art some of which may be found in Gerhard's “Audio Signal Classification: History and Current Techniques” or in Dov et. Al. “Audio-Visual Voice Activity Detection Using Diffusion Maps” providing non-limiting examples of voice activity detection (VAD) methods. From the hearable sounds it is necessary to distinguish speech from other hearable sources of sound that may be music, artificial sounds, natural sounds, and noise. Various spectral analysis techniques may be used separately or in combination for the purpose of extracting the desired human voice segments HAi. Regardless of the technique used, the output of the human voice analyzer 320 is provided to both a subtitle and human voice misalignment analyzer 340 and to a subtitle and audio content alignment unit 350.

權(quán)利要求

1
微信群二維碼
意見反饋