System and computerized method for subtitles synchronization of audiovisual content using the human voice detection for synchronization

Patent number
US11445266B2
Publication date
2022-09-13
Applicant
IChannel.IO Ltd. (IL Petah Tikva)
Inventor
Oren Jack Maurice
IPC classification
H04N7/00; H04N21/488; H04N21/43; G10L15/26; G10L25/57
Technical field
subtitle, subtitles, voice, human, audio, correction, s430, segments, analyzer, content
Region: Petah Tikva

Abstract

Audiovisual content in the form of video clip files, whether streamed or broadcast, may further contain subtitles. Such subtitles are provided with timing information so that each subtitle is displayed synchronously with the spoken words. However, at times the synchronization with the audio portion of the audiovisual content has a timing offset which, when above a predetermined threshold, is bothersome. The system and method determine the time spans in which a human speaks and attempt to synchronize those time spans with the subtitle content. An indication is provided when an incurable synchronization deficiency exists, as well as when the subtitles and audio are well synchronized. When an offset exists, the system is further able to determine the type of offset (constant or dynamic) and to provide the necessary adjustment information, so that the timing provided with the subtitles may be corrected and the synchronization deficiency resolved.

Description

Subtitle content 308 is provided as part of the audiovisual content 302 and includes the subtitle text as well as timing information. The timing information may include, but is not limited to, the starting time and the end time of a particular subtitle, the starting time and the duration of the particular subtitle, or relative time information between one subtitle and another. Regardless of the particular format, the subtitle content 308 is provided to the subtitle analyzer 330. The subtitle analyzer 330 extracts the timing information for each subtitle segment STi. The subtitle and human voice misalignment analyzer 340 receives both the human voice segments HAi and the subtitle segments STi, together with their respective timing information. It then performs an analysis to determine the nature of the correction, as further explained with respect to FIGS. 4 and 5. Based on that determination, a set of correction factors may be provided to the subtitle and audio content alignment unit 350. The analysis may have one of three outcomes: that no alignment correction is necessary, as in the case shown in FIG. 2A; that a correction is needed and can be performed by the system because the misalignment lies between predetermined thresholds, in which case a set of correction factors is provided to the subtitle and audio content alignment unit 350, which includes the cases shown in FIGS. 2A-2C; or that correction is not possible, in which case an error notification is provided on the notification signal 345. The notification signal 345 may further be used to provide other kinds of notifications, including that no correction is needed or, when a correction is needed, the type of correction performed. It should be noted, as also mentioned herein, that the correction factor may be a constant or, if TΔ continuously increases or continuously decreases, a set of correction factors that changes continuously over time.
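
The decision made by the misalignment analyzer 340 can be illustrated with a minimal Python sketch. It is not the patented method of FIGS. 4 and 5, which this passage does not reproduce. It assumes that each human voice segment HAi has already been paired one-to-one with a subtitle segment STi, takes TΔ for a pair to be the voice start time minus the subtitle start time, and models a continuously changing TΔ as a straight line fitted by least squares. The names `Correction`, `analyze_misalignment`, and `apply_correction`, as well as all threshold values, are hypothetical.

```python
from dataclasses import dataclass
from statistics import mean
from typing import List, Tuple

Segment = Tuple[float, float]  # (start, end) in seconds


@dataclass
class Correction:
    kind: str            # "none", "constant", "dynamic", or "error"
    offset: float = 0.0  # seconds to add to a subtitle time at t = 0
    drift: float = 0.0   # additional seconds of offset per second of subtitle time


def analyze_misalignment(voice: List[Segment], subs: List[Segment],
                         ok_threshold: float = 0.2,   # hypothetical value
                         max_offset: float = 10.0,    # hypothetical value
                         fit_tolerance: float = 0.1   # hypothetical value
                         ) -> Correction:
    """Classify the misalignment between paired voice and subtitle segments."""
    # Assumption: segments are already paired one-to-one, in order.
    if not subs or len(voice) != len(subs):
        return Correction("error")
    deltas = [v[0] - s[0] for v, s in zip(voice, subs)]  # one T-delta per pair
    if all(abs(d) <= ok_threshold for d in deltas):
        return Correction("none")          # already well synchronized
    if any(abs(d) > max_offset for d in deltas):
        return Correction("error")         # beyond what we try to correct
    avg = mean(deltas)
    if all(abs(d - avg) <= fit_tolerance for d in deltas):
        return Correction("constant", offset=avg)
    increasing = all(b >= a for a, b in zip(deltas, deltas[1:]))
    decreasing = all(b <= a for a, b in zip(deltas, deltas[1:]))
    if not (increasing or decreasing):
        return Correction("error")         # neither constant nor monotonic
    # Least-squares line: delta ~ offset + drift * subtitle_start
    times = [s[0] for s in subs]
    t_mean = mean(times)
    var = sum((t - t_mean) ** 2 for t in times)
    if var == 0:
        return Correction("error")
    drift = sum((t - t_mean) * (d - avg) for t, d in zip(times, deltas)) / var
    offset = avg - drift * t_mean
    if any(abs(d - (offset + drift * t)) > fit_tolerance
           for t, d in zip(times, deltas)):
        return Correction("error")         # a line does not describe the drift
    return Correction("dynamic", offset=offset, drift=drift)


def apply_correction(subs: List[Segment], c: Correction) -> List[Segment]:
    """Shift each subtitle by the offset evaluated at its start time."""
    shifted = []
    for start, end in subs:
        shift = c.offset + c.drift * start
        shifted.append((start + shift, end + shift))  # duration is preserved
    return shifted
```

In this sketch a `"constant"` result corresponds to a single correction factor, a `"dynamic"` result to a correction that changes continuously over time, and `"error"` to the case in which an error notification would be raised on the notification signal 345.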

Claims
