Reference is now made to FIG. 3, which shows an exemplary and non-limiting schematic block diagram of a system 300 for synchronization between human voice segments and subtitles according to an embodiment. An audiovisual content 302 provides video content 302 to a potential video analyzer 310 (not discussed within the scope of the invention), audio content 304 to a human voice analyzer 320, and subtitle content 308 to a subtitle analyzer 330. As noted above, the operation of the video analyzer 310 is not discussed herein but could be used in accordance with the principles disclosed in a co-pending patent application titled “A System and a Computerized Method for Audio Lip Synchronization of Video Content”, filed on the same day and date, assigned to common assignee and hereby incorporated by reference. One of ordinary skill in the art would readily be able to include the operation of the video analyzer 310 for additional synchronization, and therefore such explanation is not further elaborated herein. The function of the human voice analyzer 320 is to analyze the audio content 304 and determine the beginning and end of human voice segments, i.e., segments of the audio content that are identified as containing a human voice. Techniques for this purpose are known in the art; non-limiting examples of voice activity detection (VAD) methods may be found in Gerhard's “Audio Signal Classification: History and Current Techniques” and in Dov et al., “Audio-Visual Voice Activity Detection Using Diffusion Maps”. Among the audible sounds, it is necessary to distinguish speech from other audible sources of sound, such as music, artificial sounds, natural sounds, and noise. Various spectral analysis techniques may be used separately or in combination for the purpose of extracting the desired human voice segments HAi.
Regardless of the technique used, the output of the human voice analyzer 320 is provided to both a subtitle and human voice misalignment analyzer 340 and to a subtitle and audio content alignment unit 350.
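By way of a non-limiting illustration only, the kind of output the human voice analyzer 320 produces, namely a list of segments HAi each having a beginning and an end, may be sketched as follows. The sketch assumes a simple short-time energy threshold as a stand-in detector; this is not the technique of the embodiment, and on its own it would not distinguish speech from music or noise, for which the spectral techniques referenced above would be needed. The function names `frame_energies` and `voice_segments`, and the parameters `frame_ms`, `threshold`, and `min_frames`, are hypothetical and chosen for illustration.

```python
import math


def frame_energies(samples, frame_len):
    """Mean squared amplitude of each non-overlapping frame."""
    energies = []
    for start in range(0, len(samples) - frame_len + 1, frame_len):
        frame = samples[start:start + frame_len]
        energies.append(sum(x * x for x in frame) / frame_len)
    return energies


def voice_segments(samples, sample_rate, frame_ms=20, threshold=0.01, min_frames=3):
    """Return (begin_seconds, end_seconds) pairs for runs of active frames.

    Assumption: a frame is "active" when its energy exceeds a fixed
    threshold. This is a placeholder for a real VAD classifier.
    """
    frame_len = int(sample_rate * frame_ms / 1000)
    active = [e > threshold for e in frame_energies(samples, frame_len)]

    segments = []
    run_start = None
    # A trailing False acts as a sentinel that closes a run reaching the end.
    for i, is_active in enumerate(active + [False]):
        if is_active and run_start is None:
            run_start = i
        elif not is_active and run_start is not None:
            # Discard runs shorter than min_frames as likely clicks or noise.
            if i - run_start >= min_frames:
                begin = run_start * frame_len / sample_rate
                end = i * frame_len / sample_rate
                segments.append((begin, end))
            run_start = None
    return segments


# Synthetic input: 1 s of silence, 1 s of a 440 Hz tone, 1 s of silence.
rate = 8000
tone = [0.5 * math.sin(2 * math.pi * 440 * n / rate) for n in range(rate)]
signal = [0.0] * rate + tone + [0.0] * rate
print(voice_segments(signal, rate))  # [(1.0, 2.0)]
```

Each returned pair corresponds to one segment HAi; a list of this form is what would be passed on to the subtitle and human voice misalignment analyzer 340 and to the subtitle and audio content alignment unit 350.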