In some examples, comparing the first temporally sequenced content represented by the first media data object with the second temporally sequenced content represented by the second media data object may include (1) rendering the first temporally sequenced content from the first media data object and (2) rendering the second temporally sequenced content from the second media data object.
In some examples, the first media data object and the second media data object may each include simultaneous video content and audio content. In these or other examples, identifying the set of common temporal subsequences between the first media data object and the second media data object may include identifying a set of common temporal video subsequences and a set of common temporal audio subsequences. In addition, executing the workflow based on the set of edits may include determining the workflow based at least in part on determining a difference between the set of common temporal video subsequences and the set of common temporal audio subsequences.
In some examples, identifying the set of edits may include generating metadata that indicates a start time and an end time for each subsequence within the set of common temporal subsequences.