In some examples, the first media data object and the second media data object may each include simultaneous video content and audio content. In these examples, identifying the set of common temporal subsequences between the first media data object and the second media data object may include identifying both a set of common temporal video subsequences and a set of common temporal audio subsequences. Thus, in these examples, executing the workflow based on the set of edits may include determining the workflow based at least in part on determining a difference between the set of common temporal video subsequences and the set of common temporal audio subsequences. For example, if there is a discrepancy between the set of common temporal video subsequences and the set of common temporal audio subsequences (such that video changed where audio did not), systems described herein may determine that a scene involving the common subsequences is the same as before but the scene has been visually updated with new shots. Thus, these systems may execute a workflow based around adding new visual descriptions but not changing subtitles. In another example, where audio changed but video did not, these systems may execute a workflow based around adding new subtitles but not changing visual descriptions.
In some examples, a workflow may trigger transformation of stored data (e.g., stored media data). For example, the workflow may include transforming stored data corresponding to temporally sequenced content of the second media data object that falls outside the set of common temporal subsequences between the first media data object and the second media data object based at least in part on the set of edits. Thus, changed content may be targeted for a workflow where unchanged content may not be targeted for a workflow.