In addition, a corresponding system for computer-generated and -executed automated workflows from media asset differentials may include at least one physical processor and physical memory including computer-executable instructions that, when executed by the physical processor, cause the physical processor to (1) access a first media data object and a different, second media data object that, when played back, each render temporally sequenced content, (2) compare first temporally sequenced content represented by the first media data object with second temporally sequenced content represented by the second media data object to identify a set of common temporal subsequences between the first media data object and the second media data object, (3) identify a set of edits relative to the set of common temporal subsequences that describe a difference between the temporally sequenced content of the first media data object and the temporally sequenced content of the second media data object, and (4) execute a workflow relating to at least one of the first media data object and the second media data object based on the set of edits.
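By way of illustration only, the four operations recited above may be sketched as follows. The sketch assumes that each media data object has already been reduced to an ordered list of per-segment fingerprints (for example, one hashable string per shot or frame); that reduction is not described here and is an assumption. The standard-library `difflib.SequenceMatcher` stands in for whatever comparison technique an implementation actually uses, and the function names, the handler table, and the sample fingerprints are hypothetical.

```python
from difflib import SequenceMatcher
from typing import Callable, Dict, List, Sequence, Tuple

# (tag, first_start, first_end, second_start, second_end), end indices exclusive
Edit = Tuple[str, int, int, int, int]


def common_subsequences(first: Sequence[str], second: Sequence[str]) -> List[Tuple[int, int, int]]:
    """Step (2): return (first_index, second_index, length) for each shared run."""
    matcher = SequenceMatcher(a=first, b=second, autojunk=False)
    # get_matching_blocks() ends with a zero-length sentinel, which is dropped here.
    return [(m.a, m.b, m.size) for m in matcher.get_matching_blocks() if m.size > 0]


def edits_between(first: Sequence[str], second: Sequence[str]) -> List[Edit]:
    """Step (3): return the non-matching spans that lie between the shared runs."""
    matcher = SequenceMatcher(a=first, b=second, autojunk=False)
    return [op for op in matcher.get_opcodes() if op[0] != "equal"]


def run_workflow(edits: Sequence[Edit], handlers: Dict[str, Callable[[Edit], str]]) -> List[str]:
    """Step (4): dispatch each edit to a handler chosen by edit type (hypothetical policy)."""
    return [handlers[edit[0]](edit) for edit in edits if edit[0] in handlers]


# Step (1): two hypothetical media data objects, already fingerprinted per segment.
first_object = ["s1", "s2", "s3", "s4", "s5"]
second_object = ["s1", "s2", "x9", "s4", "s5", "s6"]

shared = common_subsequences(first_object, second_object)
edits = edits_between(first_object, second_object)

# Hypothetical workflow actions keyed by edit type.
handlers = {
    "replace": lambda e: f"re-review second[{e[3]}:{e[4]}]",
    "insert": lambda e: f"review new content second[{e[3]}:{e[4]}]",
    "delete": lambda e: f"retire first[{e[1]}:{e[2]}]",
}
actions = run_workflow(edits, handlers)
```

In this sketch the shared runs play the role of the common temporal subsequences, and the edits are expressed as index ranges relative to them, so a downstream workflow can act only on the segments that changed rather than on the whole media data object.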