FIG. 5A is a histogram of calculated differences between media segments that are considered different from each other. FIG. 5B is a histogram of calculated differences between media segments that are considered the same as each other. When threshold 502 is selected, very few media segments that are considered (e.g., by human judgment) the same as each other are classified by a comparison process as different from each other. In some examples, the systems described herein may use a threshold that leads to more false positives of sameness (e.g., those segment pairs to the left of threshold 502 in FIG. 5A, which are considered different but are classified as the same) than false negatives of sameness (e.g., those segment pairs to the right of threshold 502 in FIG. 5B, which are considered the same but are classified as different). Favoring false positives over false negatives may lead to a more accurate identification of the longest common subsequence, at least in part because a false positive is unlikely to cause a different longest common subsequence to be identified, whereas a false negative may interrupt a run of segments that would otherwise match.
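
By way of non-limiting illustration, the selection of a threshold such as threshold 502 might be sketched as follows. The sketch assumes that two labeled sets of calculated differences are available (one for segment pairs considered the same, one for pairs considered different) and that a pair is classified as the same when its difference does not exceed the threshold. The function names, the parameter max_false_negative_rate, and the sample values are hypothetical and are not taken from the description above.

```python
def pick_threshold(same_diffs, max_false_negative_rate=0.01):
    """Pick a threshold that bounds false negatives of sameness.

    same_diffs: calculated differences for pairs considered the same
    (the population of FIG. 5B). Hypothetical helper for illustration.
    """
    if not same_diffs:
        raise ValueError("same_diffs must not be empty")
    ordered = sorted(same_diffs)
    # Number of "same" pairs allowed to fall to the right of the threshold.
    allowed = int(len(ordered) * max_false_negative_rate)
    # Every pair at or below this value is classified as the same.
    return ordered[len(ordered) - 1 - allowed]


def error_rates(same_diffs, different_diffs, threshold):
    """Return (false_positive_rate, false_negative_rate) of sameness."""
    if not same_diffs or not different_diffs:
        raise ValueError("both populations must be non-empty")
    # False negative: a "same" pair whose difference exceeds the threshold.
    fn = sum(d > threshold for d in same_diffs) / len(same_diffs)
    # False positive: a "different" pair at or below the threshold.
    fp = sum(d <= threshold for d in different_diffs) / len(different_diffs)
    return fp, fn


if __name__ == "__main__":
    # Made-up sample differences, for illustration only.
    same = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]
    different = [8, 15, 20, 25, 30]
    threshold = pick_threshold(same, max_false_negative_rate=0.1)
    fp, fn = error_rates(same, different, threshold)
    print(threshold, fp, fn)
```

In this sketch the threshold is derived only from the population of pairs considered the same, so the false negative rate is bounded directly and the false positive rate is whatever results from the overlap of the two histograms. Other selection criteria may be used.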