Furthermore, in one embodiment, the duration of haptic output for consonant phonemes may be shorter than that of vowel phonemes. This may occur either due to the shorter duration for each activation of a cutaneous actuator for a consonant phoneme versus a vowel phoneme, or due to the vowel phoneme using a two point sequence of haptic output versus a single haptic output for a consonant phoneme. For example, a haptic output for a consonant phoneme may be of a duration of 180 ms, while a haptic output for a vowel phoneme may be of a duration of 270.5 ms. For consonant phonemes, the duration of the haptic output may be determined experimentally by starting at a baseline duration (e.g., 150 ms) and adding the time needed to “ramp up” and “ramp down” (as described above), which may be 20% of the duration time. For vowel phonemes, the duration of the haptic output may be determined based on a separate baseline (e.g., 100 ms), plus the ramp up and ramp down interval, and plus the SOA interval.