The word-phoneme converter 1108 converts the input words 1106 into sequences of phonemes. The output of the word-phoneme converter 1108 may be in the form of a standard phonetic alphabet, such as the International Phonetic Alphabet (IPA), or may be in another format, such as a compressed phonetic alphabet in which the number of phonetic symbols corresponds to the total number of phonemes available in the language of the input words 1106. The word-phoneme converter 1108 may convert words to phonemes based on a word-phoneme sequence database, or by using a set of rules that convert letters to sounds. These rules consider the context of letters within an input word 1106 to generate an approximate phonetic spelling of the word. In some cases, an input word 1106 may be composed of multiple individual words, such that the input word 1106 as a whole has no corresponding entry in the word-phoneme sequence database while its individual sub-parts do. The word-phoneme converter 1108 may search an input word 1106 that is not recognized in the word-phoneme sequence database for sub-parts that are recognized and, if such sub-parts are found, may convert the individual sub-parts to their phonetic counterparts.
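
By way of a non-limiting illustration, the conversion described above may be sketched as follows. The sketch is an assumption-laden example rather than the actual implementation of the word-phoneme converter 1108: the names `LEXICON`, `VOWEL_SOUNDS`, `split_into_known_parts`, `letters_to_sounds`, and `convert_word` are hypothetical, the ARPAbet-style symbols and the three-entry database are toy data, and the order of the fallbacks (database lookup, then sub-part search, then letter-to-sound rules) is one possible arrangement that the description does not mandate.

```python
from typing import Dict, List, Optional

# Hypothetical toy word-phoneme sequence database (ARPAbet-style symbols).
LEXICON: Dict[str, List[str]] = {
    "book": ["B", "UH", "K"],
    "case": ["K", "EY", "S"],
    "note": ["N", "OW", "T"],
}

# Hypothetical default sounds for vowels; other letters map to themselves.
VOWEL_SOUNDS: Dict[str, str] = {
    "a": "AE", "e": "EH", "i": "IH", "o": "AA", "u": "AH",
}


def split_into_known_parts(word: str, lexicon: Dict[str, List[str]]) -> Optional[List[str]]:
    """Split `word` into sub-parts that all appear in `lexicon`.

    Tries the longest prefix first and backtracks when the remainder
    cannot be split. Returns None when no complete split exists.
    """
    for end in range(len(word), 0, -1):
        prefix = word[:end]
        if prefix not in lexicon:
            continue
        if end == len(word):
            return [prefix]
        rest = split_into_known_parts(word[end:], lexicon)
        if rest is not None:
            return [prefix] + rest
    return None


def letters_to_sounds(word: str) -> List[str]:
    """Approximate a phonetic spelling using a few context-sensitive rules."""
    phonemes: List[str] = []
    i = 0
    while i < len(word):
        ch = word[i]
        nxt = word[i + 1] if i + 1 < len(word) else ""
        if ch == "p" and nxt == "h":
            # Context rule: the digraph "ph" is treated as one sound.
            phonemes.append("F")
            i += 2
            continue
        if ch == "c":
            # Context rule: "c" is soft before e, i, or y, and hard otherwise.
            phonemes.append("S" if nxt in {"e", "i", "y"} else "K")
        else:
            phonemes.append(VOWEL_SOUNDS.get(ch, ch.upper()))
        i += 1
    return phonemes


def convert_word(word: str, lexicon: Dict[str, List[str]] = LEXICON) -> List[str]:
    """Convert a word to phonemes: database, then sub-parts, then rules."""
    key = word.lower()
    if key in lexicon:
        return list(lexicon[key])
    parts = split_into_known_parts(key, lexicon)
    if parts is not None:
        # Concatenate the phoneme sequences of the recognized sub-parts.
        return [p for part in parts for p in lexicon[part]]
    return letters_to_sounds(key)


if __name__ == "__main__":
    print(convert_word("bookcase"))  # ['B', 'UH', 'K', 'K', 'EY', 'S']
```

In this sketch, "bookcase" has no entry of its own in the toy database, but its sub-parts "book" and "case" do, so their phoneme sequences are concatenated. A word with no recognized sub-parts, such as "cat", falls through to the letter-to-sound rules, which yield only an approximate phonetic spelling.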