Then, the word classifying section 12 classifies words into a group of words appearing in common between the reference language and the target language and a group of words uncommon between the reference language and the target language (in step S30). For example, the word classifying section 12 executes morphological analysis on the target language learning corpus 22 and outputs words indicated by results of the analysis. The word classifying section 12 uses the words indicated by the results of the analysis and the alignment dictionary 23 to produce correspondence relationships between words in the target language and words in the reference language. Then, the word classifying section 12 uses the correspondence relationships to classify the words included in the word space of the target language into a group of words common to words included in the word space of the reference language and a group of words not common to words included in the word space of the reference language.
Subsequently, the parameter adjusting section 13 adjusts a learning parameter W′ for the distributed representation learning for the group of the words appearing in common and included in the word space of the target language (in step S40). For example, the parameter adjusting section 13 sequentially selects the words from among the group of the words appearing in common and replaces distributed representations of the selected words with distributed representations of the words in the reference language. For example, the parameter adjusting section 13 replaces the hidden layer of the Skip-gram model with the distributed representations of the words in the reference language. Then, the parameter adjusting section 13 adjusts the weight W′N×V that is the parameter between the hidden layer and the output layer.