The word representation generation module 520 may determine a word representation for each word in the sentence 402. The resulting word representations 528 are provided to the representation generation module 530, which determines, based on the word representations 528, a sentence representation 412 for the sentence 402 for use in a natural language processing task related to the sentence 402. In some embodiments, the word representations 528 may be organized together to directly form the sentence representation 412. In some other embodiments, the representation generation module 530 may further process the word representations 528, for example, by applying one or more additional neural network layers. The scope of the embodiments of the present invention is not limited in this regard.
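As a purely illustrative sketch of the two alternatives described above, the following Python code shows word representations either being organized directly into a sentence representation or being passed through one further layer. The function names, the use of plain lists as vectors, and the choice of a dense layer with a tanh activation are assumptions made for illustration only; the embodiments do not prescribe any particular layer type or data structure.

```python
import math
from typing import List

Vector = List[float]


def stack_word_representations(word_reprs: List[Vector]) -> List[Vector]:
    """First alternative: organize the word representations together,
    one row per word, to directly form the sentence representation."""
    return [list(v) for v in word_reprs]


def apply_dense_layer(word_reprs: List[Vector],
                      weights: List[Vector],
                      bias: Vector) -> List[Vector]:
    """Second alternative: further process each word representation with an
    additional layer. A dense layer with tanh is a hypothetical choice here."""
    processed = []
    for v in word_reprs:
        row = [
            math.tanh(sum(w * x for w, x in zip(w_row, v)) + b)
            for w_row, b in zip(weights, bias)
        ]
        processed.append(row)
    return processed


# Hypothetical word representations for a two-word sentence.
word_reprs = [[1.0, 2.0], [3.0, 4.0]]

# Direct organization of the word representations.
sentence_repr_direct = stack_word_representations(word_reprs)

# Further processing with an identity weight matrix and zero bias.
identity = [[1.0, 0.0], [0.0, 1.0]]
sentence_repr_processed = apply_dense_layer(word_reprs, identity, [0.0, 0.0])
```

In either case the result has one row per word of the sentence, so a downstream component can consume it in the same way regardless of which alternative is used.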
The sentence representation 412 may be utilized in various manners in different natural language processing tasks, for example, by the decoder 420 of the system 400. The manner in which the sentence representation 412 is utilized is likewise not limited in the embodiments of the present invention. Examples of the natural language processing tasks include machine translation, natural language inference (NLI), semantic role labeling, entity recognition, text summarization, reading comprehension, relation extraction, and so on.