Alternatively, or in addition, the weights may include a second set of weights, each determined based on the number of nodes having edges connected with the respective neighbor node in the set of neighbor nodes and on the number of nodes in the set of neighbor nodes. In a graph convolution based on the second set of weights (referred to as a second graph convolution), the word representation for the given node may be further based on the given node itself, in addition to the weighted summation over the set of neighbor nodes.
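By way of a non-limiting illustration, the second graph convolution described above may be sketched as follows. The function name `second_graph_convolution`, the dictionary-based graph layout, and in particular the weight form 1/sqrt(|N(i)| * |N(j)|) are assumptions made for this sketch only; the description states only that each weight depends on the two node counts. Any learned transformation or non-linearity applied at a layer is omitted.

```python
import math


def second_graph_convolution(neighbors, reps):
    """Apply one illustrative layer of the second graph convolution.

    neighbors: dict mapping each node to a list of its neighbor nodes.
    reps:      dict mapping each node to its current representation (list of floats).
    Returns a dict mapping each node to its updated representation.
    """
    updated = {}
    for node, nbrs in neighbors.items():
        # Self term: start from the given node's own representation.
        acc = list(reps[node])
        for nbr in nbrs:
            # Assumed weight: combines the number of neighbors of the given
            # node with the number of nodes connected to this neighbor.
            weight = 1.0 / math.sqrt(len(nbrs) * len(neighbors[nbr]))
            for k, value in enumerate(reps[nbr]):
                acc[k] += weight * value
        updated[node] = acc
    return updated


# Hypothetical three-node path graph: a - b - c.
neighbors = {"a": ["b"], "b": ["a", "c"], "c": ["b"]}
reps = {"a": [1.0, 0.0], "b": [0.0, 1.0], "c": [1.0, 1.0]}
updated = second_graph_convolution(neighbors, reps)
```

In this sketch, a neighbor that is itself connected to many nodes contributes less to the given node, while the self term keeps the given node's own representation in the result.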
The second graph convolution based on the second set of weights performed at each layer 710, . . . , 720 may be represented as follows: