In the second control mode described above, the vertical control unit 70 reads out the pixel signals from the top pixel and the bottom pixel among the three pixels disposed consecutively along the column direction, which are designated as combination-target pixels, to the first vertical signal line VL1 and combines them on the first vertical signal line VL1, whereas it reads out the pixel signal from the pixel at the center to the second vertical signal line VL2. The arithmetic unit 50 then generates a sum pixel signal by adding together and averaging the combined pixel signal having been output on the first vertical signal line VL1 and the pixel signal from the pixel at the center, having been output on the second vertical signal line VL2. Through these measures, it becomes possible in the present embodiment to apply the greatest weight to the pixel signal from the pixel 10 at the center when combining signals from a plurality of pixels 10 disposed consecutively along the column direction. As a result, image data can be generated in conjunction with sum pixel signals, each generated by combining the pixel signals from the pixels around (above and below) the pixel at the center with the pixel signal from the pixel at the center.