Through the pixel signal readout method described above, sum pixel signals derived from the pixels 10 in three rows, i.e., the first through third rows, are read out. Once these sum pixel signals have been read out, the vertical control unit 70 reads out sum pixel signals derived from the pixels 10 in the fourth through sixth rows. In the second control mode described above, sum pixel signals are thus read out sequentially in units of three rows. Each sum pixel signal is generated by combining signals from three pixels disposed consecutively along the column direction. The sum pixel signals output in sequence from the arithmetic units 50 first undergo signal processing at A/D conversion units and the like, and are then output to the control unit 4. The control unit 4 generates image data (e.g., video image data) by using the sum pixel signals output from the image sensor 3.
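
The three-row readout sequence can be illustrated with a minimal software sketch. It only models the arithmetic result of the second control mode, not the circuitry of the image sensor 3. The function name `read_out_sum_signals`, the representation of a frame as a list of rows, and the handling of leftover rows are assumptions made for illustration, and the combining operation is assumed to be a plain sum.

```python
def read_out_sum_signals(frame, rows_per_group=3):
    """Yield one row of sum pixel signals per group of consecutive rows.

    frame is a list of rows, each row a list of pixel signal values.
    Each yielded value combines the pixels that share a column within
    one group, i.e. pixels disposed consecutively along the column direction.
    """
    if rows_per_group <= 0:
        raise ValueError("rows_per_group must be positive")
    num_rows = len(frame)
    # Assumption: only complete groups are read out; trailing rows that
    # do not fill a group are ignored.
    for top in range(0, num_rows - rows_per_group + 1, rows_per_group):
        group = frame[top:top + rows_per_group]
        # zip(*group) walks the group column by column.
        yield [sum(column) for column in zip(*group)]


# Six rows by two columns: rows 1-3 form the first group, rows 4-6 the second.
frame = [
    [1, 2],
    [3, 4],
    [5, 6],
    [10, 20],
    [30, 40],
    [50, 60],
]
binned = list(read_out_sum_signals(frame))
print(binned)  # [[9, 12], [90, 120]]
```

In this sketch the six input rows yield two output rows, so the row count of the data passed on for image generation is one third of the pixel row count.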