What is claimed is:1. A processing device comprising:a processor coupled to a memory storing instructions to permit the processor to function as:an envelope computation unit configured to compute an envelope for a frequency response of a sound pickup signal detected a microphone;a scale conversion unit configured to generate scale converted data by performing scale conversion and data interpolation on frequency data of the envelope;a normalization factor computation unit configured to divide the scale converted data into a plurality of frequency bands, obtain a characteristic value for each frequency band, and compute a normalization factor, based on the characteristic values;a normalization unit configured to, using the normalization factor, normalize the sound pickup signal in a time domain,a transform unit configured to transform the normalized sound pickup signal to a frequency domain and compute a normalized frequency response;a dip correction unit configured to perform dip correction on a power value or an amplitude value of the normalized frequency response; anda filter generation unit configured to generate a filter, using the normalized frequency response subjected to the dip correction and configured to output the filter to an out-of-head localization device which reproduces a reproduction signal on which an out-of-head localization is performed using the filter to headphones or earphones.2. The processing device according to claim 1, wherein the dip correction unit corrects a dip, using a different threshold value for each frequency band.3. The processing device according to claim 1, wherein the normalization factor computation unit obtains a plurality of characteristic values with respect to each of the frequency bands and computes the normalization factor by performing weighted addition of the plurality of characteristic values.4. A processing method comprising:a step of computing an envelope for a frequency response of a sound pickup signal detected by a microphone;a step of generating scale converted data by performing scale conversion and data interpolation on frequency data of the envelope;a step of dividing the scale converted data into a plurality of frequency bands, obtaining a characteristic value for each frequency band, and computing a normalization factor, based on the characteristic values;a step of, using the normalization factor, normalizing the sound pickup signal in a time domain,a step of transforming the normalized sound pickup signal to a frequency domain and compute a normalized frequency response;a step of performing dip correction on the normalized frequency response;a step of generating a filter, using the normalized frequency response subjected to the dip correction; anda step of outputting the filter to an out-of-head localization device which reproduces a reproduction signal on which the out-of-head localization is performed using the filter to headphones or earphones.5. A reproducing method comprisinga step of performing out-of-head localization on a reproduction signal, using the filter generated by the processing method according to claim 4.6. A non-transitory computer readable medium storing program causing a computer to execute a processing method, the processing method comprising:a step of computing an envelope for a frequency response of a sound pickup signal;a step of generating scale converted data by performing scale conversion and data interpolation on frequency data of the envelope;a step of dividing the scale converted data into a plurality of frequency bands, obtaining a characteristic value for each frequency band, and computing a normalization factor, based on the characteristic values;a step of, using the normalization factor, normalizing the sound pickup signal in a time domain,a step of transforming the normalized sound pickup signal to a frequency domain and compute a normalized frequency response;a step of performing dip correction on the normalized frequency response;a step of generating a filter, using the normalized frequency response subjected to the dip correction; anda step of outputting the filter to an out-of-head localization device which outputs signal performed by the out-of-head localization on a reproduction process using the filter to headphones or earphones.