Weighting Survey Data: Not Necessarily a Brilliant Idea
Should one weight their survey data? Is it worth the effort? The short answer must be ‘maybe’ or ‘it depends’. A slightly longer and much more useful answer was given by Leslie Kish in his enormously helpful paper ‘Weighting: Why, when and how’. Today (well, actually I submitted the final manuscript 2.5 years ago – that’s scientific progress for you!), I have added my own two cent with a short chapter that looks at the effects and non-effects of common weighting procedures (in German). The bottom line is that if you employ the usual weighting variables (age, gender, education and maybe class or region) as controls in your regression, weighting will make next to no difference but might mess with your standard errors.


My instincts are also not to weight, and I agree with your argument if all effects are additive, but what about if you're worried about omitted variable bias not for main effects, but interaction effects?
For instance, for the sake of argument, let's assume that blacks get different income returns to education than whites (ie, there's a race*edu interaction) and that your data have an oversample of blacks such that they are half the sample. If you control but do not weight for race you're only controlling for the possibly different intercepts by race but not the interaction with education. You'll thus have an estimate of the grand slope that is the mean of the black slope and white slope, when in reality it should be more similar to the white slope because in the population whites are more numerous. On the other hand weighting should produce the correct grand slope. Maybe you should just specify the interaction effect, but a) interactions are a huge pain to interpret and b) it may not occur to you that a specific interaction has appreciable effects.
- spam
- offensive
- disagree
- off topic
Like