Support #156: Variables and weights - Understanding Society User Support

Actions

Copy link

Support #156

open

Variables and weights

Added by Giulia Montresor almost 13 years ago. Updated 27 days ago.

Status:

Resolved

Priority:

Low

Assignee:

Redmine Admin

Category:

Data analysis

Start date:

06/09/2013

% Done:

100%

Description

Hello,
I would like to ask a few questions:

- In wave 2, what is the difference between the variable b_hcondn and b_hcondno? There is no description for b_hcondno in the documentation section of the Understanding Society website.

- I would like to use the variable ivintlang but I don't know what weight to apply, all the weights seems to have zero value for this variable.

- When I compute estimates using weights, e.g. indinus_xw, I see that I get different standard errors depending on whether I use the command:
svy, subpop (if varname=...): mean varname
or the command:
svy: mean varname, over(varname)
Why does this happen? What command is reliable?

Thanks and regards
Giulia

Actions

Copy link

Updated by Redmine Admin almost 13 years ago

Giulia,
Looks like it would be useful to have a short guide to the health condition module. This might take a little bit of time. In the short term, you would have to study the questionnaires to see how we ask this of new vs existing respondents.
The course materials here can give you some ideas of how to take the complex survey design into account in your estimates: https://www.iser.essex.ac.uk/bhps/courses

...,over(varname)

I guess Stata would protest if you ask for the mean of a variable over the same variable?
The data sets come with several weights each for a very specific purpose. The user guide is the best starting point to find out more about the different weights.
Hope some of these leads can be useful at least in the short term.
Jakob

Actions

Copy link

Updated by Olena Kaminska over 12 years ago

Giulia,

With regard to your question on subpopulation use in estimation of standard errors, I feel this is a question for a theoretical statistician. Within Understanding Society dataset we suggest using subpopulation option mainly because it helps with good correction for clustering (psu's).

Yet, here is a discussion why subpopulation option is better for other esimtation:
http://www.ats.ucla.edu/stat/stata/faq/svy_stata_subpop.htm

The difference arises because with 'if' option standard errors are calculated using only the subset of the dataset. With subpop option stata uses the whole dataset for calculating standard errors. The article suggest that the latter is a correct way of calculating standard errors.

Hope this helps,
Olena

Actions

Copy link