Understanding Society User Support: Issueshttps://iserredex.essex.ac.uk/support/https://iserredex.essex.ac.uk/support/support/favicon.ico?15995719382023-10-12T11:19:12ZUnderstanding Society User Support
Redmine Understanding Society User Support - Support #1982 (Resolved): reference person weights https://iserredex.essex.ac.uk/support/issues/19822023-10-12T11:19:12ZAmelia Wattsamelia.watts678@outlook.com
<p>Dear Olena/support team,</p>
<p>I'm selecting reference persons from households across waves to form a panel. Can the individual longitudinal weights for these respondents in the last wave be used as suboptimal weights in the analysis?</p>
<p>Many thanks, <br />Amelia</p> Understanding Society User Support - Support #1975 (Resolved): Weights - Cross-sectional Analysis...https://iserredex.essex.ac.uk/support/issues/19752023-09-19T09:55:04ZCaitlin Schmid
<p>Good morning,</p>
<p>Using the main survey, I aim to run a cross-sectional analysis on a number of variables to analyse sex differences between adults and their variation across Local Authority Districts. To increase the sample sizes, I want to pool UKHLS Waves 11 and 12. Do I require tailored weights or can I proceed with the two provided cross-sectional adults weights of the respective waves (_indinui_xw)?</p>
<p>Many thanks and best wishes,</p>
<p>Caitlin</p> Understanding Society User Support - Support #1902 (Resolved): weights individual files waves 10 ...https://iserredex.essex.ac.uk/support/issues/19022023-05-15T13:20:37ZAelen Valen
<p>Hi,</p>
<p>I am trying to merge individual files across waves 10 and 11 into wide format to create a 2019 calendar year dataset.<br />I used this method from "Box 1: Example syntax for pooled analysis for cross-sectional estimation relating <br />to calendar year 2011, with weight re-scaling" in <a class="external" href="https://www.understandingsociety.ac.uk/sites/default/files/downloads/documentation/user-guides/mainstage/weighting_faqs.pdf">https://www.understandingsociety.ac.uk/sites/default/files/downloads/documentation/user-guides/mainstage/weighting_faqs.pdf</a></p>
<p>ge wts=0 <br />replace wts=indpxui_xw if month>=13 & month<=24 <br />ge ind=1 <br />sum ind [aw=indpxui_xw] if month>=1 & month<=12 <br />gen jwtdtot=r(sum_w) <br />sum ind [aw=indpxui_xw] if month>=1 & month<=12 <br />gen kwtdtot=r(sum_w) <br />replace wts=indpxui_xw*(jwtdtot/kwtdtot) if month>=1 & month<=12</p>
<p>For the purpose of the research I am working on, I am using the equivalised household income and other variables referring to parental occupation, education and place of birth.</p>
<p>Since I am using it together with EUSILC 2019 for different EU countries, I was comparing the weights with the weights in EUSILC. While the sum of the weights in the latter equals on average the 80% of the real population in each country, the sum of weights of the dataset I created for UK 2019 (with the merge of wave 10 and 11) gives a number way lower than the census 2019 UK population.</p>
<p>Could you please help me understanding how those weights are constructed, which characteristics of the population they consider, whether they can comparable to ones in EUSILC and whether the procedure I followed to merge the two waves is correct. <br />Many thanks in advance for the support!</p> Understanding Society User Support - Support #1859 (Resolved): sample size loss due to weighting https://iserredex.essex.ac.uk/support/issues/18592023-02-20T14:26:07ZCaroline Kienast von Einem
<p>Hi,</p>
<p>I am aware that weighting will affect and alter the sample size of the analysis, however, I am working with a pooled sample of participants from wave 3-6 and when I specify a weighted model my sample drops from ~45k to 27k. This seems quite significant, particularly once I start to investigate subgroup characterstics..</p>
<p>Would you be able to confirm whether a drop by ~20k is normal once weighting is applied ( ai am using the longitudinal wave f weight "f_indinub_lw" / whether the below STATA code makes you think it is instead an error with my coding?</p>
<p>STATA CODE:</p>
<p>//Open wave 6:<br />use f_hidp f_psu f_strata pidp f_sex_dv f_age_dv f_indinub_lw using "$inpath\f_indresp", clear</p>
<p>save "test", replace</p>
<p>foreach w in c d e {</p>
<pre><code>// Extract the variables needed<br /> use "$inpath/`w'_indresp", clear<br /> isvar pidp `w'_addrmov_dv `w'_adcts `w'_distmov_dv `w'_mvyr `w'_mvever `w'_plnowy4 <br /> keep `r(varlist)'</code></pre>
<pre><code>// save each wave specific file<br /> save `w'junk.dta, replace<br />}</code></pre>
<p>// Open the file for wave f and then add the rest of the wave specific files<br />use "test", clear<br />foreach w in c d e {<br /> merge 1:1 pidp using `w'junk.dta<br /> drop _merge<br /> }</p>
<p>save "test", replace</p>
<p>// get rid of unwanted temporary files<br />foreach w in c d e {<br /> erase `w'junk.dta<br />}</p>
<p>mvdecode _all, mv(-9/-1)</p>
<p>//I only want those with data at wave 6 <br />drop if f_hidp==.</p>
<p>tabulate f_sex_dv // -> n= 45,186</p>
<p>svyset f_psu , strata(f_strata) singleunit(scaled)|| pidp, weight(f_indinub_lw)<br />svy: tabulate f_sex_dv, count col // -> n=27,094</p> Understanding Society User Support - Support #1827 (Resolved): Correct weights to usehttps://iserredex.essex.ac.uk/support/issues/18272022-12-07T09:40:16ZAmelia Wattsamelia.watts678@outlook.com
<p>Dear Olena,</p>
<p>I have two questions regarding using weights. I’m trying to conduct a cross-sectional analysis using data from a UKHLS wave.</p>
<p>1) If I select a sub-sample using respondents interviewed in certain years/months within a wave, can I still use the existing cross-sectional weights, or will I need to make adjustments to the cross-sectional weights?</p>
<p>2) All the dependent and independent variables are from one wave (eg wave 5), apart from one independent variable which was measured at an earlier wave (eg wave 2). I will match respondents from wave 5 and wave 2 to obtain the values of this independent variable. In this case, can I still use the cross-sectional weights in wave 5, or should I use the longitudinal weights in wave 5?</p>
<p>Thank you for your help.</p> Understanding Society User Support - Support #1794 (Resolved): Weights https://iserredex.essex.ac.uk/support/issues/17942022-10-27T09:34:20ZCaroline Kienast von Einem
<p>Hi team,</p>
<p>I have a question about applying survey weights: <br />When I am applying the weights do I have to do this via specific commands e.g. svy prefix in stata or is it also possible to multiply my varaible of interest with the weight to create a new weighted variable that I could then use alongside commands that cannot be combined with survey weights directly.</p>
<p>Thank you for your help.</p>
<p>Best wishes, <br />Caroline</p> Understanding Society User Support - Support #1743 (Resolved): Averaging regional data to obtain ...https://iserredex.essex.ac.uk/support/issues/17432022-08-05T10:14:30ZCarolin Schmidtcs2100@cam.ac.uk
<p>Hi there,</p>
<p>I am using wave 6 to study household heads' homeownership probabilities. I am looking at native Brits and immigrants (I came up with an immigrant dummy for every household head).</p>
<p>I would now like to generate a control variable for each of my household heads: the variable should reflect the proportion of immigrants in the UK region where the person resides (that is, every household head in e.g London will have the same immigrant share attached, etc.). I am wondering how I should calculate that average: does it have to be weighted (i.e. egen immishare = wtmean(immigrant), weight(indscui_xw) by(region) using the gwtmean package which calculates weighted statistics)? I would think so, because without weighting it, I would have an average immigrant share based on the (not-per-se representative) raw data. However, if I calculate a weighted mean, then I would effectively double-weight the data because the regression itself would be weighted too, no?</p>
<p>I am unsure how to proceed and would appreciate any help.</p>
<p>Best wishes,<br />Carolin</p> Understanding Society User Support - Support #1724 (Resolved): Weights and accounting for individ...https://iserredex.essex.ac.uk/support/issues/17242022-07-12T10:58:09ZCatherine Bunting
<p>Hello,</p>
<p>I am carrying out logistic regression to estimate the association between transitioning from unemployment to employment and health service use. My sample contains all individuals who were unemployed at UKHLS W7. Transition to employment is captured using their employment status at W8 and health outcomes are measured at W9.</p>
<p>I have created an equivalent cohort using individuals who were unemployed at W6, and pooled the two cohorts to increase my sample size. I therefore have some clustering by pidp, as individuals contribute twice to the analysis if they were unemployed at both W6 and W7.</p>
<p>In Stata, I am using svyset to specify the psu, cross-sectional weight (indscui_xw) and the strata. How can I also account for clustering by pidp? Normally I would use logistic regression with option vce(cluster pidp), but this is not possible when using the svy:logistic command.</p>
<p>Thank you!<br />Catherine</p> Understanding Society User Support - Support #1624 (Resolved): Weights for subsamplehttps://iserredex.essex.ac.uk/support/issues/16242022-01-06T14:49:27ZAshley Burdett
<p>Hello,</p>
<p>I am trying to estimate the fraction of people that transition to their first relationship (cohabitation or marriage) by age using the BHPS.</p>
<p>To do this I have constructed an unbalanced panel containing observations for individuals who have never had a relationship (marriage or cohabitation) before. Precisely I use observations for individuals that did not report a relationship in the marital history datasets but provided a full response to the wave 2 main survey. I also include observations for individuals that aged into the sample during the panel to increase my sample size.</p>
<p>I include observations for these individuals up until either they form their first relationship, they have a missing observation or the survey ends (2008).</p>
<p>Using this sample, I simply calculate the fraction of individuals observed at each age that transition to their first relationship at that given age.</p>
<p>My question is how do I appropriately incorporate weights into this analysis? I have tried numerous ways of approaching this problem and get very different results each time.</p>
<p>Many thanks in advance for your help.</p>
<p>All the best,</p>
<p>Ashley</p> Understanding Society User Support - Support #1239 (Resolved): Using weights on a subsample of UKHLShttps://iserredex.essex.ac.uk/support/issues/12392019-09-07T12:31:55ZAmanda Moorghen
<p>Hi,<br />I am running analysis (logit) on a subsample of UKHLS - wave 6 only, people under the age of 30.</p>
<p>I am using the following weights: <br />svyset f_psu [pweight=f_indinui_xw], strata (f_strata) singleunit(centered)</p>
<p>I wanted to check that this was the correct approach? I am unsure whether the weights should be used in the same way for a subsample of UKHLS as if you were analysing the full sample.</p>
<p>Thanks<br />AM</p> Understanding Society User Support - Support #987 (Resolved): Weighting of sub-samplehttps://iserredex.essex.ac.uk/support/issues/9872018-06-26T22:51:48ZAnte Bab2242@cam.ac.uk
<p>Dear Sir or Madam,</p>
<p>I would like to compare the means of several variables of a sub-sample (e.g. income, education) after data cleansing with those of the initial sample to test for representativeness of the sub-sample. If all variables are from the same wave (i.e. wave 4 of the UKHLS), cross-sectional weights can be applied. However, the sub-sample contains two variables that were not surveyed in wave 4, so they were carried forward from wave 1 and 3. Should in this case the variables for the comparison be weighted with the longitudinal weights of the last wave (i.e. wave 4) or should cross-sectional weights be used (i.e. cross-sectional weights from wave 1 and 3 for the two carried-forward variables and for the remaining variables, cross-sectional weights from wave 4)? The variables are from household level questionnaires and self-completion interviews, so that the lowest level of hierarchy is 1, which would suggest to use d_indscus_lw if longitudinal weights are appropriate? Do you agree?</p>
<p>Thank you for your help.</p>
<p>Best regards<br />Ante</p> Understanding Society User Support - Support #498 (Closed): weight youth self-completion + adult https://iserredex.essex.ac.uk/support/issues/4982016-02-04T10:02:59ZCarolina Zuccotticarolina.zuccotti@eui.eu
<p>Hello,<br />I would like to follow individuals (14-15 yrs) who completed the self-completion youth questionnaire into the adult questionnaire (16+). I am interested in the questions on parental involvement and how this affects their adult outcomes.<br />How should I weight this?<br />Let's say that I consider 14-15 yrs individuals in wave 1 and I follow them in wave 2 (and/or 3).<br />Many thanks,<br />Carolina</p> Understanding Society User Support - Support #456 (Closed): comparing across waveshttps://iserredex.essex.ac.uk/support/issues/4562015-11-27T16:55:32ZCarolina Zuccotticarolina.zuccotti@eui.eu
<p>Hello,<br />I wanted to know if it is possible to compare the effect of a variable in wave 1 with its effect in wave 5.<br />For example, has education a stronger effect in the probabilities of employment in 2009/2010 than in 2013/14?<br />At the naked eye, there seems to be a difference in the effect across waves. However, do you know if there might be a way to actually test this?<br />I would need to pool waves I assume. In that case, how should I weight the cases?<br />Many thanks in advance.<br />Carolina</p> Understanding Society User Support - Support #412 (Closed): Weights for BHPS and Understanding So...https://iserredex.essex.ac.uk/support/issues/4122015-09-09T13:04:23ZAndreas Wiedemannawiedem@mit.edu
<p>Hello,</p>
<p>I’ve merged the BHPS with the BHPS-subset of Understanding Society to create a longitudinal panel of BHPS respondents up until 2012 (i.e. I use the BHPS portion of Understanding Society). I am not entirely sure which weights I should use for the analysis. I’ve read the documentation of both dataset, but it is still not clear which weights are the best for my purpose. My goal is to re-create the same underlying population in both datasets, either for the UK or GB. Most importantly, however, I want to be consistent across these two dataset in order to analyze trends in, e.g., income over a time span covering both datasets. Most of my variables of interest are at the household level, but some are at the individual level. <br />Should I use the longitudinal BHPS weights (indin91_lw for individuals or the cross-sectional hhdenbh_xw for households)? And do I have to use weights only in the Understanding Society-part or also in the BHPS part of my panel.</p>
<p>Many thanks for your help,<br />Andreas</p> Understanding Society User Support - Support #291 (Closed): USOC cross-section household weight f...https://iserredex.essex.ac.uk/support/issues/2912014-07-29T10:19:38ZAlex Hurrellalexhurrell1@gmail.com
<p>I'm looking for the BHPS-sample cross-sectional household weight (n_hhdenbh_xw) for USOC wave 3. It is available in wave 2 but I can't find it in wave 2. Does it exist? If not, is it possible to derive it?</p>
<p>Many thanks,<br />Alex</p>