Understanding Society User Support: Issueshttps://iserredex.essex.ac.uk/support/https://iserredex.essex.ac.uk/support/support/favicon.ico?15995719382023-06-07T15:21:51ZUnderstanding Society User Support
Redmine Support #1916 (Resolved): xwavedat vs xwaveidhttps://iserredex.essex.ac.uk/support/issues/19162023-06-07T15:21:51ZAlexey Bessudnov
<p>Hello,</p>
<p>What is the difference in the universe between the xwavedat file (150,393 rows) and xwaveid file (128,036 rows)?</p>
<p>Thank you,</p>
<p>Alexey</p> Support #1902 (Resolved): weights individual files waves 10 -11 wide formathttps://iserredex.essex.ac.uk/support/issues/19022023-05-15T13:20:37ZAelen Valen
<p>Hi,</p>
<p>I am trying to merge individual files across waves 10 and 11 into wide format to create a 2019 calendar year dataset.<br />I used this method from "Box 1: Example syntax for pooled analysis for cross-sectional estimation relating <br />to calendar year 2011, with weight re-scaling" in <a class="external" href="https://www.understandingsociety.ac.uk/sites/default/files/downloads/documentation/user-guides/mainstage/weighting_faqs.pdf">https://www.understandingsociety.ac.uk/sites/default/files/downloads/documentation/user-guides/mainstage/weighting_faqs.pdf</a></p>
<p>ge wts=0 <br />replace wts=indpxui_xw if month>=13 & month<=24 <br />ge ind=1 <br />sum ind [aw=indpxui_xw] if month>=1 & month<=12 <br />gen jwtdtot=r(sum_w) <br />sum ind [aw=indpxui_xw] if month>=1 & month<=12 <br />gen kwtdtot=r(sum_w) <br />replace wts=indpxui_xw*(jwtdtot/kwtdtot) if month>=1 & month<=12</p>
<p>For the purpose of the research I am working on, I am using the equivalised household income and other variables referring to parental occupation, education and place of birth.</p>
<p>Since I am using it together with EUSILC 2019 for different EU countries, I was comparing the weights with the weights in EUSILC. While the sum of the weights in the latter equals on average the 80% of the real population in each country, the sum of weights of the dataset I created for UK 2019 (with the merge of wave 10 and 11) gives a number way lower than the census 2019 UK population.</p>
<p>Could you please help me understanding how those weights are constructed, which characteristics of the population they consider, whether they can comparable to ones in EUSILC and whether the procedure I followed to merge the two waves is correct. <br />Many thanks in advance for the support!</p> Support #1896 (Resolved): why there are so many cases inapplicable in terms of occupational class...https://iserredex.essex.ac.uk/support/issues/18962023-04-23T08:31:08ZSijia Du
<p>Dear User Support Team,</p>
<p>I was wondering why there were so many cases inapplicable in terms of occupational class (current job) for example in the l_indresp file. Any guidance and suggestions would be greatly appreciated.</p>
<p>Best wishes,</p>
<p>Sijia Du</p>
<p><a class="email" href="mailto:sijia.du@manchester.ac.uk">sijia.du@manchester.ac.uk</a><br /><a class="email" href="mailto:sijia.du@nankai.edu.cn">sijia.du@nankai.edu.cn</a></p> Support #1501 (Resolved): xtreg and fixed effectshttps://iserredex.essex.ac.uk/support/issues/15012021-02-04T18:16:38ZLeilah Plant-Tchenguiz
<p>Hi,</p>
<p>I have been trying to estimate a panel regression using xtreg, fe in Stata. My observations are at the level of individuals for multiple waves. Since svy is not supported, how can I run the fixed effects estimation taking account of sample design? I have included pweight, but don't know how to account for clustering and strata. Would I end the xtreg command with "vce(cluster psu)"? Or should I use "vce(cluster pidp)"? Or one source I read suggested creating a new variable consisting of unique combinations of psu and strata (call it uniquevar) and then using "vce(cluster uniquevar)". I don't really understand the econometrics behind this, so any help would be appreciated.</p> Support #1323 (Resolved): youth self completion longitudinal weighthttps://iserredex.essex.ac.uk/support/issues/13232020-03-18T13:37:54ZLinda Ng
<p>Hi US Support,</p>
<p>I have a question about the youth self-completion weight.</p>
<p>My analytical sample comprises of youths and young adults (aged 12-19) who filled in the Self-completion booklet (SC), using data from wave 3 and wave 6 involving the following combinations</p>
<p>A: Wave 3: Youth Sc -> Wave 6: Youth Sc<br />B: Wave 3:Youth Sc -> Wave 6: Young Adult SC<br />C: Wave 3:Young Adults SC -> Wave 6: Young Adult Sc</p>
<p>For C, I can use the young adult SC longitudinal weight f_indscub_lw<br />However, I cannot find a longitudinal weight for youth SC, (Group A), why is that?<br />Also, what should I do about weighting for youths SC who became young adults in Wave 6 (Group B)?</p>
<p>Thank you in advance.</p>
<p>Kind regards,<br />Linda</p> Support #919 (Resolved): weights for pooled cross-sections over waves (a)-(g) https://iserredex.essex.ac.uk/support/issues/9192018-02-14T11:15:55ZNico Ochmannnico.ochmann@postgrad.manchester.ac.uk
<p>Dear Peter, <br />I have a quick follow-up question with regard to my past issue <a class="issue tracker-3 status-5 priority-4 priority-default closed" title="Support: Pooling data from all waves, 1-6, using all subsamples of USoc (GPS, EMBS, BHPS, IEMB) (Closed)" href="https://iserredex.essex.ac.uk/support/issues/839">#839</a>. You wrote the following below:<br />A simpler approach would be to select based on current wave GOR. This will not remove the sample imbalance problem, but should reduce it to a minimum, so may be good enough.<br />For me to implement this simpler approach and to minimize the sample imbalance, you suggest to drop Scot, Wales, NI for the current wave g only? Or do you mean to drop all Scot, Wales, NI for all seven waves? <br />I would appreciate a short reply and sorry about the very delayed follow-up question. <br />Best. <br />Nico</p> Support #758 (Closed): weights for pooled cross-sections over waves (a)-(f) https://iserredex.essex.ac.uk/support/issues/7582017-03-29T17:32:21ZNico Ochmannnico.ochmann@postgrad.manchester.ac.uk
<p>Hi there,</p>
<p>I am running hourly wage (constructed with w_paygu_dv) on a number of regressors in a pooled cross-section over all six waves. So far, I am using the whole sample based on GPS, EMBS, BHPS, IEMBS. I am not sure what kind of weights to use in this context given that I want to use all four samples. f_indinui_xw is available for all four for wave 6, so do I just go ahead and use that one? <br />Any piece of advice would be terrific. <br />Thanks a lot!</p> Support #532 (Closed): what is the head of household variablehttps://iserredex.essex.ac.uk/support/issues/5322016-03-18T17:17:57Zaudrina limli.audrina@gmail.com
<p>I'm trying to identify the variable of the head of household to be combined with the head data from BHPS. BHPS has this direct indicator, however I searched every data set in the Understanding Society and could find it.</p>
<p>I also searched online and found that the head of household is called the reference person here. It says that people who answered the household survey is the reference person (or head of household). I looked at the household survey data (_hhresp) and there is more than one person who answered the survey (I used the variable _ivh1 to _ivh16).</p>
<p>Could you please help and direct me to the right variable of head of household? Thanks!</p> Support #362 (Closed): weights for longitudinal analysishttps://iserredex.essex.ac.uk/support/issues/3622015-04-27T13:38:08ZAnonymous
<p>Dear Alita Nandi,</p>
<p>I am postgraduate student aat the University of Lund. I am working on my final dissertation and I am using the UKHLS survey. I want to study the so-called Healthy Immigrant Effect which says that immigrants tend to be healthier than native-born when they arrive to the country, but they converge to the health of natives as time passes.</p>
<p>It is the first time that I use the UKHLS survey and I have used the material of the online course, in particular the weighting section. I must admit that everything is really well explained and do files are specially helpful. However, I still have a doubt about which weight to pick for my analysis and I was wondering if you could help me on that.</p>
<p>To first study the inequalitites in health status between immigrants and natives I will use the wave 1. Therefore I think tthat I have to use the weight a_indpxus_xw since I am using adult main and proxy interview data.</p>
<p>To study the "health assymilation effect", i.e: wether immigrants´ health converge to native levels, I plan to run a regression where the dependent variable is the difference in health between wave 4 and wave 1. I will include as explanatory variables country of origin and other control variables for wave 1. Since in this model I will also use data from wave 4 (health status in wave 4) I am guessing that if I use the cross-section weight for wave 1, I won´t be controlling for attrition. Therefore, I am really confused about what weight I should use. I would really appreciate if you could give me any advice about it.</p>
<p>Thanks in advance</p> Support #329 (Closed): year resp left f-t education / year first jobhttps://iserredex.essex.ac.uk/support/issues/3292014-12-09T07:58:54ZJanina Zeh
<p>Hello,<br />is there a variable which tells me the year the resp left f-t education or the year of the first job of the resp?<br />i've found one variable in w2 (bledendy), but not for the other years.<br />Kind regards,<br />Janina</p> Support #323 (Closed): which weight to usehttps://iserredex.essex.ac.uk/support/issues/3232014-11-14T12:57:41ZVeronique Siegler
<p>I am currently working on a publication with a very tight schedule using data from the Understanding Society. The publication will use different measures at an individual level, sometimes extracted from the questionnaire, sometimes from the self-completed questionnaire. Despite reading the Understanding Society literature, I am still unsure on which weight to use in which situation and would be grateful if you could clarify the following:</p>
<p>- lets say I would like to know the proportion of people who feel they belong to their local area (wave 3, adult self-completed questionnaire). After reading the user guide, my understanding is that I should use the following weight: c_indscub_xw (for combined cross-sectional adult self-completion weight) . However, a colleague is advising me to use the following weight: c_indinub_xw (for combined cross-sectional adult main interview weight).</p>
<p>- lets say I would like to know the proportion of people who are male and female (wave 3, adult questionnaire) . After the user guide, my understanding is that I should the following weight: c_indinub_xw (for combined cross-sectional adult main interview weight).</p>
<p>- lets now say I would like to know the proportion of people who feel they belong to their local area by age (one variable from the adult self-completed questionnaire and one variable from the questionnaire, both wave 3). The user guide suggests that "for individual level analysis you may want to combine information from different questionnaire sources. In this situation please select the weight suitable for the lowest level according to the hierarchy below" - which suggests to me that in that case, I should use the following weight: c_indscub_xw (for combined cross-sectional adult self-completion weight). However, again my colleague is suggesting that I should use the following weight: c_indinub_xw (for combined cross-sectional adult main interview weight). </p>
<pre><code>I expect the same logic would apply to other previous waves? (waves 2, 1 and BHPS) and future wave (wave 4)</code></pre>
<p>I would be so grateful if you could advise me regarding that important matter.</p> Support #296 (Closed): zero value weights using c_indnsub_xwhttps://iserredex.essex.ac.uk/support/issues/2962014-08-21T10:16:02ZEsther Curnockesther.curnock@glasgow.ac.uk
<p>Hi,</p>
<p>I have been carrying out cross-sectional analyses using the combined wave 2 & 3 nurse assessment dataset 'xindresp_ns', along with data from the main survey of the corresponding wave. Having read through the user guide I have been using the survey weight c_indnsub_xw, as I believe this is the correct one for the type of analyses I'm doing. However there seems to be a significant proportion (7.8%) of the combined nurse assessment dataset that have a weighting value of zero when this is applied, and I would be grateful if you could explain this so I can decide whether to continue using the weighting.</p>
<p>The user guide seems to indicate that c_indnsub_xw is equal to the longitudinal weight c_indnsub_lw for households with no TSM, and that c_indnsub_lw itself is calculated using a method that includes multiplication by the nurse inclusion weight b_indnsub_li. I think it might be this inclusion weight that leads to the zero weights for c_indnsub_xw. If so, does this mean that individuals with a zero weight are basically those that had nurse data collected despite falling outside the inclusion criteria such an assessment?</p>
<p>To try to understand the problem, I also looked at the separate nurse assessment datasets for each wave ('b_indresp_ns' and 'c_indresp_ns'). It seems that applying the cross-sectional weight 'b_indnsus_xw' to the wave 2 dataset results in only 0.7% of cases with weights of zero, whereas applying 'c_indnsbh_xw' to wave 3 dataset has 10.8% with weighting values of zero. I wasn't sure why there should be such a big difference between the two, but in any case, as the proportion of the combined dataset derived from the wave 2 GPS sample is much greater than those coming from the wave 3 BHPS group this didn't really explain the issue of high zero weights in the combined dataset.</p>
<p>Many thanks,<br />Esther</p> Support #145 (Closed): working with two waves and weightshttps://iserredex.essex.ac.uk/support/issues/1452013-05-08T18:07:11ZGiulia Montresorgmontr@essex.ac.uk
<p>Hello,<br />I want to use both waves 1-2 of US. In such case I think I need to keep only those individuals that responded in both waves.<br />I have to estimate the mean of life satisfaction for different groups of immigrants over the two years.<br />Therefore I need to use the longitudinal self-completion weight, b_indscus_lw.<br />I don't understand one thing:<br />The weight regards only observations of wave 2, therefore I need to keep only these observations? doing so I end up with a miserable number of individuals, and I cannot estimate the life satisfaction mean for the groups because the observations have zero weight. I attached my do file. I look forward to your kind reply, thanks</p> Support #142 (Closed): where is work experience in the BHPS?https://iserredex.essex.ac.uk/support/issues/1422013-04-16T15:50:10ZJames Nieldjames_nield@outlook.com
<p>Hi,</p>
<p>how do I get BHPS data on individual level work experience as appears in studies such as Blundell et al 2005 "job changes, hours changes and the path of labour supply adjustment"?</p>
<p>Many thanks</p>
<p>James</p> Support #128 (Closed): working with England & Waleshttps://iserredex.essex.ac.uk/support/issues/1282013-03-05T11:38:21ZCarolina Zuccotticarolina.zuccotti@eui.eu
<p>Hello, <br />I want to work with England and Wales only. Do I need to do any adjustments to the weights? <br />Or just drop the cases that do not belong to these countries?<br />Thanks,<br />Carolina</p>