Understanding Society User Support: Issueshttps://iserredex.essex.ac.uk/support/https://iserredex.essex.ac.uk/support/support/favicon.ico?15995719382024-02-27T13:21:37ZUnderstanding Society User Support
Redmine Understanding Society User Support - Support #2060 (Resolved): Design weights taken account of in...https://iserredex.essex.ac.uk/support/issues/20602024-02-27T13:21:37ZRosie Cornish
<p>I think the answer to this is yes, but can you confirm that the household enumeration weights (e.g. a_hhdenus_xw) take account of the design weights - i.e. they are the product of the design weight and a household response weight?</p> Understanding Society User Support - Support #2058 (Resolved): Using longitudinal weights when co...https://iserredex.essex.ac.uk/support/issues/20582024-02-22T16:48:24ZJames Laurence
<p>Hi there,</p>
<p>I was just hoping to get some more advice regarding correctly weighting my analysis combining the mainstage and Covid-19 waves of the UKHLS. You kindly helped with a previous weighting issue I had for treating the data as repeated cross-sections. However, I am also hoping to conduct some fixed effects panel data analysis of the combined mainstage and Covid-19 waves (web survey only).</p>
<p>As a basic set-up, I am combining wave 9 of the UKHLS mainstage survey (the last mainstage survey that doesn’t cover the pandemic) with waves 1 to 9 of the COVID-19 survey. The data are in long format. As I would like to do some fixed effects longitudinal analysis, I believe I need to use the longitudinal weights. From my reading, I need to choose the longitudinal weight from the last wave of the survey I will be using – in this case wave 9 of the Covid-19 survey: ci_betaindin_lw</p>
<p>Applying this weight [ci_betaindin_lw] will give me a balanced panel, restricting the sample to everyone who participated in all 9-waves of the Covid-19 survey. However, I would also like to analyse wave 9 of the mainstage survey as part of a longitudinal, fixed effects analysis covering mainstage wave 9 and Covid survey waves 1-9. Is this possible? If so, is one approach to feed back the ci_betaindin_lw weight so that the people who were in wave 9 of the mainstage survey who were also present in all 9-waves of the Covid-19 survey have the weight value of ci_betaindin_lw? Therefore, the ci_betaindin_lw weight would cover the mainstage wave 9 sample and the Covid-19 sample.</p>
<p>In case it’s not clear, to make-up an example of the data in long-format, which contains wave 9 of the mainstage survey and waves 1-9 of the Covid survey. Pidp no. 111111 was present in wave 9 of the mainstage sirvey and all 9 waves of the Covid survey and had a value of 1.5 for their longitudinal weight at wave 9 of the covid survey (ci_betaindin_lw). So, my data would just look like this:</p>
<p><strong>[PIDP]</strong> <strong>[WAVE] [Value of ci_betaindin_lw]</strong><br />111111 Mainstage wave 9 <em>Missing Value</em><br />111111 COVID wave 1 1.5<br />111111 COVID wave 2 1.5<br />111111 COVID wave 3 1.5<br />111111 COVID wave 4 1.5<br />111111 COVID wave 5 1.5<br />111111 COVID wave 6 1.5<br />111111 COVID wave 7 1.5<br />111111 COVID wave 8 1.5<br />111111 COVID wave 9 1.5</p>
<p>Is just feeding back the value of ci_betaindin_lw (1.5) what I need to do? So, it would now look like:</p>
<p><strong>[PIDP]</strong> <strong>[WAVE] [Value of ci_betaindin_lw]</strong><br />111111 Mainstage wave 9 <strong>1.5</strong><br />111111 COVID wave 1 1.5<br />111111 COVID wave 2 1.5<br />111111 COVID wave 3 1.5<br />111111 COVID wave 4 1.5<br />111111 COVID wave 5 1.5<br />111111 COVID wave 6 1.5<br />111111 COVID wave 7 1.5<br />111111 COVID wave 8 1.5<br />111111 COVID wave 9 1.5</p>
<p>If so, could this method apply if I wanted to include more mainstage waves of data? So, if I wanted to include waves 6, 7, 8 and wave 9 of the mainstage survey alongside waves 1-9 of the Covid survey - would I just feed back an individuals' weight value for ci_betaindin_lw back so the individual have that weight value for mainstage waves, 6, 7, 8 and 9?</p>
<p>I may be completely misunderstanding how to use the longitudinal weights, or have missed something crucial meaning you can't applying the Covid longitudinal weights to the pre-Covid survey mainstage waves. If so, apologies in advance and any advice would be hugely appreciated.</p>
<p>Best wishes,</p>
<p>James</p> Understanding Society User Support - Support #2040 (Resolved): Survey Weightshttps://iserredex.essex.ac.uk/support/issues/20402024-01-25T10:27:40ZMartha Tindall
<p>Hi</p>
<p>I am conducting an analysis an dam struggling to determine the best weights to use and was hoping you could give me some guidance. My analysis uses data from the years 2018 to 2021 (inclusive) to conduct a TWFE linear model. My model includes a main effects and interaction term involving a binary variable for pre-pandemic and during-pandemic. I have the following questions regarding weighting.</p>
<p>1. Currently my pandemic cut off is March 2020, given the term of interest involves time, is it necessary to start the year 2018 in March and extend the data to March 2022 to ensure there is equal representation of sample months in each group, or is it okay to just go January 2018 to December 2021 (keeping the cut off in March 2020)?</p>
<p>2. I wish to use an unbalanced panel design as the subgroups I when I use longitudinal weights, my sample becomes just 12% of what it would be using an unbalanced panel. My question is how do I choose these weights? Guidance on the Understanding Society website is for creating balanced panels and using _lw weights, however in my situation this is not possible. Is it appropriate to apply the cross sectional weight for each observation in a given wave or is there something else I should be doing?</p>
<p>3. On the Understanding Society website you mention rescaling of weights for analysing by calendar year. First, is this required in my situation? Second, do you provide guidance for doing so in R as the only advice available is for stata which I am not familiar with.</p>
<p>Thank you in advance for your time and please let me know if you need any more information from me.</p>
<p>Martha</p> Understanding Society User Support - Support #2012 (Resolved): longitudinal weighthttps://iserredex.essex.ac.uk/support/issues/20122023-12-14T15:03:18ZMargherita Agnoletto
<p>Dear Understanding Society Team,</p>
<p>I am currently examining the relationship between flexible work arrangements (FWA) and some employees' outcomes.</p>
<p>Given that questions about FWA are asked every two waves, I have chosen to conduct a longitudinal analysis (FE) using waves 2, 4, 6, 8, and 10. Some of my outcomes come from the self-completion questionnaire. <br />As I understand, it is recommended to use the appropriate longitudinal weight from the last wave in my analysis (i.e. i_indinus_lw). However, I observe a significant loss of observations. <br />Given that my panel is unbalanced, could I use the corresponding longitudinal weight from the last available wave for each individual? For instance, if an individual 'i' has information until wave 8, I propose imputing the appropriate longitudinal weight from wave 8. Similarly, if individual 'k' has information until wave 6, I suggest imputing the weight from wave 6.</p>
<p>Thank you for your attention.</p>
<p>Kind regards</p> Understanding Society User Support - Support #2006 (Resolved): Longitudinal analysis using calend...https://iserredex.essex.ac.uk/support/issues/20062023-12-12T13:52:21ZMarina Kousta
<p>Hello,</p>
<p>I am reaching out to kindly request help on how to conduct longitudinal analysis using calendar year datasets.<br />1) Although online you state the published calendar year data are meant to be used for cross-sectional analysis, does that also stand for when we create our own calendar year datasets? Or is it meant to be a guidance only for when you release the pre-made calendar year data? If that is the case regardless, is there some way for us to still conduct longitudinal analysis after creating our own calendar year data?<br />2) Although you recommend using the w_month (sample month) to create calendar year data, would it still be ok to instead use the interview date instead, when the exact date is of great importance to the research question itself (i.e. when testing the introduction or removal of a social policy).</p>
<p>Many thanks in advance for your time and consideration.</p>
<p>Best wishes,<br />Marina</p> Understanding Society User Support - Support #2004 (Resolved): Selection of weightshttps://iserredex.essex.ac.uk/support/issues/20042023-12-11T16:11:12ZJoanna Clifton-SpriggJ.M.Clifton-Sprigg@bath.ac.uk
<p>Hello,</p>
<p>I am looking to use information on new parents (newmum/newdad), specifically dates of leave taken when child was born, in a difference in difference approach around the shared parental leave reform (2015).</p>
<p>Essentially, I will be comparing cohorts of parents who had a child before & after the reform. I will not be following specific parents longitudinally, at least not for the first part of the project.</p>
<p>I would like to run this analysis in calendar years, not waves, given that the reform happened in April 2015 & I will be comparing those with children born pre-April 2015 and post.</p>
<p>I have pooled waves 2-12 data and set this up in a long format. Now I am wondering what weights to apply.</p>
<p>1) Am I correct in thinking in this scenario cross-sectional weights will work? I would like to preserve as big a sample as possible as even without weighting sample size is a challenge.</p>
<p>2) If I can use cross-sectional weights, how can I apply them to this pooled data file, which includes waves 2-12? It is not clear to me from the user guide.</p>
<p>3) At which stage do I adjust for the calendar year analysis?</p>
<p>Thank you.</p> Understanding Society User Support - Support #1985 (Resolved): Representativeness of housing tenu...https://iserredex.essex.ac.uk/support/issues/19852023-10-24T13:13:42ZEoghan O'Brien
<p>I am looking at wave 11 responses in the hhresp table for the breakdown of housing tenure (tenure_dv) at the household level.</p>
<p>The screenshots attached include the % of each category (unweighted and weighted using "hhdenui_xw").</p>
<p>Comparing these figures with census results for tenure status in England and Wales (% of households by tenure), it appears that the number of private renters (in USoc "Rented private unfurnished" and "Rented private furnished" appears to be under represented (11.7% when weighted) relative to the census figures for England and Wales in 2021 (20.3%). I have tried limiting the USoc sample to just England and Wales household, but it does not materially change the results.</p>
<p>Link to census data here: <a class="external" href="https://www.ons.gov.uk/peoplepopulationandcommunity/housing/bulletins/housingenglandandwales/census2021">https://www.ons.gov.uk/peoplepopulationandcommunity/housing/bulletins/housingenglandandwales/census2021</a></p>
<p>Any info on why I may be finding this discrepancy would be very much appreciated.</p> Understanding Society User Support - Support #1982 (Resolved): reference person weights https://iserredex.essex.ac.uk/support/issues/19822023-10-12T11:19:12ZAmelia Wattsamelia.watts678@outlook.com
<p>Dear Olena/support team,</p>
<p>I'm selecting reference persons from households across waves to form a panel. Can the individual longitudinal weights for these respondents in the last wave be used as suboptimal weights in the analysis?</p>
<p>Many thanks, <br />Amelia</p> Understanding Society User Support - Support #1975 (Resolved): Weights - Cross-sectional Analysis...https://iserredex.essex.ac.uk/support/issues/19752023-09-19T09:55:04ZCaitlin Schmid
<p>Good morning,</p>
<p>Using the main survey, I aim to run a cross-sectional analysis on a number of variables to analyse sex differences between adults and their variation across Local Authority Districts. To increase the sample sizes, I want to pool UKHLS Waves 11 and 12. Do I require tailored weights or can I proceed with the two provided cross-sectional adults weights of the respective waves (_indinui_xw)?</p>
<p>Many thanks and best wishes,</p>
<p>Caitlin</p> Understanding Society User Support - Support #1908 (Resolved): Weights using the BHPS Consolidate...https://iserredex.essex.ac.uk/support/issues/19082023-05-26T21:16:07ZNatalia Carralero
<p>Hello. I am studying differences in single/partnered parents. To do so, I am using the British Household Panel Survey Consolidated Marital, Cohabitation and Fertility Histories (1991-2009) to identify my sample of single/non-single parents, and then, merging it with the BHPS individual questionnaire to get the relevant variables.<br />My question is, which weights should I be using? I was thinking on indin91_lw, but I am not entirely sure. <br />Besides, which type of weights are they? Frequency or analytic weights? <br />Thank you!</p> Understanding Society User Support - Support #1904 (Resolved): Using weights when variables have ...https://iserredex.essex.ac.uk/support/issues/19042023-05-17T09:59:53ZRichard Belcher
<p>Dear Olena,</p>
<p>I am running a pooled cross sectional individual level analysis (waves 1-9), using cross-sectional weights, but I am worried that by removing cases where sf-12 responses are -9, my sample is no longer nationally representative.</p>
<p>I am selected weights in my analysis that are appropriate for how the questions leading to the variables I want to use were administered. E.g. I am using self-completion questionnaire cross-sectional weights (waves 2+) as sf-12 is my dependent variable of interest in later models (weight appropriate for each wave). After aggregating the cross wave data there are a number of cases where cases with non-zero weights have missing value codes attached to them (or are NA due to me merging in household level data which is occasionally not collected). It is understandable that errors and non-response happens during the survey process. Am I safe to assume that some are random, e.g. the lack of household interviews being undertaken is random, so it wouldn't impact the weighting removing responses without that information. I am however worried that some may not be random and there may be some demographic or regional bias to -9 codes in the sf-12 variable, which prevent my sample from being nationally representative when weighted. I have 96% of the non-zero weighted samples remaining after removing those with errors, most of the reduction (3%) comes from "sf12mcs_dv" responses with the code -9.</p>
<p>Thanks for your help,</p>
<p>All the best,</p>
<p>Richard</p> Understanding Society User Support - Support #1902 (Resolved): weights individual files waves 10 ...https://iserredex.essex.ac.uk/support/issues/19022023-05-15T13:20:37ZAelen Valen
<p>Hi,</p>
<p>I am trying to merge individual files across waves 10 and 11 into wide format to create a 2019 calendar year dataset.<br />I used this method from "Box 1: Example syntax for pooled analysis for cross-sectional estimation relating <br />to calendar year 2011, with weight re-scaling" in <a class="external" href="https://www.understandingsociety.ac.uk/sites/default/files/downloads/documentation/user-guides/mainstage/weighting_faqs.pdf">https://www.understandingsociety.ac.uk/sites/default/files/downloads/documentation/user-guides/mainstage/weighting_faqs.pdf</a></p>
<p>ge wts=0 <br />replace wts=indpxui_xw if month>=13 & month<=24 <br />ge ind=1 <br />sum ind [aw=indpxui_xw] if month>=1 & month<=12 <br />gen jwtdtot=r(sum_w) <br />sum ind [aw=indpxui_xw] if month>=1 & month<=12 <br />gen kwtdtot=r(sum_w) <br />replace wts=indpxui_xw*(jwtdtot/kwtdtot) if month>=1 & month<=12</p>
<p>For the purpose of the research I am working on, I am using the equivalised household income and other variables referring to parental occupation, education and place of birth.</p>
<p>Since I am using it together with EUSILC 2019 for different EU countries, I was comparing the weights with the weights in EUSILC. While the sum of the weights in the latter equals on average the 80% of the real population in each country, the sum of weights of the dataset I created for UK 2019 (with the merge of wave 10 and 11) gives a number way lower than the census 2019 UK population.</p>
<p>Could you please help me understanding how those weights are constructed, which characteristics of the population they consider, whether they can comparable to ones in EUSILC and whether the procedure I followed to merge the two waves is correct. <br />Many thanks in advance for the support!</p> Understanding Society User Support - Support #1899 (Resolved): Household weightshttps://iserredex.essex.ac.uk/support/issues/18992023-05-10T11:13:29ZImogen Farthing
<p>Hello.</p>
<p>I am using individual data which I have joined onto household data, by hidp for each wave then put all of the waves into a dataframe (g to l). Then for each household (and wave), I have aggregated up some of the individual responses (e.g. max personal income, average financial security response etc). So I have a dataframe which has, for each household and wave, some household responses and some new columns which I have created. I am hoping to aggregate these up by region and year (e.g. in London in 2019 the average household income was x, and the average financial security answer by household was y) so need to use weights - however I am unsure which weights to use as I'm doing a longitudinal study on households (I plan to use the xhhrel file to link the households between waves).</p>
<p>Any advice would be much appreciated.</p> Understanding Society User Support - Support #1894 (Resolved): Weight for unbalanced and merged U...https://iserredex.essex.ac.uk/support/issues/18942023-04-21T14:30:20ZYanan Zhangzhangyanan0918@gmail.com
<p>Dear Sir/Madam,</p>
<p>I hope this message finds you in good health and high spirits.</p>
<p>I am currently working with individual-level data from the merged Waves 1-18 of the BHPS and Waves 1-8 of the UKHLS datasets. I have a couple of questions regarding the use of weights in my analysis. I would appreciate any guidance you could provide.</p>
<p>1. In my study, I am employing fixed effects estimates to analyze the relationship between two variables, x and y. Given this approach, is it necessary to apply weights to the analysis?</p>
<p>2. I have followed the guidelines and used the longitudinal weight provided in Wave 8 of the UKHLS. However, I understand that this weight is applicable only to those who have participated in all waves. Since many individuals have only participated in parts of the waves, I am unsure how to generate weights for these participants. Could you please advise on the appropriate way to handle this situation?</p>
<p>Thanks for your time!</p> Understanding Society User Support - Support #1892 (Resolved): Question regarding longitudinal we...https://iserredex.essex.ac.uk/support/issues/18922023-04-19T15:58:11ZJohanna Pauliks
<p>Dear support,</p>
<p>I've got another question regarding longitudinal weights in UKHLS. I'm using data from adult respondents (individuals) from wave 2 up to wave 10 and using the fixed-effects model (so I'm doing longitudinal analysis). According to the user guide, longitudinal weights are appropriate for this. But if I use the longitudinal weight from wave 10 I lose all cases of individuals who drop out of the sample in previous waves, even so they participated in all waves prior to this (let's say they participated from wave 2-8). This is a problem, as some of my subgroups are very small to begin with. Would it be possible to use the longitudinal weight from wave 10 for everyone who participated in all waves up to wave 10, the longitudinal weight from wave 9 for every respondent who participated until wave 9 and so on, or would this not be appropriate, and I need to create my own tailored weights?</p>
<p>Best regards<br />Johanna</p>