Project

General

Profile

Actions

Support #2145

open

Inapplicable values in family relationship variables

Added by Kuoshi Hu 4 months ago. Updated 3 months ago.

Status:
Feedback
Priority:
Normal
Category:
Data documentation
Start date:
09/06/2024
% Done:

50%


Description

Dear Support Team,
Hi! I am investigating variables about the intergenerational relationship between adult and their parents (e.g. masee macon mafar pasee pacon pafar paidu paaid).
After identifying those who are in the same household with at least one of their parents/step-parents in egoalt (anwstat and alwstat are considered), and those who mentioned their parent/step-parent who are alive and not living with them (LVREL), and those parents who live together (parmar), I still find a lot of inapplicable values in masee macon mafar pasee pacon pafar. The large number of inapplicable values also influence my data cleaning for paidu and paaid.
I would like to ask what may be reasons of these inapplicable values after identifying parents (or step-parents) in the same household (in egoalt) or not living together with the interviewees (in indresp) ?
I also noticed that in xhhrel, some interviewees’ total number of biological parents is 0. Does it mean that their biological parent is not participated in this survey, or not alive, or due to other reasons?
Thank you so much for your help!
Have a nice day!

Best wishes,
Kuoshi


Files

2145.do (1.89 KB) 2145.do Understanding Society User Support Team, 09/16/2024 03:54 PM
Actions #1

Updated by Understanding Society User Support Team 3 months ago

  • Category set to Data documentation
  • Status changed from New to Feedback
  • % Done changed from 0 to 50
  • Private changed from Yes to No

Hello Kuoshi

Are you using Stata to process your data? If so, could you share your code so I can follow along and review the numbers you’re getting?

Regarding your question about the variable "bpx_N - total number of biological parents," according to the Family Matrix (xhhrel file) User Guide (https://www.understandingsociety.ac.uk/wp-content/uploads/documentation/user-guides/6614_main_survey_user_guide_family_matrix_xhhrel.pdf) section 4.1 Identifying individuals who have specific relatives in the study, individuals with bpx_N > 0 are those whose biological parents are not part of the Study. Since the xhhrel file creates an individual-level cross-wave dataset containing familial relationship identifiers for all sample members (i.e., those ever enumerated in the Study), this indicates that for participants with bpx_N > 0, their biological parents were never included in the Study.

Additionally, you can further identify participants with no biological parents in the Study (bpx_N > 0) who may have adoptive parents (apx_N > 0) or stepparents (spx_N > 0).

I hope this information is helpful.

Best wishes,
Roberto Cavazos
Understanding Society User Support Team

Actions #2

Updated by Kuoshi Hu 3 months ago

Understanding Society User Support Team wrote in #note-1:

Hello Kuoshi

Are you using Stata to process your data? If so, could you share your code so I can follow along and review the numbers you’re getting?

Regarding your question about the variable "bpx_N - total number of biological parents," according to the Family Matrix (xhhrel file) User Guide (https://www.understandingsociety.ac.uk/wp-content/uploads/documentation/user-guides/6614_main_survey_user_guide_family_matrix_xhhrel.pdf) section 4.1 Identifying individuals who have specific relatives in the study, individuals with bpx_N > 0 are those whose biological parents are not part of the Study. Since the xhhrel file creates an individual-level cross-wave dataset containing familial relationship identifiers for all sample members (i.e., those ever enumerated in the Study), this indicates that for participants with bpx_N > 0, their biological parents were never included in the Study.

Additionally, you can further identify participants with no biological parents in the Study (bpx_N > 0) who may have adoptive parents (apx_N > 0) or stepparents (spx_N > 0).

I hope this information is helpful.

Best wishes,
Roberto Cavazos
Understanding Society User Support Team

Dear Roberto,
Thank you so much for your response. I will share my code for cleaning mafar in wave3 as an example.
First, I try to clean mafar data in indresp wave 3.
Code: ****mother (biological or step-) alive?
gen c_mo=0 if c_lvrel1==0
replace c_mo=0 if c_lvrel9==0
replace c_mo=1 if c_lvrel1==1
replace c_mo=1 if c_lvrel9==1
recode c_mo (.=9) //proxy or other system missing
replace c_mafar=99 if c_mo==0
tab c_mafar
gen wave3=1

Then I know 23,767 interviewees did not mention they have a mother (biological or step-) alive and not living with them (c_mafar=99).

Second, I tried to find those mothers who are not mentioned due to co-residence in egoalt.
Code:
keep if c_relationship_dv==4|c_relationship_dv==7
gen c_mother=1 if c_relationship_dv==4&c_asex==2
gen c_mother_s=1 if c_relationship_dv==7&c_asex==2
gen c_moco=1 if c_mother==1|c_mother_s==1

sort pidp apidp
bysort pidp: gen nr = _n

keep pidp c_mother c_mother_s c_moco nr

reshape wide c_mother c_mother_s c_moco, i(pidp) j(nr)
gen c_mother=1 if c_mother1==1|c_mother2==1|c_mother_s1==1|c_mother_s2==1|c_mother_s3==1|c_mother3==1
replace c_mother=0 if c_mother1==.&c_mother2==.&c_mother_s1==.&c_mother_s2==.&c_mother3==.&c_mother_s3==.
gen c_moco=1 if c_moco1==1|c_moco2==1|c_moco3==1
replace c_moco=0 if c_moco1==.&c_moco2==.&c_moco3==.

Third, I merge these two datasets, after merging:
Code: keep if wave3==1
tab c_mafar
replace c_mafar=123 if c_moco==1& c_mafar==99
tab c_mafar
Then I know 5,174 interviewees’ mother are living with them, therefore they did not mention their mothers’ information and answer this question in indresp. However, I still do not know whether the remaining 18,593 interviewees’ mothers are alive or not.
I know that the co-residence situation may be included egoalt in wave2 or wave4, there is no big change in number of interviewees did not mentioned their mothers’ information after including these coresident information from wave 2 and 4. Therefore, I am wondering why so many interviewees did not mention their mothers in this survey and whether their mothers are still alive or not.
Could you please correct me if there is something wrong with my understanding of this dataset or my coding?
Have a nice day!

Best wishes,
Kuoshi

Actions #3

Updated by Understanding Society User Support Team 3 months ago

Hello Kuoshi,

Thank you for sharing your code, I believe you are correct. I did a similar exercise using the file “xhhrel” to identify the total number of respondents for whom we have information about their parents across the Study, and I arrived at similar numbers.

Just a reminder, the variable "Lvrel" excludes relatives who were living in the household at the time of the interview or who are no longer alive.

Out of the 23,767 respondents who did not mention a biological or adoptive/stepmother as a relative living outside their household or alive, 5,525 may have had a (biological, adoptive, step, or foster) mother living with them at some point in the study. For the remaining 18,242, we have no information about their mothers throughout the study, which suggests that they have passed away.

I’m attaching my code for reference.

I hope this information is helpful.

Best wishes,
Roberto Cavazos
Understanding Society User Support Team

Actions

Also available in: Atom PDF