Archived Content

Information identified as archived is provided for reference, research or recordkeeping purposes. It is not subject to the Government of Canada Web Standards and has not been altered or updated since it was archived. Please contact us to request a format other than those available.

5. National Household Survey non-response bias indicators

5.1 Introduction

Because of the voluntary nature of the NHS, Statistics Canada put a great deal of effort into maximizing household participation, particularly using mixed collection methods and the wave collection methodology (see Chapter 1 on Data collection). Despite these efforts and a response rate similar to that of other voluntary surveys conducted by Statistics Canada (68.6%), almost one in three households did not participate. At the processing stage, total non-response was treated using a reweighting methodology. The purpose of reweighting is mainly to reduce the non-response bias in the estimates.

Non-response has two possible effects on the data. First, it contributes to an increase in the total variance of the estimates, since the actual sample size is smaller than the planned size. Second, it can introduce a bias in the estimates when non-respondents differ from respondents in the characteristics measured.Footnote1 This is referred to as non-response bias. Consequently, to assess data quality, it is recommended that efforts be made to determine whether there is a potential non-response bias and, if possible, to derive an indicator of its magnitude.

A non-response bias indicator is a tool attempting to identify the presence of a potential residual bias, i.e., after reweighting. Non-response bias occurs when the characteristics of interest of respondents differ from those of non-respondents. Unlike the sampling variance, it does not necessarily decline with an increase in sample size.

A positive non-response bias indicator indicates that the estimator overestimates the parameter of interest on average, and a negative non-response bias indicator indicates an underestimate. The magnitude of the non-response bias indicator indicates the extent of the potential bias in the estimate. Thus, a high value for the non-response bias indicator may indicate substantial differences between the characteristics of respondents and the characteristics of non-respondents.

5.2 Methodology of non-response bias indicators

The indicators of bias due to unit non-response were produced from a linked file of the 2011 and 2006 censuses. Using family name, address and date of birth, 73% of 2011 Census respondents were linked to their responses from the 2006 Census. This included responses from the 2006 long-form for a large portion of the 2011 NHS sample, including both respondents and non-respondents to the NHS.

The weights of the resulting matched sample were calibrated to estimates of the number of persons in the population common to both the 2006 and 2011 censuses. This calibration was done separately for 2011 NHS matched respondents and matched non-respondents. A second set of weights was calculated for the matched respondents. These weights used the 2011 non-response adjustment factors and were calibrated to the estimates of the total number of persons in the population common to both the 2006 and 2011 censuses. Using the two sets of weights, benchmark totals were obtained from the entire matched sample (i.e., 2011 NHS respondents and non-respondents) using the 2006 Census long-form questionnaire data and adjusted totals were obtained from the matched respondents sample again using the 2006 Census long-form questionnaire data.

The difference between an adjusted total and a benchmark total was used as an indicator of the bias due to total non-response in the 2011 NHS. Algebraically, for category k MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaaeaaaaaaaaa8 qacaWGRbaaaa@3707@  of the 2006 variable x j MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaaeaaaaaaaaa8 qacaWG4bWdamaaBaaaleaapeGaamOAaaWdaeqaaaaa@385C@  this bias indicator can be expressed as B ̂ ( X ̂ jk a )= X ̂ jk a X ̂ jk MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaWaaCbiaeaaqa aaaaaaaaWdbiaadkeaaSWdaeqabaWdbiablkWaKaaakiaacIcapaWa aCbiaeaapeGaamiwaaWcpaqabeaapeGaeSOadqcaaOWdamaaDaaale aapeGaamOAaiaadUgaa8aabaWdbiaadggaaaGcpaGaaiykaiabg2da 9maaxacabaWdbiaadIfaaSWdaeqabaWdbiablkWaKaaak8aadaqhaa WcbaWdbiaadQgacaWGRbaapaqaa8qacaWGHbaaaOGaeyOeI0Ydamaa xacabaWdbiaadIfaaSWdaeqabaWdbiablkWaKaaak8aadaWgaaWcba WdbiaadQgacaWGRbaapaqabaaaaa@4A7A@  with the benchmark X ̂ jk =  i s m w i x ijk MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaWaaCbiaeaaqa aaaaaaaaWdbiaadIfaaSWdaeqabaWdbiablkWaKaaak8aadaWgaaWc baWdbiaadQgacaWGRbaapaqabaGcpeGaeyypa0JaaiiOamaawafabe WcpaqaaaWdbeqan8aabaWdbmaaqabakeaacaWG3bWdamaaBaaabaWd biaadMgaa8aabeaapeGaamiEa8aadaWgaaqaa8qacaWGPbGaamOAai aadUgaa8aabeaaaSWdbeaacaWGPbGaeyicI4Saam4Ca8aadaWgaaqa a8qacaWGTbaapaqabaaao8qabeqdcqGHris5aaaaaaa@4A8A@  and the adjusted total X ̂ jk a = i s m,r a i w i x jk MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaWaaCbiaeaaqa aaaaaaaaWdbiaadIfaaSWdaeqabaWdbiablkWaKaaak8aadaqhaaWc baWdbiaadQgacaWGRbaapaqaa8qacaWGHbaaaOGaeyypa0Zaaybuae qal8aabaaapeqab0WdaeaapeWaaabeaeaacaWGHbWdamaaBaaaoeaa peGaamyAaaWdaeqaa0WdbiaadEhapaWaaSbaa4qaa8qacaWGPbaapa qabaqdpeGaamiEa8aadaWgaaGdbaWdbiaadQgacaWGRbaapaqabaaa peqaaiaadMgacqGHiiIZcaWGZbWdamaaBaaabaWdbiaad2gacaGGSa GaamOCaaWdaeqaaaWdbeqaniabggHiLdaaaaaa@4D5A@ , where, for unit i, a i MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaaeaaaaaaaaa8 qacaWGPbGaaiilaiaadggapaWaaSbaaSqaa8qacaWGPbaapaqabaaa aa@39E2@  is a total non-response adjustment, w i MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaaeaaaaaaaaa8 qacaWG3bWdamaaBaaaleaapeGaamyAaaWdaeqaaaaa@385A@  is a calibrated weight, x ijk MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaaeaaaaaaaaa8 qacaWG4bWdamaaBaaaleaapeGaamyAaiaadQgacaWGRbaapaqabaaa aa@3A3A@  is the value of variable j MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaaeaaaaaaaaa8 qacaWGQbaaaa@3705@  for category k, s m MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaaeaaaaaaaaa8 qacaWGRbGaaiilaiaadohapaWaaSbaaSqaa8qacaWGTbaapaqabaaa aa@39FA@  is the matched sample and s m,r MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaaeaaaaaaaaa8 qacaWGZbWdamaaBaaaleaapeGaamyBaiaacYcacaWGYbaapaqabaaa aa@3A01@  is the set of 2011 respondents in the matched sample.

5.3 Limitations of the NHS non-response bias indicators

There are several limitations to these indicators. The indicator reflects total non-response bias and thus does not account for item non-response. It is an indicator for the population in common between 2006 and 2011. In other words, the potential bias for births, recent immigrants (last 5 years) and non-permanent residents (last 5 years) is neglected. The indicator is based on a matched sample, so unmatched units that were part of the common population are not represented. The weight adjustment that accounts for the unmatched portion is likely imperfect. The indicator is based on the 2006 variables which are proxies to the 2011 variables. It lacks precision for geographic areas with a small matched sample, often the smaller geographic areas. Finally, its reliability is also affected by non-response to the 2006 Census short and long forms as well as to the 2011 Census. In sum, the bias indicators are exactly as the name implies, indicators of bias and not measures or estimates of bias.

5.4 Some NHS non-response bias indicators

For illustrative purposes, the non-response bias indicators shown in Table 5.4.1 were calculated at the Canada level for a few characteristics where there is potentially a higher risk of bias.

  • Immigration status (landed immigrant, non-permanent resident)
  • Aboriginal identity (First Nations (North American Indian) single identity, Métis single identity, Inuk (Inuit) single identity)
  • Visible minority (South Asian, Chinese, Black, Filipino, Latin American)
  • Highest certificate, diploma or degree (no certificate, diploma or degree; secondary (high) school diploma or equivalent; trades certificate or college diplomaFootnote2; bachelor's degree).

Table 5.4.1
Bias indicators at the Canada level for the 'Immigration status,' 'Aboriginal identity,' 'Visible minority' and 'Highest certificate, diploma or degree' characteristics

Table summary
This table displays the results of Bias indicators at the Canada level for the 'Immigration status. The information is grouped by Characteristic (appearing as row headers), Bias indicator
(%) (appearing as column headers).
Characteristic Bias indicator
(%)
Source: Statistics Canada, 2011 National Household Survey.
Immigration status
Landed immigrant -1.2
Non-permanent resident -0.8
Aboriginal identity
First Nations (North American Indian) single identity 1.7
Métis single identity 0.2
Inuk (Inuit) single identity 6.1
Visible minority
South Asian 1.3
Chinese 0.9
Black -3.4
Filipino 7.3
Latin American -1.1
Highest certificate, diploma or degree
No certificate, diploma or degree -0.7
Secondary (high) school diploma or equivalent -0.9
Trades certificate or college diploma 0
Bachelor’s degree 1.9
Date modified: