Information identified as archived is provided for reference, research or recordkeeping purposes. It is not subject to the Government of Canada Web Standards and has not been altered or updated since it was archived. Please contact us to request a format other than those available.
Although there are high quality standards governing the collection and processing of census data, it is not possible to eliminate all errors. There are two kinds of population coverage error. Population undercoverage is the extent to which persons who should have been enumerated are not included in census data while population overcoverage is the degree to which census data include persons who were enumerated more than once, usually twice.
Undercoverage can occur in the first stage of the census if the list of dwellings constructed to cover the census dwelling universe is incomplete. This risk is higher, for example, if a dwelling is under construction. Conversely, overcoverage can occur if a dwelling is listed twice.
Coverage error is also likely to occur during the field data collection stage. Respondent error is responsible for coverage error when the person completing the census form either excludes or omits someone whose usual place of residence, according to the census rules, is the dwelling; this is undercoverage. Or, he/she includes someone whose usual place of residence is not the dwelling; this will become overcoverage if this person is also included at his/her usual place of residence or somewhere else. In most cases, it is easy to determine someone's usual place of residence. However, there are a number of situations where the process, as stated in the previous section, is not elementary and special rules have been created in order to define an individual's usual place of residence. Although the rules are set out in the census form, the list is long and there may be comprehension challenges. Coverage error may result when the rules are not consulted or when they are incorrectly applied. The notion of Census Day as the reference date, for example, for determining usual residence is also critical to the potential for coverage error.
Coverage errors can also be introduced during the processing stage at any point where records for persons or households are added or removed from the census database. Records can be erroneously cancelled or lost. Questionnaires may be linked to the wrong record or returned too late to be included.
Although efforts are made to enumerate the homeless population, the risk of undercoverage is high for this population. Some other living arrangements are particularly vulnerable to coverage error. Young adults newly away from home, for example, can be either undercovered because neither the roommates nor a parent lists them, or overcovered because the person is listed on both census forms. Similarly, persons who maintain a second residence because of their employment may be at risk of coverage error.
Users should also be aware of the extent to which Indian reserves and Indian settlements participated in the 2006 Census. In some cases enumeration was not permitted or was interrupted before it could be completed. In other cases the quality of the enumeration was considered inadequate. These geographic areas, a total of 22, are called incompletely enumerated Indian reserves and Indian settlements. Data for 2006 are therefore not available for the incompletely enumerated reserves and settlements, and are not included in tabulations. Similar problems have occurred in previous censuses. In the 2001 Census there were 30 Indian reserves and Indian settlements that were declared incompletely enumerated. Among these, 14 became participating reserves in the 2006 Census.In order to produce population estimates covering persons living on the 22 incompletely enumerated Indian reserves and Indian settlements, model-based estimates are produced. Since no reliable source exists to verify the assumptions used in the model, the estimates must be used with caution. You can find more information in Section 12.2.
Potential sources of coverage error were recognized during the planning of the 2006 Census, and the following measures were taken to minimize them:
These procedures, along with appropriate training, supervisory checks, and quality control systems during census collection and processing, helped to reduce the number of coverage errors.
Algebraic definitions of coverage errors are given in this section. Let represent the total or 'true' number of persons in the census target population. Then, let
be the published census count of the number of persons in the census target population. The error in using
instead of
is then:
This error, denoted as , is the net population coverage error.
Let denote population undercoverage.
is the number of persons not included in
who should have been.
Let denote population overcoverage where
is the number of persons included in
who should not have been. There are two components to
. The first is persons who were enumerated more than once. These duplicate enumerations should not have been included in
. The census coverage studies focus on duplicate enumerations. The second component of
is persons who were included in
who are not in the census target population. Foreign residents visiting Canada, for example, who are listed on a census form as usual residents of a dwelling should not be included in
. Fictitious persons are another example. The number of persons included that are not in the census target population has been seen by previous studies to be negligibly small. Therefore, the 2006 Census coverage studies did not measure this component of coverage error.
Since refers to persons who should be included in
and
refers to persons who should not be included in
, the difference between
and
is
less
. That is:
The true number of persons in the census target population is then:
An estimate of is given by
where:
is an estimate of the number of persons not included in
that should have been; and
is an estimate of the number of persons included in
who should not have been. Let us assume that overcoverage from persons included in
who are not in the census target population is zero. Therefore,
is restricted to an estimate of the number of duplicate enumerations. It is the goal of the census coverage studies to produce
and
.
Census population coverage error can be usefully expressed as rates relative to the true population: The undercoverage rate is
expressed as a percentage of
. The overcoverage rate
is
expressed as a percentage of
. The net undercoverage rate
is the difference between
and
expressed as a percentage of the census target population. These three rates can be estimated by
,
, and
as follows:
A positive net undercoverage rate indicates that undercoverage is larger than overcoverage. That is, there are more people not included in the published census count than the number of duplicated enumerations. This has been, and continues to be, the experience of the Canadian census. For some domains of interest, however, negative net undercoverage has recently been observed.
As defined above, is the number of persons not included in
who should have been. The census count
is composed of two elements:
where:
the number of enumerations. This is the number of people who were listed on a census form.
the number of imputed persons. This is an estimate of the number of persons missed in non-response dwellings and in occupied dwellings erroneously classified as unoccupied. More information on whole household imputation (WHI) can be found in Section 5.7.
Undercoverage, therefore, is a subset of all persons who were not listed on a census form but should have been. It does not include those who were not enumerated either because no completed census form was returned for the dwelling (non-response dwelling) or the dwelling did not receive a form because they were erroneously classified as unoccupied (misclassified occupied dwelling).
In summary, the true population is composed of the census count
and net undercoverage
.
consists of
plus the number of persons added in WHI
where the imputations are for persons living in non-response dwellings or in misclassified occupied dwellings.
is undercoverage
less overcoverage
.
Two postcensal studies were carried out to estimate 2006 Census population coverage error. The Reverse Record Check (RRC) provided estimates for population undercoverage while the Census Overcoverage Study (COS) estimated population overcoverage.
The RRC and the COS were conducted after census field collection and processing were complete. Preliminary estimates of 2006 Census population coverage error were released March 27, 2008. Following a lengthy and detailed validation exercise with the Demography Division and the provincial and territorial statistical focal points, final estimates were released on September 29, 2008. This release was concurrent with the release of new official population estimates reflecting the update of the base population to the 2006 Census. Census population counts adjusted for net population undercoverage formed the updated base population.The methodology of the two census coverage studies can be briefly described as follows:
Reverse Record Check (RRC)
In the RRC, a random sample of individuals representing the 2006 Census target population was taken from frames independent of the census such as a list of persons enumerated in the 2001 Census and a list of intercensal births according to provincial birth registries. The 2006 RRC sample consisted of 67,813 persons in the provinces and 1,938 persons in the territories. In addition, 84,522 enumerated persons with a weight of one contributed to the territorial estimates1. The 2006 Census database was searched to determine if the persons selected in the sample had indeed been enumerated.
When required, a telephone interview via computer-assisted telephone interviewing (CATI) out of the regional offices (ROs) was conducted to collect further information to declare the individual as in scope or not in scope for the census, and when in scope, to provide further data for searching. An interview was achieved for 84.2% of the 20,114 cases sent to the ROs. Sampling weights were adjusted to account for non-response whereby the total sampling weights of the non‑respondents was shared among a group respondents most like the non‑respondents in their propensity to respond.
Estimates of population undercoverage are based on persons in the RRC sample who were classified 'missed.' These persons have been found to be in scope for the 2006 Census but no evidence of enumeration in the 2006 Census could be found in the 2006 Census Response Database. Nationally, there were 5,431 persons selected by the RRC classified as missed in the provinces and 676 in the territories.
Census Overcoverage Study (COS)
Overcoverage was measured by matching the final 2006 Census database to a partial list of persons who should have been enumerated constructed from administrative data sources, and by matching the 2006 Census database to itself. The COS applied automated exact matching to the administrative sources and probabilistic matching to the census database. Probabilistic matching identifies matches that are close but not exact. Pairs of these potential duplicates were sampled and name and demographic characteristics were examined to identify overcoverage.
Note: