During the first two weeks of May 2006, 70% of Canadian households received census questionnaires through the mail, while the remaining 30% had their questionnaire delivered by an enumerator, as in past censuses. In 2006, respondents "counted themselves in" when they completed their questionnaire online or mailed it back.
The processing phase of the census began as early as May 2, since households could choose to complete their forms online as soon as they received their paper questionnaire. This phase began with the process of translating responses from approximately 12.7 million households into meaningful data. This part of the census cycle is divided into six main activities:
block canvass - a physical verification and validation field exercise to update address listings in most urban areas to produce a complete and reliable list of addresses (which are listed on the Block Canvass Register), allowing for the mail-out of census questionnaires in selected urban and urbanized areas. Block canvass covers about 70% of dwellings in Canada.
census - a statistical portrait of Canada on one particular day: May 16, 2006. The census reflects demographic, social and economic information about people, housing units and agricultural operations in Canada on that day.
collection unit (CU) - refers to all dwellings in a geographic unit. There are 50,000 collection units for the 2006 Census.
list/leave (L/L) - refers to the delivery (as opposed to mail-out) of questionnaires to 30% of dwellings in Canada by a census enumerator.
visitation record (VR) - is a document used by enumerators in L/L areas to record summary information about each dwelling. There is one VR for each collection unit.
Data Processing Centre staff in Gatineau, Quebec, were responsible for the registration of completed census questionnaires. Initial receipt, via the reading of barcodes through the see-through portion of the return envelope, was conducted by Canada Post.
For the 2006 Census, Canada Post delivered the mailed-back paper questionnaires to the Data Processing Centre while electronic questionnaires were transmitted directly. Electronic questionnaires were registered automatically and paper questionnaires were checked in by scanning the barcode on the front of the questionnaire.
The 2006 Census was the first census to capture data automatically using automated capture technologies rather than manual keying.
Some automated completion editing simulating edits an enumerator would have done in previous censuses were performed at this stage to check for completeness, consistency and coverage. Multiple responses for one household may have been received and flagged for subsequent interactive verification if an error was identified.
When a missing or invalid response is uncovered, interactive verification may be used to resolve failures by manually examining the captured data and scanned images (where available) to help determine the appropriate response. When necessary, failed household questionnaire data were transmitted to a regional Census Help Line site for failed edit follow-up, where an operator contacted the respondent and completed the information using a computer-assisted telephone interviewing application. The data was transmitted back to the Data Processing Centre for reintegration into the system for subsequent processing.
The 2B long-form questionnaire contained questions where answers could be checked off against a list, as well as questions where the respondent had to write in an answer in the boxes provided. These written responses had to be converted to numerical codes before they could be tabulated for release purposes. For the 2006 Census, all written responses on the long questionnaires underwent automated and computer-assisted coding to assign each one a numerical code using Statistics Canada reference files, code sets and standard classifications. Reference files were built using actual responses from past censuses for the automated match process. Specially trained coders and experts resolved cases that could be assigned automatically.
The variables for which coding applied were:
In 2006, it is expected that over 40 million write-ins will be coded, of which an average of about 75% will be coded automatically.
The data collected in any survey or census contain omissions or inconsistencies. These errors can be the result of respondents missing a question, or they can be due to errors generated during processing. For example, a respondent may be unwilling to answer a question, may fail to remember the right answer, or may misunderstand the question. Census staff may code responses incorrectly or may make other mistakes during processing.
After the capture, initial editing and corrections, and coding operations are complete, the data are processed through the final edit and imputation activity. The editing process detects errors and the imputation process corrects them. The edit and imputation phase is important because:
Questions on age, sex, marital status, mother tongue and relationship to Person 1 are asked of 100% of the population. However, the bulk of census information is acquired on a 20% sample basis using the additional questions on the 2B questionnaire. "Weighting" is used to project the information gathered from the 20% sample to the entire population.
The weighting method provides 100% representative estimates for the 20% data and maximizes the quality of sample estimates.
For the 2006 Census, weighting will employ the same methodology used in the 2001 Census, known as calibration estimation. This begins with initial weights of approximately 5 and then adjusts them by the smallest possible amount needed to ensure closer agreements between the sample estimates (e.g., number of males, number of people aged 15 to 19) and the population counts for age, sex, marital status, common-law status and household.