 
      Comparing two methods of reweighting a survey file to small area data
    
      
        Cite  this article
        as: R. Tanton, P. Williamson, A. Harding; 2014; Comparing two methods of reweighting a survey file to small area data; International Journal of Microsimulation; 7(1); 76-99.
      doi: 10.34196/ijm.00094
      
  
- Article
- Figures and data
- Jump to
Figures
Tables
            Table 1
          
    
    
            Benchmarks used for creating small-area weights.
| Census table | Type | Dimensiona | Fully specifiedb | Benchmarks (no.) | 
|---|---|---|---|---|
| Age by sex by labour force status | Person | Multi | Yes | 32 | 
| Residents in different types of non-private dwelling | Person | Single | Yes | 8 | 
| Household Type | Household | Single | No | 1 | 
| Household size – number usual residents | Household | Single | Yes | 7 | 
| Dwelling tenure by weekly household rent | Household | Multi | No | 7 | 
| Dwelling tenure by household type | Household | Multi | Yes | 15 | 
| Dwelling tenure by weekly household income | Household | Multi | No | 16 | 
| Monthly household mortgage by weekly household income | Household | Multi | Yes | 12 | 
| Weekly household rental by weekly household income | Household | Multi | Yes | 20 | 
| Dwelling structure by household family composition | Household | Multi | No | 12 | 
| Total number of benchmark tabulations | 10 | |||
| Total number of benchmarks | 130 | |||
- 
                        aMulti-dimensional means cross-tabulations of variables. 
- 
                        bNot fully specified means that one or more of the cells in a benchmark tabulation were not used for weight production. For example, for the benchmark table of ‘Household Type’, the count of ‘Private households’ was extracted for use as a benchmark, whilst the count of ‘Non-private dwellings’ was excluded from the reweighting process. 
            Table 2
          
    
    
            Comparison of methods in summary.
| GREGWT | CO | |
|---|---|---|
| Approach | National household weights from a national survey dataset are reweighted to household weights for SLAs by constraining to small-area census counts | Selection of a combination of households from a national survey microdata set that best fit small-area census counts | 
| Weights | In fractional numbers | In integer numbers | 
| Preparation of census data | Needs to address re-allocation of ‘not-stated’ and ‘not applicable’ counts | Needs to address re-allocation of ‘not-stated’ and ‘not applicable’ counts | 
| Conflicting benchmark counts due to statistical disclosure measures | Causes non-convergence because no set of weights can be found that simultaneously satisfies all benchmarks | Seeks to minimise the difference between the final weights and the target benchmarks which typically results in weights that matchthe average of any discrepant benchmarks | 
| Optimisation strategy | Algorithm reaches an optimised solution when residual (i.e. difference between an synthetic estimate and the benchmark count) approaches zero | Minimise absolute or proportional error | 
| ‘Convergent’ & ‘non-convergent’ SLAs | In some cases no convergent solution may be found; Average Household Absolute Sum of Residuals is >1 provides a proxy indicator for this non-convergence. | No convergence issues, although final ‘optimal’ estimates may still fail to fit all user-supplied benchmarks | 
- 
                        
                        Source: NATSEM (GREGWT) and Williamson (CO). 
            Table 3
          
    
    
            Summary measures of goodness of fit.
| Measure | Description | 
|---|---|
| Overall Total Absolute Error (OTAE) | Absolute Sum of Residuals summed across all benchmark counts | 
| Overall Total Absolute Error per household (OTAE/HH) | Absolute sum of residuals per household across all benchmark counts | 
| Overall Total Absolute Proportional Error (OTAPE) | Absolute difference between benchmark counts when expressed as fraction of the table total | 
| Overall relative sum of Z-square scores (ORSumZ2) | For each benchmark table, the Z-score of each benchmark count squared, and summed for the table; then divide by chi-square critical value for table (--> RSumZ2), then sum across all tables (--> ORSumZ2). For a given table, RSumZ2 > 1 shows it is not fitting. | 
            Table 4
          
    
    
            Results for constrained variables, Australian Capital Territory.
(GREGWT ‘convergent’ SLAs only)
| Measure | GREGWT | CO (Min Proportion) | CO (Min Absolute) | 
|---|---|---|---|
| OTAE | 139.6 | 133.4 | 92.2 | 
| OTAE/HH | 0.1 | 0.1 | 0.1 | 
| OTAPE | 0.4 | 0.2 | 0.2 | 
| ORSumZ2 | 48.4 | 0.5 | 27.8 | 
- 
                        
                        Note: Lower numbers signify greater accuracy. 
            Table 5
          
    
    
            Results for constrained variables, New South Wales.
(GREGWT ‘convergent’ SLAs only)
| Measure | GREGWT | CO (Min Proportion) | CO (Min Absolute) | 
|---|---|---|---|
| OTAE | 602.9 | 483.1 | 979.3 | 
| OTAE/HH | 0.1 | 0.1 | 0.1 | 
| OTAPE | 0.2 | 0.1 | 0.2 | 
| ORSumZ2 | 60.5 | 1.9 | 29.2 | 
- 
                        
                        Note: Lower numbers signify greater accuracy. 
            Table 6
          
    
    
            Results for housing stress, Australian Capital Territory and New South Wales.
(GREGWT ‘convergent’ SLAs only)
| Number Unaffordable | ||||
|---|---|---|---|---|
| State | ABS | GREGWT | CO (Min Proportion) | CO (Min Absolute) | 
| Australian Capital Territory | 5,526 | 6,147 | 5,924 | 5,821 | 
| New South Wales | 169,823 | 194,394 | 191,720 | 189,269 | 
| Total | 175,349 | 200,541 | 197,644 | 195,090 | 
| % Unaffordable | ||||
| Australian Capital Territory | 5.9 | 5.9 | 5.7 | 5.6 | 
| New South Wales | 9.1 | 9.2 | 9.1 | 8.9 | 
| Combined | 9.0 | 9.0 | 8.9 | 8.8 | 
            Table 7
          
    
    
            
            Table 8
          
    
    
            Weights for GREGWT and CO.
Note: GREGWT convergent SLAs only.
| Maximum | Average non-zero value | |||
|---|---|---|---|---|
| Method | New South Wales | Australian Capital Territory | New South Wales Average | Australian Capital Territory Average | 
| CO (Min Proportion) | 443 | 24 | 3.49 | 1.45 | 
| GREGWT | 647 | 18 | 1.11 | 0.15 | 
Download links
A two-part list of links to download the article, or parts of the article, in various formats.
     
       
             
               
               
               
              