The Quarterly Labour Force Survey (QLFS) is a household-based sample survey conducted by Statistics South Africa (Stats SA). It collects data on the labour market activities of individuals aged 15 years or older who live in South Africa.
Kind of Data
Sample survey data [ssd]
Unit of Analysis
v2.1: Edited, anonymised dataset for public distribution
Version 2.0 of the QLFS 2010 Q2 was downloaded from the Statistics South Africa (Stats SA) website by DataFirst in January 2012. This version differs in a number of ways from a version obtained by DataFirst (from Stats SA) at some undeteremined time prior.
The first of the observable differences between old and new versions is the way in which observations that fit into "unspecified", "not applicable" or "missing" type categories are coded for certain variables. For example, in the older version of the QLFS 2010 Q1 the "Occup" variable is coded 88, with the associated label "Not applicable", for 65,445 observations. In the newer version this category of responses is assigned the code 0 and is not labelled (as it was in the previous version) for the same 65,445 observations. This recoding process has been applied to a large number of categorical variables in the datafile. A few other categorical variables have been recoded in a similar vein but as different (non-zero) values. For example, values of 888 for "Q4212TOTALHRS" have been redefined as having the value 88.
Second, a number of extra variables were introduced in the later version. It is unclear why these are not present in the older version of the datafile as they are detailed in metadata that was released at the same time as the original data:
1) "Metro_code" - Metropolitan area code (e.g. Cape Town, eThekwini, Johannesburg, etc.)
2) "Status_Exp" - Expanded unemployment status.
3) "Stratum" - 6 digit number representing stratum formed during master sample 2006 where digit 1 represents province, based on 2005 provincial boundaries, digits 2-3 represent the metro/non-metro area and digit 4 confers geography type.
Third, there are substantive differences between versions for “Geo_type” variable values. In total, there are 88 differences between versions for the “Geo_type” variable. These differences can be summarised as follows:
· 7 observations that were previously coded as “1” are now coded as “2”
· 3 observations that were previously coded as “0” are now coded as “1”
· 78 observations that were previously coded as “0” are now coded as “2”
The reason for these changes is unclear.
Finally, the two versions have different weights. To DataFirst's knowledge, the weighting changes are not clearly documented by Stats SA. The most likely explanation for the difference between the two sets of weights is that the newer version is calibrated to an updated set of mid-year population estimates. Users are advised to remain aware of these slight calibration differences when employing weights.
A new version of this dataset was released in 2014. The new version (our version 2.1) was reweighted to reflect the new population benchmarks from Census 2011.
This version, version 2.1, includes the new weights for the QLFS 2008-2013 series.
INDIVIDUALS: labour market activity, labour preferences, labour market history, demographic characteristics, marital status, employment status, education, grants, tax.
labour and employment
trade, industry and markets
population and demography.
Provincial and metropolitan level
The QLFS sample covers the non-institutional population except for those in workers' hostels. However, persons living in private dwelling units within institutions are enumerated. For example, within a school compound, one would enumerate the schoolmaster's house and teachers' accommodation because these are private dwellings. Students living in a dormitory on the school compound would, however, be excluded.
Producers and sponsors
Statistics South Africa
The QLFS frame has been developed as a general purpose household survey frame that can be used by all other household surveys irrespective of the sample size requirement of the survey. The sample size for the QLFS is roughly 30 000 dwellings per quarter.
The sample is based on information collected during the 2001 Population Census conducted by Stats SA. In preparation for the 2001 Census, the country was divided into 80 787 enumeration areas (EAs). Stats SA's household-based surveys use a Master Sample of Primary Sampling Units (PSUs) which comprises of EAs that are drawn from across the country.
The sample is designed to be representative at the provincial level and within provinces at the metro/non-metro level. Within the metros, the sample is further distributed by geography type. The four geography types are: urban formal, urban informal, farms and tribal. This implies, for example, that within a metropolitan area the sample is representative at the different geography types that may exist within that metro.
The current sample size is 3 080 PSUs. It is divided equally into four sub-groups or panels called rotation groups. The rotation groups are designed in such a way that each of these groups has the same distribution pattern as that which is observed in the whole sample. They are numbered from one to four and these numbers also correspond to the quarters of the year in which the sample will be rotated for the particular group.
The sample for the QLFS is based on a stratified two-stage design with probability proportional to size (PPS) sampling of primary sampling units (PSUs) in the first stage, and sampling of dwelling units (DUs) with systematic sampling in the second stage.
Statistics South Africa. Quarterly Labour Force Survey 2010: Q2 [dataset]. Version 2.0. Pretoria: Statistics South Africa [producer], 2010. Cape Town: DataFirst [distributor], 2012. DOI: https://doi.org/10.25828/wnm3-hp45