Sunday, Oct 02, 2022

A statistical embarrassment

Data claiming 45-year peak in unemployment is misleading. Indian institutions are still operating with the technology and outlook of 70 years ago.

These data were for the period July 2017-June 2018, and this is the last official government survey on the labour market — it yielded the result that unemployment, at 6.1 per cent, was at a 45-year high. (Illustration: CR Sasikumar)

There is an old saying — no, it is not a Chinese proverb — which says, “Be careful what you wish for, it might come true”. At present, the political Opposition has made jobs and unemployment their number one issue to unseat the Narendra Modi government. If I were a Congress strategist, I would do exactly what they are doing. Jobs, and the economy, is where the BJP maybe the most vulnerable. And the perception of vulnerability is enhanced by the BJP’s misguided stubbornness to not release the Periodic Labour Force Survey (PLFS) data on the labour market. These data were for the period July 2017-June 2018, and this is the last official government survey on the labour market — it yielded the result that unemployment, at 6.1 per cent, was at a 45-year high.

Statisticians and economic experts, who should know better and probably do, are also siding with the Opposition with the belief (wish!) that the government will be intransigent enough not to release the data and thereby allow the perception to continue that the economy is really as weak as indicated by the PLFS data. This is where being careful of what you wish for comes in; once released, the PLFS data is likely to show that its conclusions are extremely flawed, and embarrassingly inaccurate.

In the next few paragraphs (and table) I report on labour market data for 2011-12, and 2017-18. As the reader peruses the evidence, she will realise that the PLFS data, and results, are in a class of their own — they don’t conform to any known economic behaviour. But we won’t know that until the data are released.

While the media (obviously!) has concentrated on the reported 45-year high in the unemployment rate (6.1 per cent in 2017-18 vs the 2.2 per cent in 2011-12), it is prudent to interpret in a detached manner other economic trends contained in the “leaked” report.

Subscriber Only Stories
From The Explained editor: The ban on PFI, forests at night, and free gra...Premium
From the Opinion Editor: An election for the Congress partyPremium
From Nehru to JP, the political leaders mentored by GandhiPremium
Uttarakhand resort murder: Amid questions within, BJP may revamp Dhami go...Premium

Count the inexplicable anomalies as we list them. First, some background about labour-market reports in India. The NSSO believes (erroneously) that the ratios obtained are sacrosanct, but not the survey method on which they are based. For example, to derive the important data on jobs, NSSO documents advise users to use the ratios that they provide (labour force participation rate, worker participation rate, etc.) and multiply the ratios by census-based population. Good advice, but how do you believe the building blocks of the survey-based ratios? For example, first the PLFS reports an urbanisation rate (a ratio) in 2017-18 of 29.3 per cent compared to an estimated 31.2 per cent in the NSSO’s own survey for 2011-12. A decline of 2 percentage points, when the reality (yielded by every other observer) is an increase in urbanisation. How did this cross-check escape the NSSO or its oversight body, the National Statistical Commission (NSC)?

Second, the estimate for overall population in 2017-18 is off the charts — it is too low. The NSSO 2011-12 survey

reported a population of 1,088 million; in 2017-18, it reports a population that is 14 million less. On a survey/census ratio (since the NSSO and NSC are fond of ratios) basis, the 2017-18 ratio of 82 per cent is the lowest observed for the last 40 years, if not ever. This is both an Indian and world record for population under-estimation.


Third, what explains this statistical junk? As lucidly pointed out by Avik Sarkar (‘Unemployment in India: The real reason behind low employment numbers’, Financial Express, February 28), it is the sampling technique, stupid. Inexplicably, and seemingly without much consultation with outside experts, the NSSO decided to radically change their sampling method — it was to be now based on educational attainment rather than household per capita consumption. It is one of the biggest statistical no-nos to select a sample based on a criteria that affects the behaviour you are trying to explain. Education affects labour market participation — the last thing you want to do is “select” on the basis of education. James Heckman got a Nobel prize in 2000 for obtaining reliable results when the sample behaviourally selects itself. And here we are imposing a selection bias.

Simply put, the PLFS 2017-18 survey is a huge statistical embarrassment and someone should inquire as to how it passed all the statistical checks and balances of the experts. My best “recovery” of what happened — the expert masters felt that the radical change in sampling method would not make much difference to the results.

A large number of households with lower educational attainment (with no member of the household having more than the secondary level of education) were missed out in the PLFS survey. These households are likely to be poorer, and more at work than members of the middle class. Missing them means that you are likely to skew the wage pattern in unbelievable directions. For example, poorer casual workers (those working on daily wages, including agricultural wages) show a big decline in their share in the work-force — a 20 per cent decline from 30.5 per cent of the work force in 2011-12 to 24.9 per cent in 2017-18. The higher income salaried workers show a large increase in employment share — from 19.3 per cent in 2011/12 to 22.8 per cent in 2017-18 — again, a near 20 per cent increase. Both these increases are unprecedented and suggest that contrary to the unemployment rate jump, the economy is doing very well.


But wait, there is more to “release” — all puns intended. Let us look at what has happened to the real incomes (wage) increase for the two sets of workers comprising almost half the working population (the rest are self-employed, like farmers, from whom the NSSO abstains from collecting any income data.

The higher income middle-class-salaried workers show a cumulative 18 per cent decline in their real incomes. Again, unprecedented. Well, there can be an explanation — look, everybody is getting educated (the sample selection is based on education), there is excess supply of educated people, and demand hasn’t kept pace. Ostensibly because of GST and demonetisation. But wait, the real wages of the poorer casual workers, the ones who are out of jobs because of demonetisation, GST and lack of demand, have increased by 29 per cent.

If true, the PLFS data would indicate that the Modi government has unleashed the most inclusive growth anywhere, and at any time in human history. There should be a special Nobel Prize for this achievement. Forget farm distress — you are not counting income right. All the farmers have become farm labourers, in order to enjoy these high wages. And inequality-wallahs please note — Modi has engineered the highest inequality decline in history — and in just five years.

The PLFS results can be described in two words — shockingly untrue. The statistical masters who passed the PLFS data report with a “good house-keeping seal” should hang their heads in statistical shame. All the data are presented in the table — all the data are from the much leaked (and vaunted) media reports. Please, will any of the concerned citizens who have voiced shame, shame, censorship of government data, decline of public institutions, explain the array of evidence presented.

My view is that the PLFS data is a statistical embarrassment to India and the NSSO. We all complain about how institutions have gone down, and there are myriad complaints about the authenticity, or transparency, of Indian statistical data. The NSSO was a much-respected institution, as was the National Statistical Commission, which was first formed in 2006 under the chairmanship of the late Suresh Tendulkar (I was proud to serve under his leadership for the first three years of the Commission, 2006 to 2009).


Sadly, and extremely unfortunately, many, no, make it most, no make it all Indian institutions are still operating with the same mind-set and technology and outlook that the pioneers did 70 years ago. The world, and technology, has moved on. But not us.

The writer is contributing editor, The Indian Express and Consultant, Network 18.

Views are personal

First published on: 13-04-2019 at 12:34:35 am
Next Story

No rift between sons, Tej Pratap will come home soon: Rabri Devi

Latest Comment
Post Comment
Read Comments