Vaccination Coverage

Vaccination Coverage

In the chart above, each dot represents a country. The size of the black outer circle represents the total population. The filled inner circle represents the population vaccinated (which received at least one dose), in absolute numbers.

The data source is Our World in Data. In this source, not all countries have data on the vaccinated population, only on the number of doses administered. In these cases, the population vaccinated with at least one dose and the population with complete vaccination are estimated using linear regression with the ratio of total vaccinations and population*. Data from the last available day for each country are used. Only countries with more than 1 million inhabitants are shown.

Charts posted daily on Twitter at @robertodepinho

Other graphics available at: Covid-19 graphics

* In earlier versions, the fully vaccinated were estimated based on the median of the ratio between applied doses and covered population observed in other countries. The vaccinated were estimated based on the assumption that it takes 2 doses to fully vaccinate a person. This second measure would thus be underestimated in those countries that use the Janssen vaccine.

Posted by Roberto de Pinho, 0 comments
Five Indicators

Five Indicators

During the past Brazilian presidential election campaign, a friend asked me to choose a few indicators to evaluate the next government. There is no right answer to this question and my answer reveals my personal views on the matter [of evaluating a government].

I chose five:

  1. GINI index;
  2. Human Development Index;
  3. Unemployment, total (% of total labor force);
  4. Intentional homicides (per 100,000 people);
  5. medianGDP 1.

These would roughly cover five chosen dimensions: inequality, human development, unemployment, violent crime and personal safety and inequality-adjusted income per capita.

For each one of these dimensions there are alternatives. I simply chose the ones I am most used with and starting at my international indicators source of choice: World Development Indicators from The World Bank. Only data for HDI comes from another source, the UN.

Using these international sources has its advantages: data is available for multiple countries and goes through checking so that international comparisons are feasible.

On the down side, data is usually only available for each year and sometimes with quite big delays. One option is to a find local alternative that might not be a perfect match for the chosen indicator but may accurately portray trends.

For each of these indicators, I intend to, but do not promise to:

  • build a chart and graph commenting on past behavior;
  • find a short term alternative and post updates;
  • comment on new available data.

There should be a new post every month. Perhaps more.

I thank Camilo Telles and René Dvorak for their comments and suggestions.

1. from WDI indicators: GDP per capita, PPP (constant 2011 international $) (NY.GDP.PCAP.PP.KD) and GINI (SI.POV.GINI).

Photo by Isaac Smith on Unsplash

Posted by Roberto de Pinho, 0 comments
About spoons, shovels and looms

About spoons, shovels and looms

The story plays like this: Milton Friedman is visiting some country considered to be backward, when he sees some huge construction project being built with shovels instead of tractors and other pieces of machinery. Asking why, he learns that this is to increase the number of workers needed for the job, as providing jobs is one of the main objectives of that government endeavor.  He then goes on to suggest, mockingly, that workers should be using spoons instead of shovels, thus the need for workers would be even greater.

I have seen this anecdote used to ridicule many government interventions in the economy, from stimulus projects to labor regulations, if not every government intervention in the economy. The problem with interventions would be that they sacrifice efficiency to no one’s benefit, that they would be as pointless as building a canal with spoons.

The problem with this line of thought is that it tries to group together and equally discredit actions that have very different profiles when talking about efficiency, labor productivity and ultimately income distribution.

To show this, let’s frame our story a little differently, adding some more color as we go along.  We have three options here: spoons, shovels and tractors, three different final costs for the construction project and three different number of worker hours needed for its construction, but the same outcome: a completed canal or dam or whatever.

Another thing that remains constant in this story, and goes without saying, is how much work is performed by each of the workers involved.  This is never mentioned. Nonetheless, a constant amount of hours of work per worker is required for the story to work.

We have deeply embedded in us how many hours of work a worker should do every day or week. Think about somewhere around 40 hours per week and see if it resonates with you. Now let´s remove this constraint and rethink our story: let’s use tractors and trucks and reduce the number of hours each worker has to do.

From the comfort of one’s office armchair, the spoons option and the option of reduced working hours might look the same, as they have  more or less the same costs, and much higher than the choice of using trucks and the minimum amount of workers. Nonetheless, from the workers point of view, and we also might say, for society as a whole, these are very different scenarios. Now  a worker will have time to take care of his or her children, or learn something new, or just have a better, healthier life. Our analyst might suspect this looking at an index, labor productivity per hour worked, which shows very different behaviours for the spoons and the trucks options.  In fact, labour productivity per hour would roughly be the same whenever using tractors and trucks, either having few workers working full time or many workers working a few hours per week.

So, concerning the option of reduced working hours, we can simultaneously say that it is as productive as employing trucks and few workers, and that it is as costly as the spoons option. How can both these apparently incompatible assertions be true?

To disentangle this, we need to add the perspective of the company running the project. For the company, adding trucks and tractors to a project means increasing productivity and allowing for hiring less workers, thus increasing profits.  In this scenario, productivity gains are kept by the company. If we say now that each worker can only work a reduced number of hours per week, it is still in the best interest of everyone involved to employ machinery, but now the derived productivity gains are shared with the workers, which is reflected in higher costs from the firms perspective.

What is hidden in the original story is a discussion on income distribution. Yes, there has been plenty of regulations that certainly hurt productivity, such as the need for operators even in automated elevators1, but one cannot say the same of labour market interventions such as setting a maximum workweek or establishing a minimum wage. What they most importantly do is to level the play field between workers and employers. Yes, if your enterprise’s quality and competitiveness are only based on having cheap labor, you might be driven out of business. If not, you have other options, such as having a better pay structure.

Minimum wage and other work related regulations must be discussed in tandem with labour productivity and corporate profit behaviour, as well as trends in unemployment. To disregard these in evaluating  the minimum wage is to miss the point.

This debate is not new, as it is also not new the opposition to  labor regulation, implying the ruin of the industry if it is adopted. This excerpt is from the Oxford Economist Nassau Senior, in 1837, opposing a limit of 10 hour work day at factories:

“I have no doubt, therefore, that a ten hours’ bill would be utterly ruinous. And I do not believe that any restriction whatever, of the present hours of work, could be safely made. … The manufacturer is tired of regulations?what he asks is tranquility?implora pace.”

As we all know, despite those dire warnings, the United Kingdom economy became everything but ruinous.

I thank Beto Boullosa, Camilo Telles and Walter Hupsel for their comments and suggestions.

1. Lei Municipal 1.626/90 (Rio de Janeiro)

Versão em Português / Portuguese Version

Photo by Mari Potter on Unsplash

Posted by Roberto de Pinho, 0 comments
The World Cup as a flow map

The World Cup as a flow map

A Flow Map/Sankey Diagram showing the complete results from the World Cup model:

The chart has alternating nodes (blue bars) for teams and matches. Size of flows are proportional to the chances of a country being part of a match (from country to match bar) or proportional to the chances of winning a match (from match to country). Numbers after a country name are the chances of that National Team proceeding to the next phase. From  the quarter finals onwards, only teams with more than 1% chance of winning the World Cup are presented individually. The others are grouped under “others”.

A more complete version of the diagram, which takes a while to load, can be found here.

If it does not load on your device, you can look at a screen capture of it here.

This is the table of country codes:


Thanks to Viviane Malheiros for comments and suggestions.

Posted by Roberto de Pinho, 0 comments

The mean, the median and the GDP – part II


A version of this post was originally published in Portuguese as a guest post at Walter Hupsel’s blog On The Rocks @ Yahoo! Brasil. This continues from The mean, the median and the GDP – part I.

Using simple arithmetic, the GINI index allows for converting the GDP per capita into the median GDP, assuming that incomes follow a Pareto Distribution. For example, Namibia’s GDP per capita for 2011 is USD 6,3261, however, taking into account its 63.90 GINI index, the median GDP is calculated at USD 2,392 . That is a huge difference.

The equation that gives the median GDP, which perhaps would be more appropriately called a GINI adjusted GDP per capita, is:

\begin{equation} \label{eq:medianGDP}

medianGDP = \frac{\sqrt[\alpha]{2} \times (\alpha-1)}{\alpha}\times GDP \textit{ per capita}\text{, where } \alpha = \frac{1}{2\times GINI}+\frac{1}{2}


If you tolerate an error up to about 6% in relation to what is expected given a Pareto distribution, you can simply use $(1- GINI)\times GDP \textit{ per capita}$.

Now let’s take a look at Ukraine, with a GDP per capita that is equivalent to Namibia’s in 2011, at USD 6,365. Ukraine has a much better GINI index of 25.62, yielding a median GDP of USD 5,000, more than twice that of Namibia. This is much more consistent with Ukraine’s High human development and Namibia’s Medium human development, according to the Human Development Index, where they stand at the 78th and 128th positions, respectively, and not at the same position, as suggested by GDP per capita.


Probability density curves for simulated data with same GDP per capita as mean but much different GINI indexes. Uses log scale.

Median GDP can also provide a richer perspective on the progression of GDP of a country and its impact on the population. The USA saw its GDP per capita grow 74% from 1980 to 2012, while its median GDP or mGDP grew somewhat less, at 52%2. Looking at the period from 2007 to 2012, a period encompassing the Great Recession, one gets a picture of full recovery using GDP per capita, while mGDP would show a decrease of almost 3% in the period, still an improvement over the peak of crisis in 2009, but not quite yet the full recovery shown by GDP per capita.


Progression of GDP per capita and median GDP for the USA having values for each variable for year 2007 as indexes (=100).

As with any indicator, this median GDP measure has its shortcomings. The GINI index, which is required for computation, has lower availability than GDP per capita, sometimes only at 10 year intervals at the WDI database . Nonetheless, some procedures could be adopted to minimize this problem. Since GINI fluctuates somewhat less between years in a country than across countries in a year, using the last available year data can still yield better results for comparing multiples countries than the raw GDP per capita measure . Nowcasting procedures could be used on GINI data available at larger intervals and yet produce nicer long term views of the economy. For countries with no GINI index data at all, mGDP could be set at 63% of GDP per capita, assuming the median GINI index of 41. Its adoption will certainly require some work to improve data availability, quality and comparability, a challenge even for the ever present GDP, as Bill Gates made the case recently .

All of us who make a living out of statistics know that they can become an adverse influence on policy. When we focus on GDP per capita, we are taking into account a non-existing person, focusing on a measure that can improve regardless of what happens to the bulk of the population (i.e., the mean actually represents nobody instead of the average of everyone). We should be focusing instead on the mythical average Joe or Joana or Tomihiro or Neo, the one figure that divides the population in the middle, the one that only changes if a good chunk of the population does, and that’s what mGDP can show us. If we start to see this number on the home page of World Bank, or brilliantly promoted by Roslings’ Gapminder, or perhaps on the cover of The Guardian, perhaps we can hope that policies may be at least a little bit diverted towards the bulk of the population of our countries.

Some data tables, R scripts and a view of similar and equivalent approaches is next on this series. I kindly thank  comments and suggestions received from Andre Luchine, Beto Boullosa, Camilo Telles, Eduardo Viotti, Emilia Spitz, Joniel da Silva, Leonardo Fialho, René Dvorak, Vini Pitta and Walter Hupsel.

1. Unless otherwise noted, all figures from World Development Indicators, access on December, 31st , 2013. Indicators: GDP per capita, PPP (constant 2005 international $): NY.GDP.PCAP.PP.KD; GINI:SI.POV.GINI, latest available year.

2. GINI data for the USA from FRED: http://research.stlouisfed.org/fred2/series/GINIALLRH , not compatible with WDI data;

Some of the R Scripts for this post

Posted by Roberto de Pinho, 0 comments

The mean, the median and the GDP – part I

A version of this post was originally published in Portuguese as a guest post at Walter Hupsel’s blog On The Rocks @ Yahoo! Brasil.

For over 50 years we have had Huff’s “How to Lie with Statistics” telling us that we should know better. And yet, we still rely on the wrong average in one of our most important tools for evaluating the world and countries’ economies: the GDP per capita. We are still using a mean in places where a median would be the better choice.

Gross Domestic Product (GDP) per capita1 is a simple and effective indicator, coming from a straightforward division of GDP by the population, being used appropriately and elegantly in many instances. Nonetheless, the GDP per capita cannot escape the hard reality that it is a mean average. Thus, as Huff warns us, it has shortcomings and flaws: it fails to capture the effects of inequality in a given reality. From this, one could say that this “mean” average is mean in the sense that it’s cruel and unkind.

But… Can we do better now?

The GINI index has reached mainstream status and is now the de facto standard for measuring income inequality. It measures how much the distribution of income deviates from an even division. A value of 0 in GINI would then be found only in an absolutely egalitarian society where everyone earns exactly the same. In contrast, a value of 100 would imply the entire income earned by a single individual or household. In the real world, it ranges from the low 20s (better distribution) for countries like Denmark and Belarus, to over 60, in the case of such unequal societies as Namibia or Botswana. Brazil’s GINI is 552.

It is time to move on to the median GDP, derived from GDP and GINI. It is a fresh metric that may better reflect both the changes in the economy’s output and trends in income distribution, while accounting for population sizes. It is to the GDP per capita what the median is to the mean.

While the mean is the average of all values in a given set of values (the sum of all values divided by the set size), the median represents the value found in the middle of the set, dividing it in two equally sized halves. Means are affected by extreme values, whereas medians are not. As we can see in the classical “How to lie…” example, an increase in earning of the best paid employee would change mean pay, whereas the median would not move. To move the median requires a change of pay for those in the middle section of the population, those that are neither the wealthiest nor the poorest. This is to say that this median GDP would better reflect the reality of our imaginary average Sally or Joe.

Policies that target economic growth regardless of its [human] costs have support in GDP per capita, which rises even if only a few benefit from these policies. Median GDP would not be fooled or let us be fooled by that.

 to be followed with more detail and examples. I kindly thank  comments and suggestions received from Andre Luchine, Beto Boullosa, Camilo Telles, Eduardo Viotti, Emilia Spitz, Joniel da Silva, Leonardo Fialho, René Dvorak, Vini Pitta and Walter Hupsel.

1. The article could similarly discuss GNI per capita. GDP per capita is chosen due to its wider use;
2. Unless otherwise noted, all figures from World Development Indicators, access on December, 31st , 2013. Indicators: GDP per capita, PPP (constant 2005 international $): NY.GDP.PCAP.PP.KD; GINI:SI.POV.GINI, latest available year.

Posted by Roberto de Pinho, 0 comments