Quandl + NVD3 = Interactive Data Plotter

During my daily work, I do a lot quick data checks on various (energy and development) indicators. For sure the classic way is to go EIA, BP, World Bank or one of the other large, established databases and copy the data into Excel or so and start staring. Nowadays, I would find it actually more convenient – and sometimes faster – to just directly load the data from the databank into a pandas dataframe and do a prettier-than-Excel plot with matplotlib. But if I want to share the data with somebody or make it interactive, this doesn’t cut it. Then I would turn to D3 and NVD3. However, loading the data into D3 is fairly cumbersome, this is not even the hardest part. If you want to eliminate the intermediary step of processing and formatting with pandas, then you have some serious work to do.

Most of the online databases give you the data in either an Excel file and rarely a csv or an XML. The data almost never comes in native Javascript JSON format. And if mining the data was not cumbersome enough, converting it into a D3 – readable format will take up a significant amount of your time. Until now: meet Quandl. Quandl is a data-aggregator that takes numeric data from the large online databases (or individuals) and normalizes them and gives you access in all formats, including JSON. This means it is enough to write a not-so-complicated Javascript data-parser and all you have to do later is to change the Quandl database codes to get pretty plots – a data-blogger’s dream 🙂 This is exactly what I did.

Quandl

Quandl data plotter

Quandl uses short database codes, similar to that of the World Bank to reference their data. These contain the country names usually in a 2 or 3 letter ISO format. So first, you need to grab the country names and their corresponding ISO codes from a public csv file with D3. Then, after doing some background searching among the Quandl data, you can define the short code that you want to load and you can obverse how is the country code embedded in the link structure. Then, using asynchronous JavaScript requests, you can load the desired indicator for the desired country/year. Voila – the bonus great thing about Quandl is that you can also upload (via plugins) and use your own datasets for your visualizations, as long you follow their conventions.

 

Made with d3, nvd3 and Quandl. The main outcome is the NVD3 + Quandl block.

 

From now on, I will start slowly shifting to d3plus, developed by Alexander Simoes at the MIT Media Lab, because of its superiority of handling multiple visualization types, compared to nvd3.

The Geocenter of Formula 1 venues between 1950-2014

Click here if you prefer to read this post on Medium.com (3 minute read).
Click here to look at the data visualization only on visualizing.org.

Recently, there has been a doubt over the venue 2015 German Formula 1 Grand Prix, with rumors about it being completely cancelled. Operators of other historical European venues have recently voiced their increasing concerns about the much higher race commissions new Asian circuits are able to offer to Formula One Group, the sport’s ruling body. Back in 2010, CNN raised the point that Formula 1 was drifting towards Asia, which can be regarded as a business decision for the organization whose principal sources of income are race organizing fees and advertising. Now we have made a map which visualizes that shift over time!

Let us plot on the world map all of the circuits where F1 Grands Prix have been organized between 1950-2014. Then, based on the geographic coordinates of the circuits, we can calculate the geocenter (geographic center of mass) of all venues. Since many of the venues hosted much more than one Grand Prix, we can get a more realistic picture if we can calculate the geocenter in a weighted manner, using the number of F1 races at each venue (interactive plot).

All F1 venues

All Formula 1 venues between 1950-2014 and their geocenter – click for interactive

We can see that the geocenter obtained this way falls roughly on Tunisia. While the overwhelming majority of the races have been organized in Europe (especially before the year 2000), there have been roughly an equal number races in the Americas and Asia+Australia. The Southern hemisphere hosted very few races, with only two circuits in both South Africa and Australia. However, let us take a look at the evolution in time of the F1 racing calendar! We have created an interactive map that displays all of the F1 circuits hosting Grand Prix for a given year, for the period 1950-2014.

Formula 1 venues between 1950-2014

The Geocenter of Formula 1 venues between 1950-2014 – click for interactive

Read More