Visualizing the PyPI Pandas download statustics using Tableau - Downloads by location

For some background, read my previous posts on the topic - simple queries to start understanding Pandas downloads (https://rahulporuri.blogspot.in/2017/01/on-whos-downloading-pandas.html),  building up queries incrementally to understand Pandas downloads (https://rahulporuri.blogspot.in/2016/12/pandas-download-statistics-pypi-and.html) and using Tableau Public to visualize Pandas downloads by version over the last 6 months (https://rahulporuri.blogspot.in/2017/01/visualizing-pypi-pandas-download.html)

Having visualized the total number of downloads per version per month of Pandas in the last post, we now come to the total number of downloads per month by location.

The relevant query is

SELECT
  STRFTIME_UTC_USEC(timestamp, "%Y-%m") AS yyyymm,
  country_code,
  COUNT(*) as total_downloads,
FROM
  TABLE_DATE_RANGE(
    [the-psf:pypi.downloads],
    DATE_ADD(CURRENT_TIMESTAMP(), -6, "month"),
    CURRENT_TIMESTAMP()
  )
WHERE
  file.project = 'pandas'
GROUP BY
  country_code,
  yyyymm
ORDER BY
  total_downloads DESC

and the resultant data can be found at https://drive.google.com/file/d/0BxwQdgnuTo6JNEhqeHZnckpla1U/view?usp=sharing.

I visualized the above dataset using Tableau Public and I ended up with -

Popular posts from this blog

Animation using GNUPlot

Arxiv author affiliations using Python

Thoughts on the National Deep Tech Startup Policy