Learning Objectives

  • gain programming skills in RStudio
  • gain skills in data wrangling techniques, including combining data across a variety of sources and formats, data harmonization, and creation of new variables
  • create effective visual displays of complex information using R and Tableau
  • create interactive visual presentations using Tableau

Case Study Goals

Using global health data from the World Bank, CIA Factbook, or a complex data source of your own choosing,

  • produce compelling R graphics communicating an aspect of our world in data
  • create a Tableau dashboard interactive visual presentation to illustrate an aspect of our world in data
  • communicate your graphical findings in a dynamic class presentation

Data

  • North Carolina Birth Records: these data cannot be shared beyond class participants. Please see Sakai (Resources) for the data and related documentation, and do not disseminate the data any further.
  • Worldwide country-level data on maternal and infant outcomes: publicly available

  • popDF.RData

  • infMortDF.RData

  • codeMapDF.RData

  • allCtryData.RData

  • full.RData

Resources

This case study draws on materials developed in Chapter 11 of Data Science in R: A Case Studies Approach to Computational Reasoning and Problem Solving by Nolan and Temple Lang (free e-book through Duke Libraries)

Use these instructions to download Tableau.

Tableau Cheat Sheet

Fancy Tableau Options

Reports

  • Interim report: outline the general story you wish to tell graphically. Data sources and variables to be used in the data visualizations, along with short descriptions (e.g., captions) for planned visualizations, should also be included. Maximum of 3 pages.

  • Final report: in-class presentation (5 minutes), R/Tableau visualizations, and a two-page document that clearly references data sources, provides a rationale for the data chosen to tell the story, and summarizes the story told in the graphics.