Throughout the lab, you'll occasionally see questions marked with an asterisk ('*'). To get credit for this lab, you'll post your answers to Piazza.

Lab 4 pt. 1: SPLOMs

Scatterplot matrices are a useful tool to help visualize relationships between multiple quantitative (numerical) variables.

In Tableau, you create a scatterplot matrix by placing multiple measures on the Columns and Rows shelves. Today, we'll create a scatterplot matrix comparing five dimensions

  1. Start by downloading the CSV file containing the College dataset, which contains statistics for a large number of US Colleges from the 1995 issue of US News and World Report.

  2. Load the dataset into Tableau and explore the dimensions, which look something like this:

  3. Descriptions for each of the dimsensions are below:
  4. Let's try dragging the Apps, Accept, and Enroll measures onto both the rows and columns shelves to see how they relate to one another. Don't forget to turn off Analysis>Aggregate Measures!

  5. Notice that there is a positive correlation between all pairs of these variables. This makes sense: colleges that receive large numbers of applications may in fact be larger institutions overall, and so would be likely to accept more students. Similarly, an institution that accepts more students intuitively would have a larger number of students enroll.

    Bear in mind: these are absolute comparisons, meaning we are looking at the actual number of students, rather than a percentage.


  6. *Piazza Q1:* Create calculated fields to convert the accept and enroll measures to percentages:

    Now create a SPLOM using these measures. What is different?


Lab 4 pt. 2: Parallel Coordinates

Parallel coordinates is a common way of visualizing and exploring high-dimensional, multivariate data. While Tableau unfortunately doesn't directly support Parallel Coordinates, we'll use a little bit of magic to convince it to do our bidding.