Creating color-coded scatterplots in Excel: a nightmare

Scatterplots are extremely popular and useful graphical displays for examining the relationship between two numeric variables. They get even better when we add the use of color/hue and shape to include information on a third, categorical variable (or we can use size to include information on an additional numerical variable, to produce a “bubble chart”). For example, say we want to examine the relationship between the happiness of a nation and the percent of the population that live in poverty conditions — using 2004 survey data from the World Database of Happiness. We can create a scatterplot with “Happiness” on … Continue reading Creating color-coded scatterplots in Excel: a nightmare

Data Exploration Celebration: The ENBIS 2009 Challenge

The European Network for Business and Industrial Statistics (ENBIS) has released the 2009 ENBIS Challenge. The challenge this time is to use an exploratory data analysis (EDA) tool to answer a bunch of questions regarding sales of laptop computers in London. The data on nearly 200,000 transactions include 3 files: sales data (for each computer sold, with time stamps and zipcode locations of customer and store), computer configuration information, and geographic information linking zipcodes to GIS coordinates. Participants are challenged to answer a set of 11 questions using EDA. The challenge is sponsored by JMP (by SAS), who are obviously … Continue reading Data Exploration Celebration: The ENBIS 2009 Challenge