TEDxKinda Project: Identifying Clusters


Your challenge is to cluster data in more than one way. Using Google’s Public Data Explorer, explore a few datasets and try to identify clusters within them. If you have chosen datasets that are not in Google’s collection, you will have to use one of the other Tools for Big Data Analysis, which may be more challenging but also more fruitful for your research.

First, take a screenshot of a data visualization in which the data points form visible clusters, and then draw lines on the screenshot that divide the plane and separate those clusters. You should cluster your data in at least two different ways, so use two different datasets if a single dataset cannot be clustered in two ways. Choose datasets that are related to your TEDxKinda topic, so that you can apply your work to your final presentation.
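
The assignment only asks you to draw lines by hand on a screenshot, but the same idea can be sketched in code if you are working with raw data in another tool. The Python example below is a minimal illustration, not a required method: it groups points by which side of each dividing line they fall on. The function names, the sample points, and the two lines are all hypothetical and chosen only to show how one dataset can be clustered in two different ways.

```python
from collections import defaultdict


def side_of_line(point, line_start, line_end):
    """Return +1, -1, or 0 for a point left of, right of, or on the line.

    The sign of the 2D cross product tells us which side of the directed
    line (line_start -> line_end) the point lies on.
    """
    (px, py), (ax, ay), (bx, by) = point, line_start, line_end
    cross = (bx - ax) * (py - ay) - (by - ay) * (px - ax)
    return (cross > 0) - (cross < 0)


def cluster_by_lines(points, lines):
    """Group points by the combination of sides they occupy for every line."""
    clusters = defaultdict(list)
    for point in points:
        key = tuple(side_of_line(point, start, end) for start, end in lines)
        clusters[key].append(point)
    return dict(clusters)


# Hypothetical data points (for example, two indicators per country).
points = [(1, 2), (2, 1), (8, 9), (9, 8), (2, 9)]

# Two hypothetical dividing lines, each given by two points on the line.
diagonal = ((0, 10), (10, 0))   # the line x + y = 10
vertical = ((5, 0), (5, 10))    # the line x = 5

by_diagonal = cluster_by_lines(points, [diagonal])
by_vertical = cluster_by_lines(points, [vertical])

for name, result in (("diagonal", by_diagonal), ("vertical", by_vertical)):
    print(name)
    for key, members in sorted(result.items()):
        print("  side", key, "->", members)
```

In this sketch the point (2, 9) joins a different group depending on which line is drawn. That kind of difference is what your explanation and hypothesis should address.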


Submit a document (e.g., .doc or .pdf) that includes the following items:

  1. A proper heading (including names, date, assignment, and title).
  2. The images of clustering (see example below).
  3. An explanation of how and why you clustered the data.
  4. A hypothesis detailing what phenomenon the data clusters represent.