Instructions

Cluster Analytics

Download Cluster Analytics Application

In order to run this application, you need to have JVM already installed on your computer. Consult the Java website to download the appropriate version for your computer.
Click the link/button provided and save the ClusterAnalytics.jnlp file to the desired location on your local computer. Double click the file and say yes to the security box that pops up.
Each time the application updates and this includes the first time you launch the application you need to wait for the update process before proceeding. You also need to allow the digital jar signature, so that, the application can access your local dataset file.

Layout

The User Interface is divided into several parts: Menu, Data panel (top panel on the left), and graphic display panel (on the right).

Loading Data

The Cluster Analytics Application requires data in order for it to run. In order to open a data file that has already been created follow the instructions below. If you want to make your own dataset file, use the format detailed below in the Format section of this page.
Click the 'File'-'Open' menu, and choose your local dataset file.
By clicking the link below you can download the Iris Data set to your local computer.

Sample Iris Data set

Format Data

  • Write each data object in a single line.
  • For each data object(line), write all items divided by comma (without leaving blanks between items).
  • Read example iris dataset file

Using and Understanding the Cluster Analytics App.

  • When the dataset is loaded, choose the cluster algorithm you like. You can switch between four clusters by clicking the tab matching your algorithm. Some of the cluster algorithms may need additional input -
    • Basic K-means: choose the initialize centroid data
    • Bisecting K-means and Agglomerative Hierarchical: set final number of clusters
  • By dragging the slider, you can see the result of each iteration (if it takes more than one iteration for cluster algorithm to get the final cluster result).