Everything should be made as simple as possible, but not simpler
The need to group data
- The world is complex and multidimensional
- Univariate analysis focuses on only one dimension
- Some real-world issues are better understood as multivariate. E.g. (single variable vs. multidimensional concept):
  - Percentage of foreign-born vs. what is a neighborhood?
  - Years of schooling vs. human development
  - Monthly income vs. deprivation
Grouping as simplifying
- Define a given number of categories based on many characteristics (multi-dimensional)
- Find the category where each observation fits best
- Reduce complexity while keeping the relevant information
- Produce easier-to-understand outputs
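The steps above (fix a number of categories, then find the category where each observation fits best) can be sketched with a bare-bones k-means loop. This is only one possible grouping technique, chosen here for illustration. The data values, the variable names and the naive "first k observations" initialisation are all assumptions, and real attributes would normally be standardised first so that no single variable dominates the distances.

```python
import math

def assign(points, centers):
    """For each observation, return the index of the nearest category centre."""
    return [
        min(range(len(centers)), key=lambda k: math.dist(p, centers[k]))
        for p in points
    ]

def update(points, labels, centers):
    """Move each centre to the mean of the observations assigned to it."""
    new_centers = []
    for k, old in enumerate(centers):
        members = [p for p, lab in zip(points, labels) if lab == k]
        if not members:
            new_centers.append(old)  # keep an empty category where it was
        else:
            new_centers.append(tuple(sum(col) / len(members) for col in zip(*members)))
    return new_centers

def kmeans(points, k, n_iter=100):
    """Group multi-dimensional observations into k categories."""
    centers = [tuple(p) for p in points[:k]]  # naive initialisation (assumption)
    labels = assign(points, centers)
    for _ in range(n_iter):
        centers = update(points, labels, centers)
        new_labels = assign(points, centers)
        if new_labels == labels:  # assignments stopped changing
            break
        labels = new_labels
    return labels, centers

# Made-up observations: (years of schooling, monthly income in thousands)
observations = [(6, 1.0), (7, 1.2), (5, 0.9), (14, 3.5), (15, 4.0), (16, 3.8)]

labels, centers = kmeans(observations, k=2)
print(labels)  # [0, 0, 0, 1, 1, 1]
```

Six two-dimensional observations are reduced to a single label each, which is the simpler, easier-to-read output the list describes.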
Types of grouping
- Non-spatial clustering
- Regionalisation
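The difference between the two types can be sketched as follows, assuming regionalisation is meant in its usual sense of clustering with an added requirement that each group be spatially contiguous. The four-area map, the area names and the helper `is_contiguous` are hypothetical and only serve the illustration.

```python
from collections import deque

def is_contiguous(members, neighbors):
    """True if every area in `members` can be reached from any other
    by moving only through neighbouring areas that are also in `members`."""
    members = set(members)
    if not members:
        return True
    start = next(iter(members))
    seen = {start}
    queue = deque([start])
    while queue:
        area = queue.popleft()
        for other in neighbors[area]:
            if other in members and other not in seen:
                seen.add(other)
                queue.append(other)
    return seen == members

# Hypothetical map: four areas in a row, A - B - C - D
neighbors = {"A": ["B"], "B": ["A", "C"], "C": ["B", "D"], "D": ["C"]}

non_spatial_group = ["A", "D"]  # similar attributes, but not adjacent
region = ["A", "B"]             # similar attributes and adjacent

print(is_contiguous(non_spatial_group, neighbors))  # False
print(is_contiguous(region, neighbors))             # True
```

A non-spatial clustering may return a group like `["A", "D"]`, while a regionalisation would only accept groups that pass a check of this kind.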