Skip to content

Question on Clustering in Simplified detection of urban types #358

Answered by martinfleis
cdkang asked this question in Q&A
Discussion options

You must be logged in to vote

Hello, this step is standardising values across all columns to ensure they all spread within a roughly the same range of values. The K-Means is based on an Euclidean distance and without this step, characters with large values would overpower those with low.

Do you know any reference book or paper on it?
The whole example is based on this paper.

Fleischmann M, Feliciotti A, Romice O and Porta S (2021) Methodological Foundation of a Numerical Taxonomy of Urban Form. Environment and Planning B: Urban Analytics and City Science, doi: 10.1177/23998083211059835

If you want to read only on standardisation, Wikipedia is great - https://en.wikipedia.org/wiki/Standard_score

Replies: 1 comment 1 reply

Comment options

You must be logged in to vote
1 reply
@cdkang
Comment options

Answer selected by martinfleis
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants