- On the clustering of huge categorical data
- On the clustering of huge categorical data
- ㆍ 저자명
- Kim. Dae-Hak
- ㆍ 간행물명
- 한국데이터정보과학회지
- ㆍ 권/호정보
- 2010년|21권 6호|pp.1353-1359 (7 pages)
- ㆍ 발행정보
- 한국데이터정보과학회
- ㆍ 파일정보
- 정기간행물|ENG| PDF텍스트
- ㆍ 주제분야
- 기타
Basic objective in cluster analysis is to discover natural groupings of items. In general, clustering is conducted based on some similarity (or dissimilarity) matrix or the original input data. Various measures of similarities between objects are developed. In this paper, we consider a clustering of huge categorical real data set which shows the aspects of time-location-activity of Korean people. Some useful similarity measure for the data set, are developed and adopted for the categorical variables. Hierarchical and nonhierarchical clustering method are applied for the considered data set which is huge and consists of many categorical variables.