On the clustering of huge categorical data

On the clustering of huge categorical data
On the clustering of huge categorical data

ㆍ 저자명: Kim. Dae-Hak
ㆍ 간행물명: 한국데이터정보과학회지
ㆍ 권/호정보: 2010년|21권 6호|pp.1353-1359 (7 pages)
ㆍ 발행정보: 한국데이터정보과학회
ㆍ 파일정보: 정기간행물|ENG|
PDF텍스트
ㆍ 주제분야: 기타

이 논문은 한국과학기술정보연구원과 논문 연계를 통해 무료로 제공되는 원문입니다.

서지반출

기타언어초록

Basic objective in cluster analysis is to discover natural groupings of items. In general, clustering is conducted based on some similarity (or dissimilarity) matrix or the original input data. Various measures of similarities between objects are developed. In this paper, we consider a clustering of huge categorical real data set which shows the aspects of time-location-activity of Korean people. Some useful similarity measure for the data set, are developed and adopted for the categorical variables. Hierarchical and nonhierarchical clustering method are applied for the considered data set which is huge and consists of many categorical variables.

키워드

Categorical variable clustering hierarchical clustering huge data

다운URL