음성/음악 판별을 위한 특징 파라미터와 분류기의 성능비교

음성/음악 판별을 위한 특징 파라미터와 분류기의 성능비교

ㆍ 저자명: 김형순,김수미,Kim. Hyung Soon,Kim. Su Mi
ㆍ 간행물명: 말소리
ㆍ 권/호정보: 2003년|46권 1호|pp.37-50 (14 pages)
ㆍ 발행정보: 대한음성학회
ㆍ 파일정보: 정기간행물|
PDF텍스트
ㆍ 주제분야: 기타

이 논문은 한국과학기술정보연구원과 논문 연계를 통해 무료로 제공되는 원문입니다.

서지반출

기타언어초록

In this paper, we evaluate and compare the performance of speech/music discrimination based on various feature parameters and classifiers. As for feature parameters, we consider High Zero Crossing Rate Ratio (HZCRR), Low Short Time Energy Ratio (LSTER), Spectral Flux (SF), Line Spectral Pair (LSP) distance, entropy and dynamism. We also examine three classifiers: k Nearest Neighbor (k-NN), Gaussian Mixure Model (GMM), and Hidden Markov Model (HMM). According to our experiments, LSP distance and phoneme-recognizer-based feature set (entropy and dunamism) show good performance, while performance differences due to different classifiers are not significant. When all the six feature parameters are employed, average speech/music discrimination accuracy up to 96.6% is achieved.

키워드

speech/musci discrimination classifier feature parameter

다운URL