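- Design of a Korean Speech Recognition Platform (한국어 음성인식 플랫폼의 설계)
- ㆍ Authors
- 권오욱 (Kwon, Oh-Wook), 김회린 (Kim, Hoi-Rin), 유창동 (Yoo, Changdong), 김봉완 (Kim, Bong-Wan), 이용주 (Lee, Yong-Ju)
- ㆍ Journal
- 말소리 (Malsori)
- ㆍ Volume/Issue
- 2004 | Vol. 51, No. 1 | pp. 151-165 (15 pages)
- ㆍ Publisher
- 대한음성학회
- ㆍ File information
- Periodical | PDF text
- ㆍ Subject area
- Other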
A Korean speech recognition platform is designed for educational and research purposes. It is based on an object-oriented architecture, so individual modules can be modified easily and researchers can readily evaluate the performance of a recognition algorithm of interest without building a complete recognizer first. The platform includes the following modules: noise reduction, end-point detection, mel-frequency cepstral coefficient (MFCC) and perceptual linear prediction (PLP) based feature extraction, hidden Markov model (HMM) based acoustic modeling, n-gram language modeling, n-best search, and Korean language processing. The decoder can handle both lexical search trees for large-vocabulary speech recognition and finite-state networks for small-to-medium-vocabulary speech recognition. In the first (forward) search stage it performs a word-dependent n-best search with a bigram language model; in the second stage it extracts a word lattice and rescores each lattice path with a trigram language model.
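The second decoding stage described in the abstract (rescoring word-lattice paths with a trigram language model) can be illustrated with a minimal Python sketch. This is not the platform's code: the lattice layout, the function names (`enumerate_paths`, `trigram_logprob`, `rescore`), the toy scores, and the fixed floor used for unseen trigrams in place of proper backoff smoothing are all assumptions made for illustration. Exhaustive path enumeration is also only practical for very small lattices.

```python
import math
from typing import Dict, Iterator, List, Tuple

# Hypothetical toy lattice: node -> list of (next_node, word, acoustic_log_prob).
Lattice = Dict[int, List[Tuple[int, str, float]]]
TrigramLM = Dict[Tuple[str, str, str], float]


def enumerate_paths(lattice: Lattice, start: int, end: int) -> Iterator[Tuple[List[str], float]]:
    """Yield (words, acoustic_score) for every start-to-end path of an acyclic lattice."""
    stack = [(start, [], 0.0)]
    while stack:
        node, words, score = stack.pop()
        if node == end:
            yield words, score
            continue
        for nxt, word, ac in lattice.get(node, []):
            stack.append((nxt, words + [word], score + ac))


def trigram_logprob(words: List[str], lm: TrigramLM, floor: float = math.log(1e-4)) -> float:
    """Sum log P(w_i | w_{i-2}, w_{i-1}); unseen trigrams get a fixed floor (assumed, no backoff)."""
    padded = ["<s>", "<s>"] + words + ["</s>"]
    return sum(lm.get(tuple(padded[i - 2:i + 1]), floor) for i in range(2, len(padded)))


def rescore(lattice: Lattice, start: int, end: int, lm: TrigramLM,
            lm_weight: float = 1.0) -> Tuple[float, List[str]]:
    """Return (combined_score, words) of the best path after adding the weighted trigram score."""
    return max(
        (ac + lm_weight * trigram_logprob(words, lm), words)
        for words, ac in enumerate_paths(lattice, start, end)
    )


# Toy data (made up): the acoustic scores slightly favor "I scream",
# while the trigram model favors "ice cream".
toy_lattice: Lattice = {
    0: [(1, "ice", -1.0), (2, "I", -0.9)],
    1: [(3, "cream", -1.0)],
    2: [(3, "scream", -1.0)],
}
toy_lm: TrigramLM = {
    ("<s>", "<s>", "ice"): math.log(0.5),
    ("<s>", "ice", "cream"): math.log(0.9),
    ("ice", "cream", "</s>"): math.log(0.9),
}

best_score, best_words = rescore(toy_lattice, 0, 3, toy_lm)
```

In this toy case the path preferred by the acoustic scores alone is overturned once the trigram scores are added, which is the purpose of a second rescoring pass. A practical decoder would typically use dynamic programming over the lattice and a smoothed language model rather than enumerating every path.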