- N-gram Based Spoken Document Retrieval Robust to Phoneme Recognition Errors
- ㆍ Authors
- Lee, Su-Jang (이수장); Park, Kyung-Mi (박경미); Oh, Yung-Hwan (오영환)
- ㆍ Journal
- Malsori (말소리)
- ㆍ Volume/Issue
- 2008 | Vol. 67, No. 1 | pp. 149-166 (18 pages)
- ㆍ Publisher
- The Korean Society of Phonetic Sciences and Speech Technology (대한음성학회)
- ㆍ File information
- Periodical | PDF text
- ㆍ Subject area
- Other
In spoken document retrieval (SDR), subword units (typically phonemes) are used as indexing terms to avoid the out-of-vocabulary (OOV) problem. This makes the indexing and retrieval process independent of any vocabulary, and it requires only a small corpus to train the acoustic model. However, the subword indexing approach has a major drawback: it shows higher word error rates than a large vocabulary continuous speech recognition (LVCSR) system. In this paper, we propose a probabilistic slot detection and n-gram based string matching method for phone-based spoken document retrieval to overcome the high error rates of the phone recognizer. Experimental results show a 9.25% relative improvement in mean average precision (mAP) and a 1.7-fold speedup compared with the baseline system.
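As a rough illustration of why n-gram matching tolerates phone recognition errors, the following minimal Python sketch scores a phone-level query against a recognized phone sequence by n-gram overlap. It is not the method proposed in the paper: the probabilistic slot detection step is omitted, and the function names (`phone_ngrams`, `ngram_match_score`), the scoring formula, and the phone sequences are all hypothetical choices made for this example.

```python
from collections import Counter


def phone_ngrams(phones, n=2):
    """Return a multiset (Counter) of the n-grams in a phone sequence."""
    return Counter(tuple(phones[i:i + n]) for i in range(len(phones) - n + 1))


def ngram_match_score(query, document, n=2):
    """Fraction of the query's n-grams that also occur in the document.

    Assumed scoring rule for illustration only; the abstract does not
    specify the paper's actual matching function.
    """
    query_grams = phone_ngrams(query, n)
    if not query_grams:
        return 0.0  # query shorter than n phones: nothing to match
    doc_grams = phone_ngrams(document, n)
    overlap = sum(min(count, doc_grams[gram]) for gram, count in query_grams.items())
    return overlap / sum(query_grams.values())


# Hypothetical phone sequences: the second document contains one
# substitution error ("p" recognized as "b") inside the query region.
query = ["s", "p", "iy", "ch"]
doc_exact = ["dh", "ax", "s", "p", "iy", "ch", "w", "ax", "z"]
doc_error = ["dh", "ax", "s", "b", "iy", "ch", "w", "ax", "z"]

print(ngram_match_score(query, doc_exact))  # 1.0
print(ngram_match_score(query, doc_error))  # 0.333... (1 of 3 bigrams survives)
```

An exact match on the whole phone string would reject the second document outright, whereas the n-gram score degrades gradually: a single substitution removes only the n-grams that span the erroneous phone.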