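- Digital Photo Retrieval Method Based on Speech Queries
- ㆍ Authors
- 김태성 (Kim, Tae-Sung), 서영주 (Suh, Young-Joo), 이용주 (Lee, Yong-Ju), 김회린 (Kim, Hoi-Rin)
- ㆍ Publication
- 말소리 (Malsori)
- ㆍ Volume/Issue
- 2006 | Vol. 57, No. 1 | pp. 99-112 (14 pages)
- ㆍ Publisher
- 대한음성학회
- ㆍ File information
- Periodical | PDF text
- ㆍ Subject area
- Other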
In this paper, we introduce two methods for retrieving photos that are accompanied by speech documents. We compare the pattern of a speech query with those of the speech documents recorded in digital cameras, measure their similarities, and retrieve the photos corresponding to the speech documents with high similarity scores. In the first approach, a phoneme recognition scheme is used as a pre-processor for pattern matching; in the second, vector quantization (VQ) and dynamic time warping (DTW) are applied to match the speech query with the documents directly in the signal domain. Experimental results show that the performance of the first approach depends heavily on that of phoneme recognition, while its processing time is short. The second method provides a great improvement in performance. Although its processing time is longer than that of the first method because of DTW, it can be reduced by using approximate methods.
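
The second approach can be illustrated with a minimal sketch, which is not the authors' implementation. The abstract does not specify the acoustic features, the codebook, or the DTW constraints, so the following are all assumptions: frames are small tuples of floats, the codebook is given rather than trained, local cost is the Euclidean distance between codewords, and DTW aligns whole sequences with the cost normalized by the combined length. The names `quantize`, `dtw_distance`, and `retrieve` are hypothetical.

```python
import math


def quantize(frames, codebook):
    """VQ step: map each feature frame to the index of its nearest codeword."""
    return [
        min(range(len(codebook)), key=lambda k: math.dist(frame, codebook[k]))
        for frame in frames
    ]


def dtw_distance(query, doc, codebook):
    """Length-normalized DTW cost between two codeword-index sequences.

    Assumption: whole-sequence alignment with the three standard steps
    (insertion, deletion, match) and no path constraints.
    """
    n, m = len(query), len(doc)
    inf = float("inf")
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = math.dist(codebook[query[i - 1]], codebook[doc[j - 1]])
            cost[i][j] = local + min(
                cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1]
            )
    return cost[n][m] / (n + m)


def retrieve(query_frames, documents, codebook, top_k=1):
    """Return the photos whose speech documents are closest to the query.

    `documents` maps a photo name (hypothetical) to its speech frames.
    """
    q = quantize(query_frames, codebook)
    scored = sorted(
        (dtw_distance(q, quantize(frames, codebook), codebook), photo)
        for photo, frames in documents.items()
    )
    return [photo for _, photo in scored[:top_k]]


# Toy data, invented for illustration only.
codebook = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0)]
documents = {
    "beach.jpg": [(0.1, 0.0), (0.9, 0.1), (1.0, 0.0), (0.1, 0.9)],
    "party.jpg": [(0.0, 1.1), (0.1, 0.9), (0.0, 0.1)],
}
query = [(0.0, 0.1), (1.1, 0.0), (0.0, 1.0)]
best = retrieve(query, documents, codebook)
```

The nested loop is quadratic in the sequence lengths and runs once per stored document, which is consistent with the longer processing time the abstract attributes to DTW. Restricting the warping path to a band around the diagonal is one common approximation; whether the paper uses it is not stated in the abstract.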