- 웹 페이지 비교통합 기반의 정보 수집 시스템 설계 및 개발에 대한 연구
- ㆍ 저자명
- 장진욱,Jang. Jin-Wook
- ㆍ 간행물명
- 한국IT서비스학회지= Journal of the Korea society of IT services
- ㆍ 권/호정보
- 2014년|13권 1호|pp.147-159 (13 pages)
- ㆍ 발행정보
- 한국IT서비스학회
- ㆍ 파일정보
- 정기간행물| PDF텍스트
- ㆍ 주제분야
- 기타
Recently, the quantity of information that is accessible from the Internet is being dramatically increased. Searching the Web for useful information has therefore become increasingly difficult. Thus, much research has been done on web robots which perform internet information filtering based on user interest. If a web site which users want to visit is found, its content is searched by following the searching list or Web sites links in order. This search process takes a long time according as the number of page or site increases so that its performance need to be improved. In order to minimize unnecessary search with web robots, this paper proposes an efficient information collection system based on compare and merge method. In the proposed system, a web robot initially collects information from web sites which users register. From the next visit to the web sites, the web robot compares what it collected with what the web sites have currently. If they are different, the web robot updates what it collected. Only updated web page information is classified according to subject and provided to users so that users can access the updated information quickly.