With the rapid development of the Internet, crawling event information from various media, such as microblogs, post bars, forums, and news websites, has become essential to Internet information processing systems. Faced with these media resources in the era of big data, how to obtain information about events of interest comprehensively and quickly is worthy of further study. We reveal the event constraint effect, which provides a guideline for the structure of event monitoring terms and simplest-event monitoring terms, and we analyze the overlapping relation between simplest-event monitoring terms. We propose a method for reducing event monitoring terms, which reduces the number of monitoring terms used in event search crawling. Taking a municipal regional SaaS platform and a fire control industry SaaS platform as examples, we conduct an experiment with mainstream built-in search engines to evaluate the selection ratio of event monitoring terms and the event crawling efficiency. The experimental results show that the proposed reduction method for event monitoring terms reduces the amount of crawled information and improves the performance of event crawling.
Key words
event crawling/built-in search engines/event constraint effect/event monitoring term reduction