This paper uses a template-based information extraction algorithm. After the rule generator identifies the separators of the target entities, the template generator configures the segmentation marks in the template, and the information extractor then extracts the required information according to the generated template. The Web information intelligent extraction system is then tested and analyzed. Comparison with other information extraction systems shows that this system extracts information from various web pages quickly and accurately according to the template, with high extraction accuracy, a high recall rate, and high extraction efficiency.
Key words
computer network/Web information/intelligent information extraction system
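As a rough illustration of the template-based pipeline summarized in the abstract, the following minimal Python sketch generates a template from one labeled sample page and then applies it to a new page of the same layout. It is not the system described in the paper. The names `Template`, `generate_template`, and `extract` are hypothetical, and the rule generator step is simplified to an assumption: the segmentation marks are taken to be the fixed-width text immediately before and after each hand-labeled value, and both marks are assumed to be non-empty.

```python
import re
from dataclasses import dataclass


@dataclass
class Template:
    # Maps a field name to the (left mark, right mark) pair around its value.
    marks: dict


def generate_template(sample_page, labeled_values, width=20):
    """Hypothetical template generator.

    For each labeled value found in the sample page, store up to `width`
    characters before and after it as the segmentation marks.
    """
    marks = {}
    for field, value in labeled_values.items():
        start = sample_page.find(value)
        if start == -1:
            continue  # value not present in the sample page; skip the field
        end = start + len(value)
        left = sample_page[max(0, start - width):start]
        right = sample_page[end:end + width]
        marks[field] = (left, right)
    return Template(marks)


def extract(page, template):
    """Hypothetical information extractor.

    Returns the shortest text found between each field's left and right marks.
    """
    result = {}
    for field, (left, right) in template.marks.items():
        pattern = re.escape(left) + r"(.*?)" + re.escape(right)
        match = re.search(pattern, page, flags=re.DOTALL)
        if match:
            result[field] = match.group(1)
    return result


# Illustrative pages invented for this sketch, not data from the paper.
sample = '<h1 class="title">Blue Kettle</h1><span class="price">19.99</span>'
template = generate_template(sample, {"title": "Blue Kettle", "price": "19.99"})

new_page = '<h1 class="title">Red Teapot</h1><span class="price">24.50</span>'
extracted = extract(new_page, template)
```

Using literal surrounding text as marks keeps the sketch short, but it only works for pages that share the sample's markup exactly; a real rule generator would presumably learn more general separators.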