APPLIED STUDY ON TEXT MINING TECHNIQUE TO S&T MANAGEMENT FIELD HOT TOPIC EXTRACTION DIRECTION
The S&T management field hot topic extraction process mainly undergoes three stages:data acquisition and cleaning,information retrieval, and topic analysis. As for hot topic extraction, TF-IDF information extraction algorithm is applied; in terms of topic clustering, agglomerative clustering from concurrence method is applied. By means of hot topic extraction, trend analysis and clustering analysis, the forecast and scientific decision making for field hot work can be realized, which helps promote the government business field information intellectualization and knowledge-driving.