Computing Method of Chinese Short Text Similarity Based on Part of Speech,Semantic and Word Order Factors
According to the characteristics of Chinese short texts,a method of calculating text similarity is proposed,which combines parts of speech,semantics and word order factors.This method relates the part of speech,meaning and position of words in Chinese short text,and on the basis of cosine formula,through the correlation between the similarity of words of text vectors and the weight of part of speech,this paper improves the method of Chinese short text similarity calculation,and introduces word order similarity to optimize text similarity.The experimental results show that this method has better accuracy and recall than other meth-ods,and is more in line with people's subjective judgment.
Chinese short text similaritypart of speechsemanticsword order