Multi-classifier for Car Review Sentiment Classification Based on ER Rule
With the rapid development of the next-generation information technology,more and more users are accustomed to sharing personal experience and opinions through the Internet,such as online reviews of book,movie,product usage experience and so on,which always contain positive and negative sentiment of users.Text sentiment analysis aims to use computer technology to detect and extract diverse sentiments,attitudes,opinions and other perceptual information in text documents,thereby converting qualitative user expressions into quantifi-able data to serve decision-making and strategic planning.For users,these product reviews can provide them with sufficient information that will help them make informed purchasing decisions to the greatest extent and mini-mize the degree of regret after consumption.For manufacturers,consumers’needs can be acquired timely through the reviews,thus adjusting their marketing strategies in a targeted manner and improving the design and quality of products.Currently,due to the exponential growth in the number of these review texts on the Internet,traditional manual analysis methods can hardly satisfy the rapidly changing market demand.Deep learning-based methods may fall into the dilemma of weak interpretability.Therefore,how to automatically obtain users’senti-ment information from numerous comments via a rational and intelligent way is a challenging issue.For the problem of sentimental dichotomy on car commentary corpus,a text sentiment classification method based on ER rule multi-classifier fusion is proposed in this paper.Firstly,the research explores sentiment feature construction by examining the classification effects of various feature models,including unigram,bigram and unigram+bigram.The CHI Square test is adopted for text feature extraction.This method is particularly effective in managing high-dimensional feature spaces,facilitating more accurate sentiment classification by highlighting the most relevant features for analysis.Secondly,the improved TF-IDF method is proposed to enhance the discrimination of terms relevant to sentiment analysis.It incorporates the CHI Square values to assess the distinc-tiveness of terms across different document classes,and refines the traditional TF-IDF calculation.This adjust-ment accounts for the distribution of terms within categories,making the sentiment-related terms more impactful for classification tasks.Thirdly,on the basis of fully considering the weights and reliabilities of different classifiers,the ER rule is introduced to fuse multiple classifiers for text sentiment polarity analysis in order to integrate the advantages of different classifiers.Specifically,the classifier is regarded as evidence,and the weight of classifier is dynamically formed by the Euclidean distance between evidence and the difference in judg-ments of different categories within the evidence.The weight of a classifier is negative with the difference between the results of that classifier and those of all other classifiers,while it is positive with the discrepancy among the judgments of different categories within the classifier.Meanwhile,the accuracy of classifier is assumed to be reliability of the classifier,in order to produce better classification results.In order to verify the effectiveness and rationality of the proposed method,the automobile review data set crawled from the network is used for verification.The result shows that the multi-classifier fusion method based on ER rule can achieve better results in text sentiment classification than single classification algorithm,ensemble algorithm and deep learning algorithm.In addition,to reduce the influence of contingency and single data set,the results are verified using original data sets of hotel comments published in other fields under the same experi-mental conditions.The experimental comparison results show that the fusion method based on ER rules achieves the best results in F1 value and Accuracy index,and also performs well in Precision and Recall indexes.So this method can be well generalized and applied to text sentiment classification tasks in different fields.At the same time,ablation experiments are conducted on the proposed improved method in terms of feature models selection and feature weights calculation.The experimental results show the effectiveness of the improved method in text sentiment classification performance.In summary,the ER rule considers both the weight and reliability of each classifier to fuse multiple classifiers,and integrates the advantages of different classifiers.The method can effec-tively reduce the classification limitations caused by different types and topics of text.The final sentiment classifi-cation results are stable and balanced,which has a wider applicability in the practice of sentiment classification.
ER rulemulti-classifier fusionTFIDF weightdeep learning algorithmensemble learning algorithm