On Early Prediction of Patent Value by Fusing Semantic Features of Claims
[Research purpose]Investigating efficient methods to achieve early patent value prediction can not only increase the opportuni-ties for the subsequent cultivation of high-value patents,but also can produce early identification of low-value patents,such as questiona-ble patents,to lessen the depletion of public resources,which is essential to support the high-quality development of intellectual property.[Research method]An early prediction method of patent value by fusing semantic features of claims is proposed.The semantic features are initially extracted from the parsed text of patent claims.Then,an index system for early patent value prediction is built with an integra-tion of the semantic features of claims and then optimized with pearson correlation analysis.Finally,the CatBoost classification model and the SHAP method are combined to realize the early mining of high-value patents and the early identification of low-value patents.[Re-search conclusion]The results show that the early prediction model of patent value built in this study has a classification accuracy of 90.79%,which has the advantages of higher accuracy,better interpretation,and wider applicability compared with the conventional ma-chine learning methods.The semantic features of claims have a significant impact on patent value prediction,and the index system built by integrating this feature has some advantages.
patent valuepatent claimsemantic featuresprediction modelevaluation index systemCatBoostSHAPinterpretable machine learning