Large Language Model-Generated Text Detection Based on Linguistic Feature Ensemble Learning
The rapid development of large language models (LLMs) has brought great convenience to daily life and work, but it has also created challenges for individuals and society. There is therefore an urgent need for detectors that can identify text generated by large language models. To achieve both strong detection performance and good generalization, this paper proposes EBF detection, a method for detecting LLM-generated text based on linguistic feature ensemble learning. EBF detection combines a fine-tuned pre-trained language model with higher-order natural language statistical features and uses a decision mechanism to classify text as LLM-generated or not. Experimental results show that EBF detection achieves an average detection accuracy of 98.72% on in-domain data and 96.79% on out-of-domain data.
large language model; LLM-generated text detection; ensemble learning; linguistic feature
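
The general idea of combining a model-based score with a statistical-feature score through a decision step can be illustrated with a minimal sketch. It is not the EBF implementation: the abstract does not specify the decision mechanism, so weighted soft voting is used here as a stand-in, and all names (`WeightedVoteDetector`, `constant_model_scorer`, `type_token_ratio_scorer`), the weights, and the threshold are hypothetical. The fine-tuned model is replaced by a fixed placeholder score, and the statistical feature is a toy lexical-diversity measure.

```python
from dataclasses import dataclass
from typing import Callable, Sequence

# Assumption: each component maps a text to an estimated P(LLM-generated) in [0, 1].
Scorer = Callable[[str], float]


@dataclass
class WeightedVoteDetector:
    """Weighted soft voting over component scorers (illustrative only)."""
    scorers: Sequence[Scorer]
    weights: Sequence[float]
    threshold: float = 0.5  # hypothetical decision threshold

    def score(self, text: str) -> float:
        # Weighted average of the component scores.
        total = sum(self.weights)
        weighted = sum(w * s(text) for w, s in zip(self.weights, self.scorers))
        return weighted / total

    def is_llm_generated(self, text: str) -> bool:
        return self.score(text) >= self.threshold


def constant_model_scorer(text: str) -> float:
    """Placeholder for a fine-tuned pre-trained language model classifier."""
    return 0.9


def type_token_ratio_scorer(text: str) -> float:
    """Toy statistical feature: lower lexical diversity gives a higher score."""
    tokens = text.lower().split()
    if not tokens:
        return 0.5  # no evidence either way for empty input
    return 1.0 - len(set(tokens)) / len(tokens)


detector = WeightedVoteDetector(
    scorers=[constant_model_scorer, type_token_ratio_scorer],
    weights=[0.5, 0.5],
)
print(detector.is_llm_generated("the the the the"))
```

In practice the placeholder scorer would be replaced by the probability output of a trained classifier, and the weights and threshold would be tuned on held-out data.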