Identification Methods of Artificial Intelligence Generated Content in Online Communities
[Purpose/significance]Generative artificial intelligence will cause a certain degree of AI information pollution in online communities.The various AIGC identification methods studied in this paper are of great significance to prevent the negative impact of rapidly evolving generated artificial intelligence.[Design/Methodology]This paper first constructed a HAC data set in multiple online community platforms with 54 major categories of topics of Sina Weibo,which contained 100,873 pieces of information written by humans and generated artificial intelligence respectively.Then it explored whether the current 6 kinds of mainstream deep learning and 7 kinds of machine learning methods can identify whether the information in the online community was written by human beings or generated by artificial intelligence.Finally,the BEM-RCNN method was proposed to further improve the recognition of AIGC precision.[Findings/Conclusion]From the perspective of constructed data set,it is found that generated artificial intelligence has a strong"human-like expression",which can simulate human beings to post and reply on social media platforms.The experimental results show that the method proposed in this paper has an accuracy of 96.4%,which can well identify whether the content on the online community is written by humans or AI.It is superior to the 13 other mainstream methods such as BERT,ERNIE,and TextRNN in terms of precision,recall rate,F1-value,and accuracy,verifying its performance advantages.At the same time,many exploratory experiments have also proved that although the current mainstream machine learning methods are less accurate than the method in this paper,they can also be competent for some AIGC recognition tasks.[Originality/Value]Multiple methods are used in this paper to identify AIGC on social media,and prevent information pollution caused by generative artificial intelligence on social media platforms.
Generative artificial intelligenceArtificial Intelligence Generated Content(AIGC)Online communitiesMachine learningAI information pollution