Data Risks and Legal Regulations for Generating Artificial Intelligence:Taking ChatGPT as an Example
As a representative of the generation of artificial intelligence technology,ChatGPT needs to obtain intelligent conclusions and make trend predictions on the basis of large-scale data refinement,analysis,and learning.Specifically,as a large-scale natural language processing model,ChatGPT involves four types of data in the training and operation process:pre-training data,manually labeled data,capture data,and human-computer interaction data.Different types of data face different legal risks and challenges in the application process.Based on data which is the basic raw material of AI and algorithm models,it is of urgent practical significance to carry out risk analysis,legal response,and technical regulation.In this regard,from the perspective of conforming to industrial development,balancing security and efficiency,China may consider clarifying the border of obligations,improving regulations,improving the legislative system,and forming a targeted standardized system with a cautious attitude in cooperation with the technological development trend in order to seize the opportunity to"overtake"in the era of digital economy with the help of AI technology.