The Legal Orientation and Development of Generative Artificial Intelligence Data Training
Data training is central to the high-quality deployment of artificial intelligence applications. With the wide adoption of large generative AI model products, the data training process involves users' basic data, the behavioral trajectories of various entities, and complex shifts in the rights and interests of multiple entities, which may adversely affect market competition, enterprise innovation, and even national security. It is therefore necessary to ensure the legitimacy of data sources and improve the credibility of data quality, and to use the basic requirements of "legitimate data sources, credible data quality, data value release" to delineate the different stages of data training. In the data computation and application stage, attention should be paid to the contamination of training data and the abnormal operating data caused by deep synthesis technology. In the data opening and sharing stage, the risks of infringing personal information and intellectual property rights should be examined comprehensively and accurately; the laws of scientific and technological development should be followed; the relationship among technical accessibility, practical feasibility, and value legitimacy should be balanced; and room should be reserved for innovative development on the basis of consolidating safe development. To this end, the data training of generative AI should take security as its bottom line, promote innovative development while optimizing the collaborative regulatory framework and its methods, and take into account the legitimate rights and interests of multiple entities.
generative artificial intelligence; data training; data quality; safe development; innovative development; standardized development