网络与信息安全学报2024,Vol.10Issue(6) :151-163.DOI:10.11959/j.issn.2096-109x.2024087

基于知识图谱的隐私政策合规性检测与分析

Privacy policy compliance detection and analysis based on knowledge graph

张西珩 李昕 唐鹏 黄锐奇 何渊 邱卫东
网络与信息安全学报2024,Vol.10Issue(6) :151-163.DOI:10.11959/j.issn.2096-109x.2024087

基于知识图谱的隐私政策合规性检测与分析

Privacy policy compliance detection and analysis based on knowledge graph

张西珩 1李昕 1唐鹏 1黄锐奇 1何渊 1邱卫东1
扫码查看

作者信息

  • 1. 上海交通大学网络空间安全学院,上海 200210
  • 折叠

摘要

《中华人民共和国个人信息保护法》(以下简称"个保法")作为保护个人信息权益的重要法律,对个人信息处理者收集、存储、使用、分享等信息处理活动提出明确规范,并要求个人信息处理者在所提供服务的隐私政策中予以说明.这意味着任何公司在国内提供服务时,需首先提供符合"个保法"要求的隐私政策.为了实现自动分析面向"个保法"的隐私政策合规性,提出了基于知识图谱构建隐私政策合规性智能检测方法.首先,对"个保法"进行全面分析并构建相应的多级隐私政策知识图谱,涵盖需要在隐私政策中予以说明的信息保护相关概念.然后,构建半自动化的隐私政策收集方法并收集400份中文App的隐私政策,对其中100份基于知识图谱进行交叉标注后形成首个面向"个保法"的中文隐私政策语料库APPCP-100,使用剩余300份隐私政策构建中文概念分类器模型CPP-BERT,实现高效的隐私政策合规性智能检测.最后,应用知识图谱对隐私政策进行全面的合规性分析,结果显示当前中文App隐私政策对"个保法"中细粒度概念的合规性仍有待提高.

Abstract

The personal information protection law(PIPL)of the People's Republic of China served as an impor-tant legal framework for safeguarding personal information rights.It established clear regulations for personal infor-mation controllers in their activities involving the collecting,storing,using,and sharing of personal information.It also required that these controllers provide explanations within their privacy policies for the services they offered.This meant that any company providing services in China must first offer a privacy policy that complied with the re-quirements of the PIPL.Therefore,in order to analyze the compliance of privacy policies with respect to the PIPL,an intelligent method was presented for assessing privacy policy compliance based on a knowledge graph.First,a comprehensive analysis of the PIPL was conducted,and a multi-level privacy policy knowledge graph was pro-posed that covered concepts related to information protection that needed to be explained in privacy policies.Next,a semi-automated method was built for collecting privacy policies and collected the privacy policies of 400 Chinese Apps.100 policies were cross-annotated based on the knowledge graph,resulting in the creation of the first Chi-nese privacy policy corpus tailored to the PIPL called APPCP-100(APP-privacy-policy-corpus-for-PIPL-100).Us-ing this corpus,a Chinese concept classifier model CPP-BERT was constructed to achieve efficient detection of pri-vacy policy compliance.Finally,the knowledge graph was applied to conduct a comprehensive compliance analysis of privacy policies,and the results indicate that the current compliance of Chinese App privacy policies with the fine-grained concepts of the PIPL still needs improvement.

关键词

中华人民共和国个人信息保护法/隐私政策/隐私保护/合规性检测

Key words

personal information protection law of the People's Republic of China/privacy policy/privacy protec-tion/compliance check

引用本文复制引用

出版年

2024
网络与信息安全学报
人民邮电出版社

网络与信息安全学报

CSTPCD
ISSN:2096-109X
段落导航相关论文