工程管理学报2024,Vol.38Issue(5) :131-136.DOI:10.13991/j.cnki.jem.2024.05.022

建设工程事故文本的知识发现:以PPE类不安全行为为例

Knowledge Discovery of Construction Accident Texts:An Example of Unsafe Behavior Related to PPE

吴迪 贾心雨 韩博雯 张先锋 郭聖煜
工程管理学报2024,Vol.38Issue(5) :131-136.DOI:10.13991/j.cnki.jem.2024.05.022

建设工程事故文本的知识发现:以PPE类不安全行为为例

Knowledge Discovery of Construction Accident Texts:An Example of Unsafe Behavior Related to PPE

吴迪 1贾心雨 1韩博雯 1张先锋 1郭聖煜1
扫码查看

作者信息

  • 1. 中国地质大学(武汉) 经济管理学院,湖北 武汉 430074
  • 折叠

摘要

为了丰富建设工程领域的安全知识,从事故文本中挖掘和发现施工人员的不安全行为,以个人防护用品 PPE 类不安全行为为例,采用基于规则的自然语言处理方法,从事故文本中自动抽取此类不安全行为.从政府官网等收集 195份建设工程事故调查报告作为文本挖掘语料,通过哈尔滨工业大学的语言技术平台 LTP 展开词法分析和依存句法分析,构建 PPE类不安全行为的 11 条抽取规则并确定抽取流程.再以网络爬虫收集的 427 份事故调查报告展开实例应用,按照流程自动抽取PPE类不安全行为.结果表明:平均抽取准确率为 94.70%,召回率为 67.57%.研究能够为建设工程事故文本的知识发现提供理论启示和实践路径.

Abstract

To enrich the safety knowledge in the field of construction industry,the unsafe behaviors of workers are mined and discovered from accident texts.This paper took the unsafe behavior related to personal protective equipment(PPE)as an example,and this kind of unsafe behavior was automatically extracted from accident texts using a rule-based natural language processing method.195 construction accident investigation reports were collected from government websites to form a text mining corpus.The lexical analysis and dependency parsing of the texts were carried out through the Language Technology Platform(LTP)of Harbin Institute of Technology.The 11 extraction rules of the unsafe behavior related to PPE were constructed and the extraction process is determined.Then,another 427 construction accident investigation reports collected by web crawlers were used as examples to automatically extract the unsafe behavior related to PPE according to the extraction process.The results show that the average extraction accuracy is 94.70%and recall rate is 67.57%.The study can provide a theoretical inspiration and practical path for knowledge discovery in construction accident texts.

关键词

知识发现/事故文本/PPE类不安全行为/自然语言处理

Key words

knowledge discovery/accident texts/PPE related unsafe behaviors/natural language processing

引用本文复制引用

基金项目

国家社会科学基金重点项目(23AZD072)

知识创新专项-曙光计划项目(2022010801020217)

中央高校基本科研业务费专项资金资助项目(CUG2642022006)

中国地质大学(武汉)教学实验室开放基金资助项目(SKJ2023180)

出版年

2024
工程管理学报
哈尔滨工业大学 中国建筑业协会管理现代化专业委员会

工程管理学报

CSTPCDCHSSCD
影响因子:1.613
ISSN:1674-8859
参考文献量5
段落导航相关论文