首页|基于LDA与共现网络动态分析的技术机会识别

基于LDA与共现网络动态分析的技术机会识别

Identification of technology opportunities based on the LDA model and co-occurrence network dynamic analysis

扫码查看
直面未来科技发展的高度不确定性以及技术复杂性不断叠加的外部环境,亟须采用科学高效的分析方法识别技术创新机会.为此,本文从专利文本挖掘与关键词共现网络融合的动态分析视角构建了技术机会识别路径.首先,进行专利数据的获取及预处理,进而运用LDA模型提取具象领域的技术主题及关键词,并引入TF-IDF指标进行重要度分析;其次,分别构建技术关键词整体共现网络和基于技术主题的包含时间窗口的关键词共现子网络;再次,基于共现强度分析及共现网络动态分析探究技术主题及关键词的演化过程,以此将关键词划分为持续型、衰退型、新兴型和舍弃型等4种类型,进而识别出具有发展潜力的技术机会;最后,以无人船为例,识别了动力技术主题、姿态测量技术主题和定位导航技术主题中路径规划、传感器等持续型的技术机会和激光雷达、惯性导航等新兴型的技术机会.本文不仅弥补了静态共现网络难以揭示技术领域动态演化过程的局限性,而且避免了在创新过程中掩盖或误判一些技术机会的问题,为企业高效识别技术创新机会提供了可资借鉴的决策参考依据.
The external environment for future technological development is characterized by high uncertainty and increasing tech-nological complexity.It is crucial to adopt a scientific and efficient analytical approach to identify technological innovation oppor-tunities.In the era of data science,scientific knowledge is experiencing explosive growth,making it increasingly difficult to eval-uate and predict technological trends.Previous studies relying on qualitative or static analysis is no longer sufficient to accurately identify technical opportunities.In order to make informed decisions about scientific development policies,mitigate investment risks,and accurately grasp the direction of scientific development,it is necessary to increase research efforts on the scientific knowledge network and attempt to mine potential knowledge through the co-occurrence network.Therefore,this paper has con-structed a path of technology opportunity identification from a dynamic analysis perspective that integrates patent text mining and keyword co-occurrence networks.Firstly,this paper obtained and preprocessed patent data,and then applied the LDA model to extract technology topics and keywords from the patent data of specific technology domains.In particularly,the LDA model operation was performed using the jieba word splitting tool in Python software and the scikit-learn library.This requires setting the parameters a and β separately and obtaining the optimal topic number K by calculating the perplexity.In the meanwhile,the TF-IDF indicator was also intro-duced for importance analysis.Secondly,the paper constructed the overall co-occurrence matrix of technology keywords and the co-occurrence matrix based on technology topics with time windows.Subsequently,the overall co-occurrence network of tech-nology keywords and the keyword co-occurrence sub-network based on technology topics were generated based on the co-oc-currence matrix.In particularly,this paper systematically investigated and judged the more important and influential technologies in a specific technology domain by constructing an overall co-occurrence network of technology keywords and calculating the co-occurrence intensity of technology keywords.In the meanwhile,the trends in the evolution of technical topics and keywords o-ver time were further analyzed through co-occurrence sub-networks.This requires the generation of a dynamically changing co-occurrence sub-network using the well-established keyword lexicon and the co-occurrence matrix containing time windows.It visualized the keywords and their linkage relationships across years in the same three-dimensional coordinate system.Thirdly,high-frequency technology keywords were identified based on the overall co-occurrence network,while the evo-lution process of technology topics and keywords were analyzed based on the co-occurrence sub-network.And then,technolo-gy keywords were classified into four types:sustained,declining,emerging,and abandoned.Among them,sustained keywords are those with high co-occurrence intensity and a stable upward trend over time;declining keywords are those with high co-oc-currence intensity but a decreasing trend over time;emerging keywords are those with low co-occurrence intensity but a stable upward trend over time;and abandoned'keywords are those with low co-occurrence intensity and a decreasing trend over time.In turn,sustained and emerging were identified from these four types as technology opportunities with growth potential.Finally,using unmanned ships as an example,the technology opportunities contained in the power supply technology topic,attitude measurement technology topic,and positioning and navigation technology topics were identified.For example,sustained technology opportunities such as path planning and sensors,emerging technology opportunities such as lidar and inertial naviga-tion were identified.This paper not only addressed the limitations of static co-occurrence networks in revealing the dynamic evo-lution process of technological domains,but also avoided the problem of covering up or misjudging some technology opportunities in the innovation process.This will provide a useful decision-making reference for companies to efficiently identify technological innovation opportunities.

technology opportunity identificationLDA modelco-occurrence networkdynamic analysisunmanned ship

王金凤、张芷芯、冯立杰、张珂

展开 >

郑州大学 管理学院,河南郑州 450001

上海海事大学 中国(上海)自贸区供应链研究院,上海 201306

上海海事大学 物流工程学院,上海 201306

郑州大学 信息管理学院,河南郑州 450001

展开 >

技术机会识别 LDA模型 共现网络 动态分析 无人船

国家科技部创新方法工作专项国家科技部创新方法工作专项NSFC—河南联合基金重点项目NSFC—河南联合基金重点项目上海市科技计划项目上海市科技计划项目国家重点研发专项国家重点研发专项河南兴文化工程文化研究专项项目河南兴文化工程文化研究专项项目河南省高校人文社会科学研究一般项目河南省高校人文社会科学研究一般项目

2019IM0202002019-2023U1904210-42020-2023200405013002020-20232022YFF06087002022-20262022XWH0822022-20242023-ZZJH-0392022-2024

2024

科研管理
中国科学学与科技政策研究会联合会和清华大学技术创新研究中心

科研管理

CSTPCDCSSCICHSSCD北大核心
影响因子:2.198
ISSN:1000-2995
年,卷(期):2024.45(2)
  • 36