Building Consumption Data Systems Driven by AI Plus Expert for Scientific and Technical Literature Information Resources
[Purpose/Significance]Limited by the constraints of traditional literature classification systems,scientific and technical literature information resources face problems such as inadequate disclosure and resource utilization.At the same time,high-quality user-generated data cannot yet be integrated as data elements into services related to scientific and technical literature services,which prevents these services from adapting to the context of the open science and meeting the diverse knowledge needs of readers.This study aims to harness the technological breakthrough potential of AI to build a consumer-end data system for scientific and technical literature information resources driven by AI and experts.This will help to overcome the shortcomings of traditional services,such as the lack of supporting reading information and low interactivity between users,with the hope of promoting the optimization process of scientific and technical literature information resource services.[Method/Process]First,the study analyzes the four-dimensional value representation of the consumer-end data systems for scientific and technical literature information resources,including the intrinsic value,the tool value,the academic value,and the future value of annotation data.Then,following the processing flow of consumer-end data,namely the collection phase,utilization phase,and management phase,the paper proposes principles for the construction of consumer-end data systems.Furthermore,the paper deconstructs and analyzes the risks associated with the involvement of AI in the construction of consumer-end data systems,including four types of risks:machine algorithm risks,annotation content risks,annotation data risks and application risks.Finally,based on the degree of AI involvement in data annotation work,three innovative models of AI plus expert collaborates with user to accomplish data annotation for scientific and technical literature information resources are designed:the AI plus expert-assisted data annotation model,the AI plus expert collaborative data annotation model,and the AI plus expert-led data annotation model.[Results/Conclusions]Under the AI plus expert-assisted data annotation model,AI acts as a tool to complete surface-level information processing based on rules set by experts to assist users in data annotation.In the AI plus expert collaborative data annotation model,AI completes the review of pre-annotation tags for scientific and technical literature information resources,transforming users from a self-generated tag mode to an AI-generated data tag evaluation and selection mode,with experts assisting in the final review of data tag quality.In the AI plus expert-led data annotation model,users provide data annotation requirements,experts guide the process,and data annotation is automatically completed by the AI4S platform.
scientific and technical literature information resourcesAIsystem constructiondata annotationpattern design