农业垂直领域大语言模型构建流程和技术展望

扫码查看

原文链接

国家科技期刊平台
NETL
NSTL
万方数据

中文摘要：随着互联网的普及,农业知识和信息的获取变得更加便捷,但信息大多固定且通用,无法针对具体情况提供定制化的解决方案.在此背景下,大语言模型(Large Language Models,LLMs)作为一种高效的人工智能工具,逐渐在农业领域中获得关注和应用.目前,LLMs技术在农业领域大模型的相关综述中只是简单描述,并没有系统地介绍LLMs构建流程.本文重点介绍了农业垂直领域大语言模型构建流程,包括数据采集和预处理、选择适当的LLMs基模型、微调训练、检索增强生成(Retrieval Augmented Generation,RAG)技术、评估过程.以及介绍了LangChain框架在农业问答系统中的构建.最后,总结出当前构建农业垂直领域大语言模型的一些挑战,包括数据安全挑战、模型遗忘挑战和模型幻觉挑战,以及提出了未来农业垂直领域大语言的发展方向,包括多模态数据融合、强时效数据更新、多语言知识表达和微调成本优化,以进一步提高农业生产的智能化和现代化水平.

外文标题：Construction Process and Technological Prospects of Large Language Models in the Agricultural Vertical Domain

外文摘要：With the proliferation of the internet,accessing agricultural knowledge and information has become more convenient.However,this information is often static and generic,failing to provide tailored solutions for specific situations.To address this issue,vertical domain models in agriculture combine agricultural data with large language models(LLMs),utilizing natural language processing and semantic understanding technologies to provide real-time answers to agricultural questions and play a crucial role in agricultural decision-making and extension.This paper details the construction process of LLMs in the agricultural vertical domain,including data collection and preprocessing,selecting appropriate pre-trained LLM base models,fine-tuning training,Retrieval Augmented Generation(RAG),evaluation.The paper also discusses the application of the LangChain framework in agricultural Q&A systems.Finally,the paper summarizes some challenges in building LLMs for the agricultural vertical domain,including data security challenges,model forgetting challenges,and model hallucination challenges,and proposes future development directions for agricultural models,including the utilization of multimodal data,real-time data updates,the integration of multilingual knowledge,and optimization of fine-tuning costs to further promote the intelligence and modernization of agricultural production.

外文关键词：

LLMsRAGLangChainagricultural Q&A systems

作者：

张宇芹、朱景全、董薇、李富忠、郭雷风

展开 >

作者单位：

山西农业大学软件学院,山西晋中 030801

中国农业科学院农业信息研究所,北京 100081

全国农业技术推广服务中心,北京 100125

关键词：

大语言模型检索增强生成 LangChain 农业问答系统

基金：

科技创新2030——重大项目贵州省科技支撑项目

项目编号：

2021ZD0110901黔科合支撑[2023]一般189

出版年：

2024

DOI：

10.19788/j.issn.2096-6369.000052

农业大数据学报

中国农业科学院农业信息研究所

农业大数据学报

ISSN：2096-6369

年,卷(期)：2024.6(3)