垂直领域大模型的定制化:理论基础与关键技术

Domain-Specific Foundation-Model Customization:Theoretical Foundation and Key Technology

陈浩泷 ¹陈罕之 ²韩凯峰 ³朱光旭 ¹赵奕晨 ⁴杜滢³

扫码查看

作者信息

1. 深圳市大数据研究院,深圳 518172;香港中文大学(深圳)理工学院,深圳 518172
2. 深圳市大数据研究院,深圳 518172
3. 中国信息通信研究院,北京 100191
4. 中国移动通信集团终端有限公司,北京 100033
折叠

摘要

随着 ChatGPT 等基于大模型的产品展现出强大的通用性能,学术界和工业界正积极探索如何将这些模型适配到特定行业和应用场景中,即进行垂直领域大模型的定制化.然而,现有的通用大模型可能无法完全适配特定领域数据的格式,或不足以捕捉该领域的独特需求.因此,本文旨在探讨垂直领域大模型定制化的方法论,包括大模型的定义和类别、通用架构的描述、大模型有效性背后的理论基础,以及几种可行的垂直领域大模型构建方法,期望通过这些内容为相关领域的研究者和从业者在垂直领域大模型定制化方面提供指导和参考.

Abstract

As ChatGPT and other foundation-model-based products demonstrate powerful general performance,both academia and industry are actively exploring how to adapt these models to specific industries and application scenarios,a process known as the customization of domain-specific foundation models.However,the existing general-purpose foundation models may not fully accommodate the patterns of domain-specific data or fail to capture the unique needs of the field.Therefore,this paper aims to discuss the methodology for customizing domain-specific foundation models,including the definition and types of foundation models,the description of their general architecture,the theoretical foundations behind the effectiveness of foundation models,and several feasible methods for constructing domain-specific foundation models.By presenting this content,we hope to provide guidance and reference for researchers and practitioners in the customization of domain-specific foundation models.

关键词

人工智能/垂直领域大模型/多模态大模型/预训练大模型/大模型微调

Key words

artificial intelligence/domain-specific foundation model/multimodal large model/pre-trained foundation model/fine tuning of large models

引用本文复制引用

基金项目

广东省基础与应用基础研究重大项目(2023B0303000001)

国家自然科学基金面上项目(62371313)

广东省基础与应用基础研究基金面上项目(2022A1515010109)

出版年

2024

数据采集与处理

中国电子学会中国仪器仪表学会信号处理学会　中国仪器仪表学会中国物理学会微弱信号检测学会　南京航空航天大学

数据采集与处理

CSTPCD北大核心

影响因子：0.679

ISSN：1004-9037

参考文献量9

段落导航