Large Language Models Driven Framework for Multi-agent Military Requirement Generation

Li Jiahui (李嘉晖)¹, Zhang Mengmeng (张萌萌)¹, Chen Honghui (陈洪辉)¹


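Author information

1. National Key Laboratory of Information Systems Engineering, National University of Defense Technology, Changsha 410000, China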

Abstract

Military requirement generation for joint operations involves many participants and a heavy workload. The process relies largely on individual experience and on documents from multiple sources, which makes requirement generation inefficient and makes it difficult to support the design of joint operation systems effectively. With the development of large language models (LLMs), LLM-driven agents have shown excellent performance in many fields, and multi-agent systems can handle complex tasks efficiently by achieving group intelligence through distributed decision-making. To address the low efficiency of military requirement generation, a framework for military requirement generation based on an LLM-driven multi-agent system is proposed. The framework integrates a multi-modal information acquisition agent, military expert agents, a meeting moderator and other components. The multi-modal information acquisition agent integrates multi-modal information processing tools, can rapidly extract military requirements, and supports question-and-answer interaction with the user. The military expert agents use natural language dialogue to simulate human experts discussing and generating requirements. Driven by LLMs, these agents perceive the environment and autonomously call tools such as open paper repositories (for example, arXiv) and search engines to support the dialogue. The moderator receives instructions from the human user, refines them using LLMs, and generates dialogue prompts and a description of the problem background. Using the Russia-Ukraine conflict as the experimental background, military requirements are generated from relevant multi-modal information. The experimental results show that when the volume of multi-modal information is within the maximum processing capacity of the LLMs, the framework significantly reduces the time needed for military requirement generation, with time savings of 80% to 85% for video resources and 90% to 95% for audio resources.
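The division of roles described in the abstract can be pictured with a minimal Python sketch. It is illustrative only and is not the authors' implementation: the names `Moderator`, `ExpertAgent`, `run_discussion` and the placeholder model `echo_llm` are all hypothetical, the round-robin turn order is an assumption, and the multi-modal information acquisition agent and tool calls are omitted.

```python
from dataclasses import dataclass
from typing import Callable, List

# Hypothetical stand-in for an LLM call: takes a prompt, returns text.
LLM = Callable[[str], str]


def echo_llm(prompt: str) -> str:
    """Placeholder model (assumption): swap in a real LLM client here."""
    return f"[reply to {len(prompt)} chars]"


@dataclass
class Moderator:
    """Refines the user's instruction into a background and dialogue prompt."""
    llm: LLM

    def open_meeting(self, instruction: str) -> str:
        return self.llm(
            f"Refine into a problem background and discussion prompt: {instruction}"
        )


@dataclass
class ExpertAgent:
    """One simulated expert that answers given the background and transcript."""
    name: str
    llm: LLM

    def speak(self, background: str, transcript: List[str]) -> str:
        history = "\n".join(transcript)
        prompt = f"{background}\n{history}\nGive your view as {self.name}."
        return f"{self.name}: {self.llm(prompt)}"


def run_discussion(
    instruction: str,
    moderator: Moderator,
    experts: List[ExpertAgent],
    rounds: int = 2,
) -> List[str]:
    """Round-robin dialogue (assumed turn order): each expert speaks once per round."""
    background = moderator.open_meeting(instruction)
    transcript: List[str] = []
    for _ in range(rounds):
        for expert in experts:
            transcript.append(expert.speak(background, transcript))
    return transcript
```

Calling `run_discussion("Derive air-defence requirements", Moderator(echo_llm), [ExpertAgent("A", echo_llm), ExpertAgent("B", echo_llm)])` returns one transcript line per expert per round. In a fuller version the transcript would then be summarized into requirement statements.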

Key words

Requirement generation / Multi-agent / Generative AI / Large language models (LLMs) / Multi-modal


Publication year

2025

Computer Science (计算机科学)

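Chongqing Southwest Information Co., Ltd. (formerly the Southwest Information Center of the Ministry of Science and Technology)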


Peking University Core Journal (北大核心)

Impact factor: 0.944

ISSN: 1002-137X
