首页|基于非Transformer架构大模型的技术研究及应用探索

基于非Transformer架构大模型的技术研究及应用探索

扫码查看
随着人工智能技术的快速发展,大语言模型对各行各业产生了重大的影响,Transformer架构大模型因其独特的优势和潜力受到广泛关注,但也存在很多问题和疑惑.本文首先介绍了大模型的发展脉络,以及Transformer架构大模型的现状和问题,接着介绍了非Transformer架构的Yan架构大模型的技术架构和实验效果,特别是在训练效果、吞吐量、计算资源消耗等方面与Transformer架构模型的差异和优势.此外,探讨了 Yan架构大模型在某省级电网公司供应商智能客服领域的应用建设,即如何通过优化算法架构提供更快的响应速度和更准确的语义理解能力,以及如何优化供应商服务体验.最后,展望了非Transformer架构大模型在未来智能客服及其他自然语言处理领域的应用前景和发展趋势,指出其在推动AI技术进步和服务创新中的重要作用.
Technology Research Based on Large Models of Non-transformer Architecture and its Application Exploration
With the rapid development of artificial intelligence technology,large language models have a significant impact on various industries,and Transformer architecture large models attracts extensive attention due to their unique advantages and potential,but there are also many questions and doubts.This paper firstly introduces the development of large models,as well as the current situation and problems of transformer architecture large models,and then focuses on the technical architecture and experimental results of the Yan architecture large model with non-transformer architecture,especially on the differences and advantages between the Yan architecture model and the transformer architecture model in terms of training effect,throughput,and computational resource consumption.In addition,the application of the Yan architecture large model in the field of intelligent customer service of State Grid material suppliers is discussed,that is,how to optimize the algorithm architecture to provide faster response speed and more accurate semantic comprehension ability,and how to optimize the customer service experience.Finally,the application prospects and development trends of non-transformer architecture large models in the future in the field of intelligent customer service and other natural language processing fields are prospected,and their important role in promoting AI technology progress and service innovation is pointed out.

large language modelnon-transformer architectureintelligent customer serviceYan

赵明江、刘艳梅、杨婧一、张星奎、贾占宇

展开 >

国网辽宁省电力有限公司物资部,辽宁沈阳 110000

国网辽宁省电力有限公司物资分公司,辽宁 沈阳 110000

国网辽宁省电力有限公司铁岭供电公司,辽宁铁岭 112000

大语言模型 非Transformer架构 Yan 智能客服

2024

电力大数据
贵州电力试验研究院 贵州省电机工程学会

电力大数据

影响因子:0.047
ISSN:2096-4633
年,卷(期):2024.27(6)
  • 20