Technology Research Based on Large Models of Non-Transformer Architecture and Its Application Exploration
With the rapid development of artificial intelligence technology, large language models have had a significant impact on various industries. Transformer architecture large models have attracted extensive attention due to their unique advantages and potential, but they have also raised many questions and doubts. This paper first introduces the development of large models, as well as the current status and problems of Transformer architecture large models. It then focuses on the technical architecture and experimental results of the Yan architecture large model, which uses a non-Transformer architecture, with particular attention to its differences from and advantages over Transformer architecture models in terms of training effect, throughput, and computational resource consumption. In addition, the paper discusses the application of the Yan architecture large model to intelligent customer service for State Grid material suppliers, namely how optimizing the algorithm architecture can provide faster responses and more accurate semantic comprehension, and how it can improve the customer service experience. Finally, it looks ahead to the application prospects and development trends of non-Transformer architecture large models in intelligent customer service and other natural language processing fields, and points out their important role in promoting AI technology progress and service innovation.
large language model; non-Transformer architecture; intelligent customer service; Yan